External Resources for ML
Links, blogs, courses, people, etc. in the fields of AI, ML, CV, NLP, etc. that I want to keep at one place (for my reference).
Websites
Learning ML
Daily papers
PapersWithCode (also see other portals)
Scholar Inbox: More personalized list of papers daily
AIModels.fyi blog notes (paper summaries)
Paper search
Conferences
AI Deadlines: A PapersWithCode Repository that keeps track of upcoming conferences in AI related subject areas
AI Summaries
Warning
AI generated summaries might not always be factual (or have high utility).
Bing Chat on Microsoft Edge: Open PDF in Edge, click Bing Chat, ask it questions about the PDF
Blogs
The AISummer: Getting started with AI and many concepts pages as articles
See page on attention, NLP transformers, and ViT transformer
See page on NeRF and InstantNGP
See page on Diffusion models
-
See overview of LLM powered autonomous agents
Podcasts
Projects
Interesting projects in the wild
HuggingFace Projects Documentation: A collection of awesome community projects
Transformers: Different transformer implementations
timm: SOTA computer vision implementations
Hub: Models, datasets, etc. at one place
Tokenizers: Tokenizers for production and research
Diffusers: Diffusion algorithms
Kornia: Computer vision algorithms (AI centric)
PyG - PyTorch Geometric: Geometric deep learning on PyTorch
PyTorch Points 3D: Implementation of various 3D algorithms in one repository
PyTorch Metric Learning: Implementation of distance-based data mining, losses, trainers, testers, etc. for metric learning
Numba: JIT compilation for making python code faster (even has CUDA acceleration and parallel for loops)
RAPIDS Ecosystem: Many libraries for accelerated pandas dataframe, Scikit-learn like API, analytics, clustering, solvers, etc.
HMMLearn: Unsupervised learning and inference of Hidden Markov Models
XGBoost: Optimized gradient boosting library that is efficient, flexible, and portable
LightGBM: Gradient boosting framework that uses tree-based learning algorithms
Ray: Scaling AI workloads (data, training, hyperparameter tuning, RL, serving, etc.). From AnyScale Platform team.
Projects on large language models
Lightning-AI/lit-gpt: Implementation of SOTA open-source LLMs with quantization and LoRA like enhancements
LLaMA Index: LLMs on your own data
LangChain: Build applications powered by language models
tiktoken: OpenAI’s BPE tokenizer
Guardrails: Adding guardrails to LLM outputs (prompting). Also see this blog on inspecting the under-the-hood prompting.
Collection of interesting research in the wild
Microsoft’s microsoft/unilm: Collection of Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities. Also see their website.
Google’s massive google-research repository: contains all publicly released code by Google Research.
Facebook’s Detectron2 repository: contains SOTA detection and segmentation algorithms. Also see their xformers repository for transformer optimization hacks.
Google’s Scenic Library: has many computer-vision research tools (coded in JAX).
spaCy: NLP tool
ESPnet: End-to-End Speech Processing Toolkit. Also see docs.
Acme: RL components and agents by Google DeepMind
Some PyTorch frameworks for (numerical) optimizers: torchimize (API), pytorch-minimize
Collection of parallel (distributed/multi-GPU and node) training resources
HF Accelerate: Use PyTorch models on any device and distributed configuration (by HuggingFace)
Horovod: Distributed DL framework for Tensorflow, PyTorch, and Keras
LambdaLabs Blog: Distributed training guide for PyTorch (using MPI)
Some more interesting ML projects in the wild
Teachable Machine (GitHub): Train a model to recognize images, sounds, and poses online and export TensorFlow model.
Quick Draw: Dataset and model to recognize hand drawings (doodling dataset).
ONNX Runtime: Deploy an ONNX model on multiple platforms (GitHub)
MLHub CLI: Command line framework for various ML models (not related to this project)
AutoML: Neural architecture search (NAS) and hyperparameter selection/optimization
Radiant Earth: Earth observation data (geo-spatial informatics)
ICESat-2: Ice, Cloud and land Elevation Satellite-2 (geo-spatial informatics)
Startups
ArtPark Ignite: Venture-building program for AI and Robotics from ARTPARK@IISc
Books
Courses
AI and Machine Learning
Stanford CS229 - Machine Learning - Prof. Anand Avati
Stanford’s Machine Learning course. There are five modules; supervised learning: linear and logistic regression, classification, linear models, generative learning, kernel methods, and support vector machines (SVMs); deep learning: neural networks and back propagation; generalisation and regularisation: complexity bounds and model selection; unsupervised learning: clustering, expectation maximisation (EM) algorithms (ELBO), VAEs, PCA, Independent Component Analysis, self-supervised learning (SSL) and foundation models; reinforcement learning: decision processes, policies, linear quadratic regulation (LQR), differential dynamic programming (DDP), linear quadratic gaussians (LQG), policy gradients. Main course design by Andrew Ng.
Related:
Links: Website (SEE Page, Stanford page), CS229 Fall 2023-24 Syllabus, Course Notes by Andrew Ng, YouTube Playlist - Spring 2023, YouTube Playlist - Autumn 2018
-
Deep learning course at NYU from Yann LeCun and Alfredo Canziani
Links: Course Docs - Spring 2020 (major release, other didactics), YouTube Playlist - Spring 2020, GitHub - Spring 2021
Stanford CS231n - Deep Learning for Computer VIsion - Fei Fei Li
Links: YouTube Playlist, Course website
CMU - 16-825 - Learning for 3D Vision - Spring 2023
Cornell Tech CS 5785 - Applied Machine Learning
MIT 6.S192: Deep Learning for Art, Aesthetics, and Creativity
Links: YouTube Playlist
MIT - Introduction to Deep Learning
Links: YouTube Playlist
Stanford CS25 - Transformers United
Links: YouTube Playlist - Cases
UC Berkeley - Full Stack Deep Learning
University of Tubingen - Statistical Machine Learning - Summer 2020
Links: YouTube Playlist
University of Tubingen - Introduction to Machine Learning - Winter 2020/21
Links: Dmitry Kobak’s Blog - Slides
UC Berkeley - CS294-158-SP20 - Deep Unsupervised Learning Spring 2020
Links: YouTube Playlist
Michigan - EECS 498.008 / 598.008 - Deep Learning for Computer Vision - Winter 2022
Links: YouTube Playlist
Geometric Deep Learning
UvA - An Introduction to Group Equivariant Deep Learning
Part of Geometric Deep Learning series from University of Amsterdam. Contains lecture videos on group theory, steerable group convolutions, and equivariant graph neural networks. Also has Colab assignments.
UPenn - Graph Neural Networks - ESE 5140
Reinforcement Learning
Computer Vision
University of Tubingen - Computer Vision - Prof. Dr. Andreas Geiger
Introduction and history of computer vision. Photogrammetry, image sensing pipeline, structure-from-motion, bundle adjustment, stereo reconstruction, probabilistic graphical models, optical flow, shape from shading, stereo, coordinate based networks, image recognition, semantic segmentation, object detection, self-supervised learning, and other advanced topics (compositional models, human body models, deepfakes, etc.). University of Tubingen Computer Vision course by Prof. Dr. Andreas Geiger.
Links: YouTube Playlist, Public Material: Slides and exercises
Natural Language Processing
CMU - CS 11-737 Multilingual NLP - Spring 2022
Links: YouTube Playlist
CMU - CS 11-711 - Advanced NLP - Fall 2022
Links: YouTube Playlist
Stanford CS224U: Natural Language Understanding
Links: GitHub, YouTube Playlist
UMass - CS685 - Advanced Natural Language Processing - Spring 2023
Links: YouTube Playlist - Fall 2020
YouTube Playlists
3Blue1Brown Course - Neural Networks: Awesome theory and explanation, from basic neural networks and back propagation to GPT-3 (transformers and tokenizers).
HuggingFace Course YouTube Playlist
Communities
Some communities you can follow
ML Collective: ML research opportunities, collaboration, and mentorship
People
Geoffrey E. Hinton, Yann LeCun, and Yoshua Bengio: Founders of modern deep learning (received the turing award for it in 2018)
Jurgen Schmidhuber (IDSAI, Swiss): LSTM
Jitendra Malik (UC Berkeley, Meta): Computer vision and AI
Leonidas J Guibas (Stanford): 3D computer vision backbones (PointNet).
Abhinav Gupta (CMU RI): Computer Vision and AI
Sergey Levine (UC Berkeley): Reinforcement Learning for Robotics
Dhruv Batra (Georgia Tech, Meta): Embodied AI Agents, Robotics
Michael Bronstein (CS Univ. of Oxford): Geometric deep learning and graph neural networks.
Max Welling (Qualcomm UvA): VAEs, graph CNNs
Luca Carlone (MIT): SPARK Lab; SLAM and robust perception.
Saurabh Gupta (UIUC, Meta): Computer vision, robotics, and AI
Follow these folks on social media (for new research)
Dmytro Mishkin: Kornia (CV+AI framework), tweets papers
Phil Wang a.k.a. Lucidrains: Open source contributions on GitHub
Ahsen Khaliq a.k.a. AK a.k.a. akhaliq: Tweets and HuggingFace papers, Gradio
Aran Komatsuzaki: Tweets papers, LAION and EleutherAI
Mike Young: Paper summaries
Ryohei Sasaki: Research on autonomous driving (LiDAR)
Dr Ronald Clark (CS, Oxford): Real time SLAM, bundle adjustment, scene understanding, and motion tracking
Devendra Singh Chaplot (CMU, FAIR): Visual navigation, object goal navigation, exploration, embodied AI
Dhruv Shah (UC Berkeley): Robotics & AI