
I build, write, showcase around zero-shot vision, multimodality, optimization and more (mostly transformers).
🤗 My Hugging Face profile has a lot of cool stuff and I also write blogs on everything cutting-edge over there.
🌱 smol-vision: notebooks, scripts and more on various zero-shot vision/multimodal model optimizations
🤖 smolagents: a lightweight library on agentic ML
Below are a couple of write-ups about some of the things I worked on at Hugging Face or illustrations of concepts related to ML ⬇️
🔖 SmolVLM - small vision LM 🔖 Even smaller SmolVLM - tiniest vision LM ever
🔖 Introducing smolagents 🔖 Vision LMs in smolagents
🔖 Vision Language Models Explained 🔖 Autoencoders Visualized
🔖 Introduction to Quantization 🔖 ConvNets Visualized
🔖 PaliGemma – Google's Cutting-Edge Open Vision Language Model and PaliGemma 2