Machine Learning Engineer @ LightOn
- LightOnOCR, a family of efficient 1B end-to-end OCR VLMs; v2 achieves SOTA on OlmOCR-Bench while being 9× smaller and up to 5× faster than competing approaches
- ModernBERT, contributed to architecture design, training and eval (ACL 2025)
- ArabicWeb24, a 39B token Arabic corpus for LLM training
- vit.cpp, a lightweight C++ inference engine for Vision Transformers using GGML
- Interested in Vision Language Models, Vision Transformers, LLM Pre-training, State-Space Models, Optimization, Code Generation, Efficient Inference, Quantization, GPU Kernels, Distributed Training, RL
- Engineering degree in maths and machine learning from École Centrale de Lyon
- Reach me: taghadouinisaid@gmail.com




