I am Shubham Kumar.
I build and explore machine learning systems with a focus on multilingual NLP, LLMs, and real-world AI applications.
-
Hugging Face Transformers
huggingface/transformers#9286
- Contributed to resolving a token alignment issue affecting multilingual NER pipelines
- Improved tokenizer behavior across languages
-
Hugging Face Model Hub
https://huggingface.co/zyberg2091/distilbert-base-multilingual-toxicity-classifier
- Published a multilingual toxicity classifier (DistilBERT)
- Supports English, Hindi, and Hinglish
- Focused on real-world content moderation use cases
-
TensorFlow Models (Community Contributions)
- Contributed to discussions on best practices for object detection on custom datasets
-
Documentation Contributions (TensorFlow, Rasa)
- Improved the developer experience with documentation fixes and clearer explanations

