
IMRAN ULLAH
Building intelligent AI systems with NLP and Vision
Skills



Work experience
Machine Learning Engineer
Upwork • Full-time
Feb 2024 - Nov 2025 • 1 yr 9 mos
At Grinda AI, I worked remotely as a Machine Learning Engineer. My primary role was leading the end-to-end lifecycle of large language models tailored for high-throughput banking environments. Because financial institutions require strict data privacy and cost efficiency, I led the technical initiatives to fine-tune and deploy multi-billion-parameter models entirely on-premise.

A major part of my job was solving data scarcity. For a Korean banking client, I generated a custom synthetic dataset of over one million samples using GPT-4 and Claude. I used this synthetic data to fine-tune the Qwen2.5-32B model with QLoRA on multi-GPU clusters using DDP and FSDP.

Beyond model training, I was responsible for production inference optimization. I deployed the fine-tuned financial models using vLLM and SGLang, and engineered the on-premise infrastructure to handle over 4,000 concurrent requests while keeping GPU memory usage tightly optimized. I also designed evaluation pipelines using Ragas and custom frameworks to continuously benchmark our models for accuracy, latency, and financial-domain compliance.

My role also expanded into low-resource speech AI. I fine-tuned OpenAI Whisper models for the Kazakh language, reaching a 25 percent Word Error Rate and significantly outperforming the baseline models on audio transcription.