Software Engineer
Software Engineer with deep expertise (8+ years) in LLMs, NLP, and Python-based AI application development. Skilled in building intelligent systems like NLP search platforms, Retrieval-Augmented Generation (RAG) systems, AI Agents and Chatbots. Experienced in developing end-to-end solutions using FastAPI/Flask, integrating vector databases, and deploying models on Cloud. Looking to apply cutting-edge AI/ML techniques in a dynamic and impactful engineering team.
I'm an Software Engineer with over 8 years of experience building intelligent systems and scalable AI applications. With a Master's degree in Computer Science, I specialize in LLMs, NLP, and creating robust, cloud-native AI solutions that combine powerful backend systems with intuitive user interfaces. My expertise spans Python-based AI development, RAG systems, AI agents, and modern ML frameworks.
Throughout my career, I've worked on diverse AI/ML projects from NLP search platforms for medical databases to recommendation engines, RAG-based conversational AI, and deep learning model deployment. I'm comfortable working across the full AI/ML stack, delivering end-to-end solutions from model training and optimization to production deployment and monitoring. My approach focuses on applying cutting-edge AI/ML techniques while leveraging modern cloud infrastructure and DevOps practices.
The Right Contact
Enso Consulting
Commune
Aimesoft
Nilai University
Nilai University
Developed a natural language processing (NLP) search platform enabling clinicians to query complex medical databases using plain English. Engineered transformer-based model to translate NL queries into optimized SQL, with image-based search capabilities and ML-powered personalization. Deployed on scalable AWS microservices.
Developed a full-stack application enabling users to engage in conversational interactions with multiple documents utilizing a novel RAG pipeline integrating LangChain and LlamaIndex. Implemented state-of-the-art RAG pipeline with PostgreSQL and ChromaDB, built scalable FastAPI backend, and developed user-friendly Svelte frontend based on Figma designs.
Designed and implemented a machine learning system that recognizes and extracts meaningful information from accounting documents using deep learning algorithms. Built highly scalable distributed backend services using Python, FastAPI, Celery, Redis, and MongoDB for real-time processing. Implemented detection and classification of important objects like QR codes, tables, headings, logos, and signatures.
Designed and trained deep learning models using PyTorch and TensorFlow for classification, regression, and sequence modeling across NLP and computer vision domains. Deployed models to production using ONNX, TorchScript, and TensorFlow Serving, integrated into cloud-native inference pipelines.
Built and evaluated traditional ML pipelines using scikit-learn, including feature engineering, model selection, and hyperparameter tuning for structured data problems. Developed end-to-end model training workflows with custom loss functions, callbacks, and early stopping for optimal performance and generalization.
Specialized in model optimization, data preprocessing, and scalable training/inference strategies for real-time and batch prediction use cases. Deployed models to production environments and integrated them into cloud-native inference pipelines with focus on performance and reliability.
Download my complete CV for detailed experience, education, certifications, and more insights into my professional journey.
Download CVI'm always open to discussing new projects, creative ideas, or opportunities to be part of your vision. Feel free to reach out!