OPEN TO OPPORTUNITIES

AMAN
KUMAR

ML ENGINEER DATA SCIENTIST

I build production AI systems — agentic LLM pipelines, statistical forecasting models, and MLOps infrastructure that survives contact with real data. Currently shipping at SanDisk India.

R² 0.91 timing prediction model
24h → 47min analysis turnaround
Sharpe 0.937 macro strategy vs SPY 0.760

WHO I AM

Aman Kumar

ML Engineer and Data Scientist with 2+ years building statistical models, time-series forecasting systems, and production ML pipelines across the semiconductor and fintech domains.

I work across the stack — feature engineering and hypothesis testing on one end, agentic LLM systems and MLOps deployment on the other. I build things that work and can be explained clearly to the people who use them.

LOCATION Bengaluru, Karnataka, India
EDUCATION MTech Integrated Data Science — VIT Vellore · CGPA 8.1

EXPERIENCE

SanDisk India

Machine Learning Engineer · via Magnit Global

SEPT 2025 — PRESENT CURRENT
  • Built SwiftECO — BiLSTM + Multi-Head Attention model for multi-corner STA timing prediction with asymmetric loss and cross-design transfer learning: R² = 0.91, MAE ~6.7 ps, cutting analysis turnaround from 24+ hours to 47 minutes
  • Built NLP/RAG pipeline over 2000+ pages of technical documentation (LangChain, FAISS, AWS Bedrock) — natural-language querying over engineering knowledge bases
  • Designed scalable pipelines for 10+ GB PrimeTime reports — automated ETL, statistical profiling, and MLflow-tracked validation for reproducible model governance
  • Shipped RTL Lint & Stream IP agents automating Verilog cleanup with designer-in-the-loop review; improved STA timing consistency by 55%
PyTorch LangChain FAISS AWS Bedrock MLflow

Hyperbots

Applied ML Engineer

JUN 2025 — SEPT 2025
  • Built anomaly detection & classification system on operational invoice data — A/B validated, with a Kafka real-time feedback loop; reduced manual review by 40%
  • Maintained production ML services — MLflow tracking, automated drift detection, CI/CD with Docker, Jenkins & Kubernetes on AWS/GCP; >99% uptime
  • Developed monitoring dashboards (Power BI + React) surfacing model KPIs, anomaly trends, and pipeline health for business stakeholders
Kafka MLflow Docker React GCP
SEP 2024 — JUN 2025
  • Developed ML/DL models (scikit-learn, PyTorch) for timing prediction across semiconductor design workflows — 55% improvement in STA timing consistency
  • Geospatial data analysis on GCP — Parquet columnar datasets, spatial feature engineering; automated IR drop & PrimeTime report analysis in Python
scikit-learn GCP GeoPandas Parquet

PROJECTS

Talk to DB

Talk to DB

Natural language to SQL — production-grade agentic service with dual-layer security: sqlglot AST firewall blocks non-SELECT statements + enforced read-only DB sessions at driver level.

FastAPI Anthropic SDK sqlglot SQLAlchemy
Market Analysis Wizard

Market Analysis Wizard

Multi-agent market research pipeline — LangGraph DAG with parallel Competitor Intel + Market Sizing nodes, SSE streaming, and Tavily web search. Generates structured SWOT reports in real time.

LangGraph FastAPI Tavily Claude
AmanERP

AmanERP

Mobile-first ERP with full CRUD across Inventory, Sales, HR, and Finance modules. localStorage JSON persistence, AI insights panel via Claude API. Industrial neo-brutalist UI.

React localStorage Anthropic API
Anushree Vastralaya

Anushree Vastralaya

Offline-first digital khata app for a saree shop — credit sales, partial payments, customer ledger. IndexedDB, camera capture, 6-month dashboard, Hindi/English toggle. Zero backend.

IndexedDB PWA Vanilla JS
Know Your Website

Know Your Website

Autonomous web security audit agent — submits a URL, runs 7 parallel inspection modules, streams results live via SSE, and synthesises a structured threat report using an LLM. FastAPI backend, React frontend.

FastAPI React scikit-learn Security

TECHNICAL SKILLS

LANGUAGES

PythonSQLJavaTcl

STATISTICAL MODELING & FORECASTING

Time SeriesARIMAHMMGaussian ProcessesStatsmodelsHypothesis TestingA/B Testing

ML / DEEP LEARNING

PyTorchTensorFlowscikit-learnXGBoostHugging FaceDrift Detection

GENAI & AGENTS

LLMsRAGLangChainLangGraphMCPFAISSChromaDBAWS Bedrock

MLOPS & CLOUD

DockerKubernetesJenkinsMLflowAWSGCPCI/CD

EXPLAINABILITY & ANALYTICS

SHAPLIMEFeature ImportancePandasNumPySciPyPower BI

BACKEND & DATA

FastAPIFlaskPostgreSQLKafkaRedisReactStreamlitParquet

EDUCATION

Army Public School, Ranchi

CBSE XII

COMPLETED 2020 · 93.8%

ACHIEVEMENTS

GET IN TOUCH

Open to ML engineering roles, data science positions, and interesting collaborations. I read every email.

amankumar24052001@gmail.com

+91 9065939752  ·  Bengaluru, Karnataka, India