Production ML & LLM systems
Go beyond notebooks—learn how retrieval, agents, and model APIs behave under load, cost limits, and real user traffic.
Vegrade is a principal-led team building production data science and AI systems for startups and growth companies. Interns and full-time engineers learn by delivering — mentorship, client-grade problems, and the discipline of evals, observability, and handoff.
In the form or email, note Internship or Full-time — we route careers separately from project inquiries.
6–12
Week typical client programs
Fast cycles, real shipping
1:1
Mentor ratio on intern cohorts
Staff engineer pairing
100%
Production-minded delivery
Evals, observability, handoff
Outcomes
Whether you join for a summer or a career, the goal is durable skill — not resume keywords. These are the capabilities people build here.
Go beyond notebooks—learn how retrieval, agents, and model APIs behave under load, cost limits, and real user traffic.
Frame problems with metrics, build reproducible experiments, and understand when classical ML beats generative approaches.
Write code your teammates can maintain: typed APIs, tests where they matter, CI gates, and architecture notes that survive handoff.
Practice the skills consultants and staff engineers use daily—scoped updates, risk flags, and demos that tie to business outcomes.
Programs
Both paths emphasize mentorship, written communication, and production discipline. Headcount stays limited so every person gets meaningful scope.
Applied AI engineering
A bounded project with a named mentor—design through demo week on a stack used on real client work, not a toy repo in isolation.
Typical outcomes
Good fit if: Students and early-career builders with solid CS fundamentals and curiosity about applied ML/AI.
Mid-level and senior
Own workstreams across data, models, and product surfaces—partnering with principals and client engineering leaders with minimal hierarchy.
Typical outcomes
Good fit if: Engineers who have shipped something real and can explain why they made hard technical choices.
Roles
Two open internships — Vachai AI research and growth — plus delivery roles coming later this year. If your profile is close, reach out even if the title doesn't match exactly.
Project: Vachai AI — dialect-aware Telugu foundational model
Join the team building Vachai AI—a dialect-aware Telugu foundational small language model (SLM). You will help source diverse Telugu speech and text, engineer pipelines that turn raw data into training-ready corpora, and learn how those datasets feed pre-training and fine-tuning.
What you'll work on
What you'll learn
Join the growth team to promote Vegrade's AI engineering services and generate qualified leads. You will work directly with the founders on outbound experiments, content for technical buyers, and the early sales motion—learning how a principal-led services company builds a pipeline from zero.
What you'll work on
What you'll learn
Build and harden models, retrieval stacks, and eval harnesses on client programs.
Ship APIs, web surfaces, and AI features with clear boundaries between product and model layers.
Own pipelines, curated tables, and feature readiness for modeling teams.
Life at Vegrade
We're not a 500-person services factory. This is a focused engineering firm where how you work matters as much as what you ship.
Core collaboration windows align with India business hours (IST), with flexibility when US or EU clients need overlap. We protect blocks for deep work—fewer standing meetings than enterprise IT shops.
Team members work from across India with intentional in-person syncs when it helps (kickoffs, design reviews). Success is measured by delivery and clarity, not hours logged in a specific office.
You touch the same patterns we use on engagements: hybrid retrieval, policy-bound agents, MLOps gates, and procurement-friendly documentation—not disconnected take-home puzzles.
We ask questions early, document assumptions, and review work in the open. Standards are high, but the goal is learning and shipping—not performative hustle.
Headcount stays deliberately limited. Your commits, designs, and client readouts are seen by leadership—not buried under six layers of management.
We teach what we build. Interns pair with staff engineers; full-time hires co-own runbooks and KT sessions so knowledge moves across the team.
What we look for
Degrees help; shipped work and clear thinking matter more. We hire for judgment, communication, and reliability.
Strong fundamentals in TypeScript and/or Python — readable code and tests where they matter.
Comfort with ambiguity — you document assumptions and risks instead of hiding unknowns.
Genuine interest in data science, ML, LLM systems, agents, or retrieval — not slide decks about them.
Professional written English for client updates, RFCs, and runbooks.
For interns: a contiguous 10–14 week window. For full-time: core overlap with IST, flexibility for global client calls.
Hiring process
Résumé or portfolio, role preference (intern vs full-time), and two sentences on what you want to learn or own.
System design or code discussion at the level of the role—focused on tradeoffs, not trivia.
Short, bounded exercise for select candidates—only when mutually agreed, with clear time box.
Written compensation, start date, mentor assignment, and expectations for your first 30 days.
Vegrade is an equal opportunity employer. We consider qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other characteristic protected by law. If you need a reasonable accommodation during the process, mention it in your first email.
FAQ
Ready to apply?
Drop an email with your résumé, the role you're interested in, and two sentences on what you want to learn or own. We read every note.