Irfan Ali — Senior AI Engineer

Senior AI engineer building LLMs, RAG, and agents in production.

I help AI and product teams build the systems behind their products — RAG that returns the right answers, voice features that feel natural, agent pipelines that hold up under real use.

Open to full-time senior AI engineer roles — remote, based in India, working with global teams.

Seven-plus years building AI and data systems. Previously founding AI engineer at Kuration AI in Hong Kong, and the first AI hire at Schneider Electric in India, reporting to the CTO.

Download résumé

Get in touch

View open source

datacortex.summary

$datacortex --info

WHOIrfan Ali

WHATLLMs · RAG · Voice

YEARS7+

LIBSPyPI

LOCRemote / India

Open to full-time roles

ProductsStacksiftB2B SaaS ReflectaVoice AI AgentEnsembleAgents ragfallbackRAGConnect

LinkedIn GitHub PyPI

Explore

$ reflecta.live

Selected work — Reflecta

A wellness app you can call on the phone. Instead of typing into a journal app you'll never open, you call a number, talk to it for a few minutes, and it listens and helps you reflect. I designed and built the whole thing on my own — the AI, the phone system, the backend, the database.

Voice AI · Telephony · LLM analysis · Production

Watch demo

getreflecta.com

Voice-firstReal conversationMemory across callsReflection summaries

pypi.org/user/irfanalidv

$pypi list --user irfanalidv

> Open-source Python libraries I've published. Other engineers use these to build their own AI systems — they're free and on PyPI.

AgentEnsemble

Multi-agent orchestration. ReAct, Swarm, Pipeline, Debate, WorkflowGraph patterns. Routing, planning, RAG, cost tracking.

AgentCare

Voice AI for healthcare. Call intake, structured extraction, missing-data recovery, appointment orchestration, post-call analytics.

ragfallback

Stop RAG from failing silently. Query rewriting, retrieval confidence scoring, fallback strategies, retry logic.

RAGNav

Navigation-first RAG for long documents. Routes queries to the right pages, follows cross-references, retrieves coherent evidence.

scrapeflow-py

Playwright-based web scraping. LLM extraction, hybrid selectors, session persistence, rate limiting, anti-detection.

AskPandas

Natural-language queries on CSV data using local LLMs. No API keys, no cloud.

lingo-nlp-toolkit

Lightweight NLP utilities bridging traditional pipelines and transformer-ready workflows.

PyroChain

Agentic feature engineering. PyTorch + LangChain agents for multimodal feature extraction.

toxic-comment-classifier

Deep learning toxicity detection. Per-category scores for obscene language, threats, insults, identity hate.

Also: Stacksift — a B2B domain analyzer built on these patterns. stacksift.in →

View all on GitHub View all on PyPI

what-i-build

$what-i-build

What I build

Production LLM systems — the kind teams rely on after launch, not just in demos. Seven-plus years across startups and enterprise R&D.

RAG systems

Retrieval pipelines that return the right answers — query rewriting, confidence scoring, fallback strategies, and navigation-first retrieval for long documents.

Examples: ragfallback, RAGNav

Agent orchestration

Multi-agent pipelines with ReAct, swarm, pipeline, debate, and workflow patterns. Routing, planning, and cost tracking built for production.

Examples: AgentEnsemble

Voice AI

Phone and in-app voice features — call intake, structured extraction, post-call LLM analysis, and appointment orchestration.

Examples: Reflecta, AgentCare

Multi-provider LLM orchestration

Systems that route across providers, fall back when one fails, and stay reliable under real load — not just demo prompts.

Examples: Production patterns across shipped products

Hiring for a senior AI role? Let's talk.

Get in touch

experience

$resume --experience

> 7+ years building AI and data systems for startups and enterprise

ROLES

Independent / DataCortex IQ
Senior AI Engineer
Shipped Reflecta (voice-first AI wellness app) and Stacksift (B2B domain analyzer). Maintains open-source Python libraries on PyPI. Open to full-time senior AI engineering roles.
Kuration AI
Founding AI Engineer
First AI hire at an early-stage B2B startup. Owned the full AI architecture — agent pipelines, RAG, multi-provider LLM orchestration.
Luminous Power Technologies (Schneider Electric)
Senior Manager, Data & Analytics, R&D
Sole AI hire reporting to the CTO. Built the R&D data and AI function, hired and led a team of three.
Lynk
Data Analytics & Automation
Analytics and NLP-powered search for an expert-matching platform.
brainsfeed
Head of Data & Analytics
First data hire. Built the data team to 10+ and shipped an internal NLP-powered research platform.

BUILT_FOR

AI search and chatbots that need to be reliable
Voice AI on the phone or in apps
Multi-step AI agents that hold up in production
Data systems for startups and enterprise R&D

India · Previously Hong Kong

about

$whoami

> AI engineer · open-source maintainer on PyPI · two published papers

founder.png

Irfan AliBuilding AI that works in production

I_FOCUS_ON

Reliabilityoverhype

Systemsoverscripts

Maintainabilityovershort-term hacks

NAME

Irfan Ali

EDUCATION

M.Sc. Data Science & AI, IISER Tirupati · B.Tech CSE, Alliance University · Exchange semester at ISEP, Paris

BACKGROUND

Founding AI engineer at Kuration AI in Hong Kong. First AI hire at Schneider Electric in India, reporting to the CTO. Now building independently and open to full-time senior AI engineering roles.

-Open-source Python libraries on PyPI that other engineers use to build AI
-Built systems that switch between AI providers automatically when one fails
-Two peer-reviewed papers published in 2025

EXPERIENCE

Led data and AI teams across India and Hong Kong
Built AI systems that real businesses use to make real decisions

datacortex.contact

$datacortex contact

> Have a question or want to get in touch? Drop a message below.

formSend a message

Tell me what you're working on or what you'd like to discuss.

Quick & Secure

Your information is protected and will only be used to respond to your inquiry.

Privacy Protected • I'll respond within 24 hours

$other ways to reach me

Other ways to reach me

Email or social — whichever is easier.

irfan.ali@datacortex.in LinkedIn GitHub

What I work on

LLM systemsRAG pipelinesAgent orchestrationVoice AIOpen source

What to expect

1Reply within 24 hours on weekdays

2Direct, async-first conversation