abhishek.kumar
Open to Senior AI Architect & GenAI Solution Architecture roles · Remote-first · International

Abhishek
Kumar.

Solution Architect

Ten years turning ambiguous business and regulatory requirements into governed, production-ready AI. I own solution architecture for enterprise compliance and document-intelligence systems — defining classification taxonomies, retrieval architectures, and LLM integration patterns, and naming the trade-offs that make them scale, survive an audit, and ship.

Abhishek Kumar sitting on stone steps in a white shirt and sunglasses, string lights and a tiled-roof building behind him. Black and white.
Bangalore-based. Internet-based.
Currently mid-thought.

A short, dated list of what’s top of mind.

Last updated: May 2026 · in the spirit of nownownow.com
  • JobPilot — a personal job-hunt copilot for the v0 global hackathon. Next.js + Neon Postgres + Apify on Vercel. Shipping the v0 submission this week.building
  • Porting the Superpowers plugin to Kiro IDE on Windows — pairing on Windows is a love story I’m still negotiating.porting
  • Publishing Claude Agent Skills: v0-prompting, humanizer, AI Ark, 360Brew profile optimizer, and a LinkedIn content one I keep tweaking at 2am.shipping
  • Quietly exploring international remote roles and Senior AI Architect positions at AI-native companies. (Yes, this is a hint.)listening
  • Reading: PageIndex paper, agentic eval harnesses, anything Anthropic’s research org puts on arxiv at 11pm IST.reading

Four outcomes, not a wall of skills.

I think people hire architects for outcomes, not for buzzword bingo. So here’s the work, grouped by what it actually decides.

01 / architecture

GenAI Solution Architecture

Translating ambiguous business and regulatory requirements into reference architectures. Technology selection, integration patterns, schema and API design, scalability, and production-readiness.

Reference architecture Tech selection Integration design Production-readiness
02 / retrieval

Retrieval Architecture & Trade-offs

Designing and formally evaluating retrieval strategies — RAG, PageIndex, hybrid, reranking — with evaluation harnesses that make the approach-selection call defensible, not vibes-based.

RAG PageIndex Hybrid retrieval Eval harnesses RAGAS
03 / governance

Governance & Compliance by Design

Audit-ready systems with classification taxonomies, PII redaction, model risk assessment, and explainability — aligned to GDPR, EU AI Act, and enterprise security KPIs.

GDPR EU AI Act PII redaction Audit-readiness Explainability
04 / multi-cloud

Multi-cloud & MLOps Architecture

Reference implementations that transfer cleanly across AWS Bedrock, Azure AI, and GCP. Model lifecycle, drift detection, CI/CD, capacity planning, monitoring.

AWS Bedrock Azure AI GCP MLOps Drift detection

Architecture is the art of naming what you gave up.

Four principles I bring to every system.

01

Trade-offs over trends.

Every choice sacrifices something. I name what I gave up and why — that’s the line between picking a tool and architecting a system.

02

Governed by default.

Compliance, auditability, and explainability are designed in from day one — not bolted on the week before the audit.

03

Evaluation before production.

Formal quality gates and evaluation harnesses gate every prompt and retrieval change. Nothing ships on “looks good.”

04

Portable by design.

Patterns that transfer cleanly across Bedrock, Azure, and GCP — so a cloud choice is never a one-way door.

Patterns proven across telecom, healthcare, pharmaceuticals, and retail.

Six things I’ve shipped, roughly in order of recency.

Pulled from the resume. Each one has a working artifact behind it — happy to walk through eval results, retrieval configs, or the bits that almost shipped sideways.

2026
now

Compliance RAG Pipeline

Production

“Claude reads 300+ pages of regulatory text and cites its sources.”

End-to-end production RAG for a Tier-1 enterprise engagement: Claude Sonnet on Bedrock, Qdrant for hybrid retrieval, FastAPI orchestration, RAGAS gates — the decision framework other compliance workflows now follow.

Retrieval evaluation architecture pattern — RAG vs PageIndex comparison flow with evaluation gates
Claude SonnetBedrockQdrantRAGASPageIndexFastAPI
2025

Enterprise Asset Classification at Scale

Production

“Hundreds of asset classes, structured outputs, a multi-week process killed.”

LLM classifier on Claude/Bedrock for a regulated enterprise client: structured output validation, confidence scoring, rationale generation, and a human-in-the-loop review queue for audit defensibility. Replaced a multi-week manual categorization with near-real-time.

Compliance classification architecture pattern — structured-output validation with human-in-the-loop review queue
ClaudeBedrockstructured outputsHITLgovernance
2025

JobPilot

In flight

“A job-hunt copilot that actually reads the listings.”

Personal project, v0 global hackathon submission. Next.js + Neon Postgres + Apify scrapers on Vercel, with retrieval over my own work history and a small evaluator for fit. Currently being dogfooded on my own search.

Next.jsNeonApifyv0personal
2024

Claude Interview Platform

🏆 Hackathon Winner

“$10K and a year of validation.”

Winner of the AWS Global GenAI Hackathon. Claude on Bedrock with dynamic, context-aware question generation and RAG-grounded transcript evaluation against role competencies. Prompt orchestration, structured-output scoring, quality validation across both generated and evaluated content.

ClaudeBedrockRAGeval-LLM$10K prize
2024

PharmaGraph

Production

“A knowledge graph that explains itself to a pharmacist.”

Neo4j knowledge graph with LLM-based entity linking and a RAG retrieval interface for complex pharmacological queries, built for a large US healthcare deployment. Every response grounded to a source document with inline citation and auditor-visible provenance. Azure OpenAI + Azure AI Search.

Neo4jAzure OpenAIAI Searchgrounded RAGprovenance
2025

GTMind

Personal

“Multi-tenant B2B prospecting, wrapped in MCP.”

Multi-tenant B2B prospecting SaaS exposed as an MCP server. FastAPI + Anthropic MCP SDK. The clients pull leads through Claude — the agent does the boring part, the human does the call.

FastAPIMCPAnthropic SDKmulti-tenant

Six was an arbitrary number — here’s the rest of the rotation.

More builds, less depth. Same person.

#07 Production

Document intelligence layer

The ingestion layer downstream compliance and retrieval workflows stand on.

LLM-light document-to-structured-content service. Powers downstream compliance, classification, and retrieval workflows for a Tier-1 enterprise. The unglamorous infrastructure that everything else stands on.

FastAPIPythonOCRStructured extractionVector store
#08 Production

Multimodal document processing pipeline

Compliance review across docs, slides, spreadsheets, and images.

Extended a text-only compliance pipeline to handle PPTX, XLSX, embedded images, and OLE objects using Claude Vision. The pipeline now reads anything an auditor would actually open.

Multimodal document intelligence architecture pattern — multi-format ingestion via vision-capable LLM
Claude VisionFastAPIAPI gatewayPython
#09 R&D

LinkedIn Connection Intelligence

Multi-agent RAG over personal LinkedIn graph data.

Ingests a LinkedIn connection graph, enriches it, and surfaces second-order paths and warm introductions. Pitched to the Vercel AI Accelerator. The “LinkedIn GPT for individuals” idea.

LangGraphQdrantNeo4jPostgresNext.js
#10 Built

Voice Reservation Agent

Conversational restaurant booking over the phone.

Takes restaurant reservation calls end-to-end — handles availability, party size, dietary notes, and confirmation, then writes to the booking system. The first time hearing it pick up was uncanny.

n8nElevenLabsTwilioOpenAI
#11 Built

ICP Qualifier Agent

Lead qualification agent on a partner agent platform.

Qualifies inbound leads against an ideal customer profile — pulls firmographic and intent signals, scores, routes. Built on the Nasiko A2A platform as part of their agent ecosystem.

NasikoPythonAnthropic SDK
#12 Personal

OpenTravel

Travel chatbot grounded in YouTube creator transcripts.

Niche RAG that treats travel YouTube creators as the knowledge base. Extracts transcripts, builds a hybrid retriever, answers travel questions with cited timestamps. The internet’s actual travel guide, finally queryable.

YouTube APIOpenAIRerankingEmbeddings
#13 Personal

HydroLedger

Car wash management system, no-code first.

End-to-end ops for a small car wash — bookings, customer ledger, daily reports. Built to test how far Lovable + Supabase can take you before you reach for code. Further than expected.

LovableSupabaseReact

A decade, four chapters, one consistent through-line.

Started in analytics, ended up in production GenAI. The titles shifted; the work — turning messy real-world data into systems people can trust — didn’t.

2015 → 2020 Analytics & Data Science (multiple roles)
Gambit Sports · Rooster Properties · Jupiter Infrastructure

Started in sports analytics and operational analytics; built predictive models, ETL pipelines, and early ML systems across sports, real estate, and infrastructure verticals.

2021 → 2023 Lead Business Analyst
Practo Technologies · healthcare consumer tech

Marketing Mix Modeling, real-time campaign optimization, user segmentation. First serious production ML work — the kind that has SLAs.

2023 → 2025 Manager, Data Science & Analytics
Factspan Analytics

Transitioned into GenAI leadership. Built production RAG, knowledge graphs, and MLOps infrastructure. Won the AWS Global GenAI Hackathon during this period.

2025 → now Digital Engineering Staff Engineer, AI & GenAI
NTT DATA

Leading enterprise RAG delivery, LLM evaluation frameworks, and AI governance for Tier-1 clients. Architecting parallel cloud-native reference implementations across AWS Bedrock and Azure OpenAI.

7,000 practitioners, 40+ events, one chai habit.

I lead the Bangalore chapter of The AI Collective — the largest practitioner community in India for production AI engineers. Since March 2025, we’ve shipped 40+ events, hosted speakers from across the ecosystem, and built the kind of room where people argue about eval harnesses and then go get dinner.

Apr 2026 · Bangalore
Claude Code Meetup
200 attendees · speaker + organizer
Mar 2026 · Bangalore
Vercel v0 Buildathon
Buildathon co-lead
Feb 2026 · Microsoft
GenAI Demo Day 2.0
Hosted at Microsoft Bangalore
Jan 2026 · Bangalore
Genspark Event
80 attendees · talk + Q&A
2025 — ongoing
Andela — AI for DevOps
Cohort facilitator (paid)
Since Mar 2025
AI Collective — Bangalore
Chapter lead · 7,000+ members
In partnership with
Microsoft Anthropic Vercel Genspark Andela

Mostly Claude Agent Skills and what broke.

I publish small, sharp agent skills — opinionated prompts and tools designed to make Claude better at one specific thing. A few are below; the rest are in drafts.

Published agent skills
v0-prompting humanizer / de-AIs your copy AI Ark OpenClaw 360Brew — profile optimizer LinkedIn content
Soon · essay “What I got wrong about RAG eval (twice).”
Soon · teardown “PageIndex vs chunked RAG: an honest benchmark.”
Soon · field notes “Running a 5K-person AI community on weekends.”

The short version of the long PDF.

Download resume (PDF)
M.S. — Machine Learning & AI Liverpool John Moores University, UK
2019 → 2021
PG Diploma — ML & AI IIIT Bangalore
2019 → 2020
B.Tech — Engineering Manipal Institute of Technology
2012 → 2015
🏆 AWS Global GenAI Hackathon Winner GenAI Academy · Green Belt Azure AI Fundamentals Azure Generative AI GCP Intro to GenAI deeplearning.ai · DL Specialization CSPO® · Scrum Alliance

Working on something interesting? Send a note.

I read everything. Best for: AI / GenAI Solution Architect conversations at AI-native companies, production-RAG and governance consulting, community collabs, or just a good thread about retrieval trade-offs. Slow for: cold sales pitches.

Tweaks ×