How we cut search latency by 85% at Amazon
DynamoDB → Elasticsearch migration: sharding, indexing, and lessons learned
Building distributed systems, ranking & recommendation engines, and AI-powered products at scale.
Currently shipping video experiences for Amazon Rufus — $600M+ revenue impact.
IIIT Hyderabad · IIT Madras · Seattle, WA
Built the video product widget surfaced on Amazon detail pages — a high-traffic, latency-critical feature serving 10M+ requests/day. Solved the latency challenge using lazy loading and delayed REST calls so above-the-fold content renders instantly. Ranking and recommendation signals determine which videos surface per customer, driving $600M+ in increased revenue.
Built gRPC-based clip-generation and serving workflows for Amazon's AI shopping assistant Rufus. Core focus was ranking and relevance — surfacing the most useful video moment for each product question using ranking models, vector retrieval, and multimodal signals. Achieved 7% CTR.
Migrated DynamoDB read-heavy access paths to Elasticsearch for a Tier-1 search service. Designed shard strategy, tuned indexing and query relevance, and built ranking signals into search — cutting latency by 85% and enabling more accurate, personalized recommendations at 10% MoM traffic growth.
End-to-end async event-driven system for driver license verification using SQS queues and AWS Step Functions — eliminated manual review bottlenecks at scale.
Built RNN/LSTM & attention-based models to produce high-quality abstractive summaries. Trained and evaluated on a large corpus at IIIT Hyderabad's NLP Lab.
Trained Transformer models to learn symbolic math rules — simplification, factoring, and multi-step equation transformation — through structured reasoning tasks.
Event-driven webhook infrastructure supporting async callbacks for third-party integrations — reduced synchronous polling load and improved reliability across client APIs.
Coming soon — planning to write about distributed systems, AI/ML engineering, and lessons from building at Amazon scale.
DynamoDB → Elasticsearch migration: sharding, indexing, and lessons learned
gRPC clip serving, moment retrieval, and ranking at 10M+ requests/day
AWS Step Functions + SQS patterns for reliable onboarding workflows
Update the href above with your real GitHub username after deploying.
nalla.abhishek4@gmail.com
linkedin.com/in/abhishek-nalla
github.com/abhishek-nalla
Senior / Staff SDE roles · AI-first companies · Remote / Seattle / California
say hello →