Boston, Massachusetts, United States
Engineered a context-aware AI agent chatbot using LangChain and PaLM-2 LLM on GCP Vertex AI, integrating advanced reasoning, memory, and tool orchestration to enable dynamic interactions and boost user engagement by 40%. Improved retrieval accuracy by 60% with a robust RAG pipeline using Pinecone Vector DB, and implemented evaluation/monitoring with Ragas and LangSmith for automated debugging and iterative improvements. Built a scalable backend with FastAPI, a React/TypeScript frontend, and secure inference using Guardrails API, while accelerating deployment by 40% through enhanced CI/CD workflows with Docker and Cloud Build. Leveraged Looker for data mining and insights on customer behavior.
Key Skills: GCP, LangChain, PaLM-2, ReactJS, TypeScript, Python, FastAPI, RAG, Vertex AI, Looker