New York, New York, United States
• Owned the complete development and deployment of a Retrieval-Augmented Generation (RAG)-based solution, enabling publishers to track and monetize proprietary data in LLM applications, leading to a joint venture with a strategic partner and $100k+ in external investment
• Developed a scalable AI-powered platform that enables research groups and e-commerce businesses to access publisher data on an as-needed basis through tailored AI Agents, reducing data-acquisition costs compared to traditional subscription packages
• Thrived in a fast-paced early-stage startup environment, constantly adapting to rapidly changing requirements, adopting new skills, and independently scoping and defining technical tasks
• Optimized system performance by iterating on open-source LLMs (e.g., LLaMA, Mixtral), embedding models (using transformers and the HuggingFace API), RAG techniques (LangChain), and vector databases (chromadb, ElasticSearch) to improve search accuracy and response time
• Built internal tools, rapidly iterating on Streamlit apps, to test hypotheses and automate dataset creation, reducing manual workload and development time
• Streamlined CI/CD workflows with GitHub Actions and Docker, reducing deployment time and improving system reliability through automated unit and performance testing