San Francisco, California, United States
• Owned and optimized performance of the Abacus external data connector suite supporting all major cloud provider storage offerings and SaaS database services such as Snowflake, Databricks, and BigQuery
• Increased throughput, decreased memory footprint, and enabled parallel scaling of the distributed batched inference service
• Designed and implemented asynchronous LLM APIs / bots which allowed multiple users to have ChatGPT- like group conversations with Abacus LLMs in communication platforms such as Slack and Microsoft Teams
• Created PySpark-based incremental datasets from database services for ETL into product
• Built periodic full execution of cloud-hosted Jupyter notebooks for data science using scripting inside Kubernetes deployments
• Implemented functionality to create / update custom ML models using customer-provided python code