• Streamlined high-velocity sensor data processing via the design and implementation of robust data pipelines and storage mechanisms
• Conducted extensive benchmark tests on large-scale time series data across databases, delivering pivotal insights for data management
• Leveraged PCA, dimensionality reduction, and correlation analyses, unveiling intricate trends and driving hypothesis verification
• Deployed ML models on high-volume, high-velocity time series data, effectively enhancing the predictive accuracy and the understanding of bioreactor processes