GenAIx — AI Engineering Intern
Jun 2025 – Present · Remote
Engineered a Selenium/BeautifulSoup scraping pipeline automating job-data extraction (10K+ records/day). Designed a sharded relational database + ingestion scripts for millisecond-level filtering and reliable deduplication. Built a React front end with dynamic search/filter, integrating database queries and Tableau dashboards. Deployed a FastAPI microservice exposing subscription endpoints with scalable, well-structured APIs.
PythonSeleniumBeautifulSoupSQLReactFastAPITableau
AWS — AI/ML Scholars (SageMaker)
Jul 2025 – Present · Remote
Automated the ML lifecycle with SageMaker Projects: data prep, training, evaluation, and deployment via scripted, reproducible pipelines. Applied workflows on an S3-hosted healthcare dataset; evaluated trade-offs to select features and a production-ready deployment strategy.
AWS SageMakerS3MLOps
AI Student Collective — Data Science Intern
Mar 2025 – May 2025 · Davis, CA
Built an automated ETL pipeline (53K+ records) with feature engineering & cleaning. Benchmarked Random Forest vs. Decision Tree (R² 0.92 vs. 0.85) and selected the optimal model for deployment. Delivered an interactive Streamlit app with real-time predictions using sliders and dynamic UI.
Pythonscikit-learnStreamlitETL
HCLTech — Software Engineering Bootcamp
Jun 2023 – Sep 2023 · Remote
Performed EDA on a security/anomaly dataset with Pandas/NumPy/Seaborn. Built simple data pipelines and baseline anomaly-detection models in scikit-learn. Created Tableau/BI dashboards to visualize key trends and insights.
PandasNumPySeabornscikit-learnTableau