I’m currently working as a Research Scientist for The Vera C. Rubin Observatory - Chile. I have an MS in Data Science from The University of Washington - Seattle. I've worked as a Data & Machine Learning Engineer for Shell, where I developed and deployed some of the most sophisticated and successful data-powered products, helping traders generate $500+ million/year in revenue. I have hands-on experience taking products from ideation to production in both startup and corporate environments. I have a strong foundation in data science, machine learning, data engineering, and operations. Let's connect and discuss how we can work together to build something awesome!
Subjects: Introduction to Statistics and Probability, Data Visualization, Software Design, Applied Statistics and Experimental Design, Data Management, Statistical Machine Learning, Human-Centered Data Science, Scalable Data Systems and Algorithms.
Co-curricular: Organizer at The RAISE Group, Graduate Research Assistant at the DiRAC Institute, Capstone with Virufy.
Subjects: Data Structures & Algorithms, Object-Oriented Programming Methodology, Big Data, Open-Source Technologies, Soft Computing, Database Management System, Software Engineering, Data Mining & Business Intelligence, Distributed Systems, Cloud Computing, Software Project Management, Intelligent System.
Co-curricular: Co-founder of the Coders' Club.
Concepts & Technologies
Data Science, Data Engineering, Machine Learning, Natural Language Processing (NLP), Computer/Machine Vision, MLOps (Machine Learning Operations), ETL (Extract-Transform-Load), Data Visualization, A/B Testing, Data Modeling, Database Management, Data Analysis, Data Wrangling, Data Warehousing, RAG.
Programming & Scripting Languages
Regular: Python, SQL, JavaScript, HTML, CSS
Past Experience: C, C#, Java, PHP
Tools & Frameworks
Data Engineering: Databricks, Apache Spark, Git, Microsoft Azure, AWS, GCP, Microsoft SQL Server, Alteryx, Apache Airflow, Docker.
Data Science: Microsoft Azure ML Studio, HuggingFace, PyTorch, Scikit-learn, TensorFlow, Pandas, NumPy, Matplotlib, OpenCV, Tableau, Flask, Keras, NLTK, FastAPI, Streamlit, Seaborn, LangChain, Vector Database (Pinecone).