SIAN JIN
forest

Sian Yujia Jin

Data Scientist & Engineer  ·  ML & AI Applications

View My Work Resume
Sian Jin

About Me

Data Scientist & Engineer

Hi, this is Sian! I love working with data, especially when it feels messy and complex at first. There is something really satisfying about turning it into something clear, reliable, and useful.

I work across data science and engineering, building systems from raw data all the way to machine learning and AI applications. Lately, I have been exploring how large language models can be used in healthcare and research to make a real impact.

Outside of work, I spend a lot of time taking care of my plants 🌿, exploring new places, and staying active. I am someone who enjoys simple things, being close to nature, and bringing a sense of calm and intention into both life and work.

Skills

Python R SQL JavaScript Tableau Machine Learning NLP LLM FastAPI PostgreSQL Spark Data Visualization

Projects

A selection of the day's work

HireTrail

A full-stack job search tracker supporting companies, applications, and interview rounds with status filtering, CSV bulk import, and a live dashboard. Backed by PostgreSQL with Alembic migrations, deployed on Render and Streamlit Cloud.

FastAPI Streamlit PostgreSQL Alembic Render
View on GitHub

Tableau Dashboards

A collection of interactive dashboards covering HR Analytics, Covid-19 trends, and more. Built to communicate complex data stories through clean, intentional visual design.

Tableau Data Visualization
View Dashboards

Volcano Analysis & Visualizations

A data science story about volcanoes — exploratory data analysis with interactive visualizations. Covers time series, geospatial, and quantitative data with statistical analysis.

Python R JavaScript Plotly Matplotlib
View Project

Reddit Political Engagement

Cloud-based big data project exploring behavioral trends in political engagement on social media, with NLP and ML analysis on Reddit data.

Spark Databricks NLP Machine Learning Python
View Project

Clinical Trials Analytics

Data cleaning, text mining, and machine learning applied to a clinical trials dataset, exploring data science questions behind trial outcomes and patterns.

Python Text Mining Machine Learning R
View Project

Weather Forecast — ML & Time Series

A weather forecasting project on climate change data using linear regression, vector autoregression, and LSTM neural networks. Published in ICJE.

Python LSTM Time Series VAR Published Paper
Read Paper

Stock Price Time Series — Healthcare

Modeling healthcare company stock prices using multiple time series methods to explore development and financial trends of the sector.

R Time Series ARIMA Finance
View Project

more coming soon...



Contact

Come say hello

sjin.data@gmail.com
Sian's plants