• HOME
  • ACADEMICS
  • RESEARCH
  • PROJECTS
  • ACHIEVEMENTS
  • OUTREACH
  • STUDY NOTES
  • CONTACT

Technical Projects

SDSS ML Plot
98.48% Accuracy
Machine Learning

Predicting Stars, Galaxies & Quasars

Supervised ML Pipeline for SDSS Photometric Classification

Python Scikit-Learn XGBoost Streamlit

Building a supervised machine learning pipeline to classify astronomical sources from the Sloan Digital Sky Survey (SDSS) based on photometric features. This simulates a real data-intensive astronomy workflow essential for modern large-scale surveys.


GitHub Repo Launch App

Methodology

  • Data cleaning (dropping objid/specobjid)
  • Feature scaling with StandardScaler
  • Hyperparameter tuning via GridSearchCV

Models Evaluated

  • Random Forest (98.48% Accuracy)
  • XGBoost & Neural Networks (MLP)
  • Logistic Regression & SVM

Tools Used

  • Pandas, NumPy, Matplotlib, Seaborn
  • Scikit-Learn & Streamlit
  • Publicly available SDSS CSV dataset

Acknowledgement: Dataset provided by SDSS. ML training supported by the Brown Physics AI Winter School.

© 2026 Radit Raian | All rights reserved