Skip to content
View SantiagoEnriqueGA's full-sized avatar
Block or Report

Block or report SantiagoEnriqueGA

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SantiagoEnriqueGA/README.md

πŸ‘‹ Hello, I'm Santiago!

πŸš€ Senior Member of Technical Staff with a focus on analytics and machine learning
🏫 Currently studying Computational Data Analytics at Georgia Tech
πŸ“ Based in Texas
πŸ“§ Email: sega97@gmail.com
πŸ”— LinkedIn
πŸ‘¨β€πŸ’» GitHub: SantiagoEnriqueGA

About Me 🌐

Blending deep experience in data analytics, machine learning, and software engineering, I transform data into actionable insights and innovative solutions.

Professional Highlights 🌟

  • Senior Member of Technical Staff - AT&T, Dallas, Texas (Present)
  • Broadband Advanced Analyst - AT&T, Dallas, Texas
  • Metrics Reporting Analyst - AT&T, Dallas, Texas
  • Data Implementation Analyst - IHS Markit, Dallas, Texas
  • Software Developer - ServiceNow - Sabre Corporation, Southlake, Texas

Toolbox πŸ› οΈ

  • Languages: Python, R, SQL, C, Java, JavaScript
  • Libraries: Pandas, NumPy, Scikit-learn, Keras, PyTorch, PySpark
  • Analytical Tools: Power BI, Tableau, Palantir
  • Certifications: Advanced Data Science IBM Specialization, Advanced Machine Learning and Signal Processing, Applied AI with DeepLearning, Fundamentals of Scalable Data Science

Education πŸŽ“

  • Master of Science in Analytics (Computational Data Analytics Track) - Georgia Institute of Technology (Expected 2024)
  • Bachelor of Science in Information Technology and Systems - The University of Texas at Dallas (2020)

Projects University/Personal πŸ“Š

  • Rental Pricing Dashboard - ISYE 6414 Regression Analysis

    • Developed an XGBoost model for predicting apartment rental prices with a MAE of 47.90.
    • Led data collection, aggregation, and model fine-tuning for optimal accuracy and robustness.
  • Conway's Game of Life - Personal Project

    • Implemented Conway's Game of Life using Pygame for graphics and Numpy for array operations.
    • Developed multiple versions with enhancements such as optimized drawing, pause/resume functionality, and user interaction improvements.
    • Created various screens for user settings and controls, including start menu, wrap options, glider configurations, and game world size settings.
    • GitHub Repository
  • Semantic Artist Similarity Analysis - Personal Project

    • Analyzed artist similarity using the Semantic Artist Similarity (SAS) Dataset.
    • Implemented Python scripts (SAS_networkx.py) for building and analyzing artist similarity networks using NetworkX and visualizing them with Plotly.
    • Created an HTML file (SAS_D3.html) for interactive visualization of the network using D3.js.
    • Explored functionalities such as parsing data, network metrics calculation, furthest connectivity analysis, and shortest paths.
    • GitHub Repository
  • Used Car Data Scraping and Analysis - Personal Project

    • Created class for automatically scraping data from Cars.com using BeautifulSoup.
    • Created pipeline to scrape data on several vehicle models, clean and analize the data.
    • GitHub Repository
  • Custom Clustering - Personal Project

    • Created classes for KMeans and DBSCAN from scratch using only numpy for array processing.
    • Custom classes allow for use of multiple clutering evaluation measures: elbow method, Silhouette Score, Davies-Bouldin Index, Calinski-Harabasz Index.
    • Done in order to understand the workings of clustering algorithms, and how to fit them to specifit datasets by building them from scratch.
    • GitHub Repository
  • Custom Ensemble Learning - Personal Project

    • Created class for creating random forest classifiers and regressors, and class for gradient boosted decision tree classifiers, regressors.
    • Created in base python using only numpy for speedy array processing.
    • Done in order to understand the workings of ensemble learning algorithms, and how to fit them to specifit datasets by building them from scratch.
    • GitHub Repository

Leadership and Organizations πŸ…

  • Dallas CASA (Court Appointed Special Advocates) (2021 - Current)
  • Co-Recruitment Chair, Delta Tau Delta Fraternity, UT Dallas (2016 - 2020)

Pinned

  1. game_of_life game_of_life Public

    This repository contains simulates Conway's Game of Life using Pygame for graphics and Numpy for array operations. The game simulates cellular automata where cells evolve based on simple rules.

    Python

  2. artist_similarity_network artist_similarity_network Public

    This repository uses semantic artist similarity to visualize artist networks based on Last.fm data. It utilizes Python with NetworkX for network analysis as well as Plotly and D3.js for interactive…

    Python

  3. d3_apartment_rent_prediction_vis d3_apartment_rent_prediction_vis Public

    This repository focuses on predicting apartment prices and visualizing data related to apartment listings. It combines various datasets to create predictive models and an interactive visualization …

    Jupyter Notebook

  4. used_car_price_visualization used_car_price_visualization Public

    This repository focuses on scraping data from Cars.com using the BeautifulSoup in Python, automating the process, and analyzing the data for insights.

    Jupyter Notebook

  5. custom_ensemble_learning custom_ensemble_learning Public

    This repository focuses on building a random forest classifier and regressor as well as a gradient boosted regressor, building them from scratch using only NumPy for faster array processing.

    Python

  6. custom_clustering custom_clustering Public

    This repository focuses on building clustering algorithms from scratch using only Numpy for faster array processing.

    Jupyter Notebook