sralter/README.md


Hello, my name is Samuel Alter. Welcome to my GitHub!

My top-five facts:

  1. 👨‍🔬 Proud science nerd who spent years thinking about how geology🪨, biology🌳, and water💧 interact
  2. 🔄 Pivoted career to focus on what underlies all scientific inquiry: data!
  3. 🧪 I take a scientific, data-focused approach to solving problems with data analytics📊
  4. 🐣 ➡ 🦅 I love working on the full insights lifecycle, from research to analysis to modeling to reporting
  5. 🌐 My focus is on solving geospatial problems using a mix of open source tools and applications

Languages and Tools:

Python, SQL, Tableau, QGIS, ArcGIS Pro, PostGIS, PyTorch, Git

Some projects that I've done:

  • Sustainability Insights
    • Python-based project centered on gleaning insights from mining emissions datasets sourced from Climate TRACE
      • Python tools used: NumPy, Pandas, DuckDB, and Matplotlib
    • I created a Tableau Public Story showing which companies are most responsible for the world's mining emissions
    • I wrote an Executive Summary that explains my workflow and conclusions
  • ClassiFIRE
    • Python- and QGIS-based wildfire prediction model
      • Python tools used: NumPy, Pandas, Matplotlib, Seaborn, scikit-learn, TensorFlow
    • Presentation of my findings
    • Executive Summary of my work
  • Potential Talents
    • Python-based NLP project that uses learning-to-rank systems to rank job candidates by their similarity to given search terms
      • Visualization techniques used: histograms, boxplots, bar charts, choropleth map, word cloud
      • Python tools used: text embeddings (TF-IDF, Word2Vec, GloVe, fastText, SBERT), scikit-learn's cosine similarity, RankNet with PyTorch, LambdaRank with LightGBM
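
As a rough illustration of the similarity-ranking idea in Potential Talents, the sketch below scores job titles against a search phrase using TF-IDF weights and cosine similarity. It is not the project's actual code, which uses the libraries listed above. It relies only on the Python standard library, and the function names, the smoothed IDF formula, and the sample titles are all assumptions made for this example.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one sparse {term: weight} dict per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed inverse document frequency (an assumed, commonly used variant).
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1 for t, c in df.items()}
    return [
        {t: tf * idf[t] for t, tf in Counter(tokens).items()}
        for tokens in tokenized
    ]


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_candidates(titles, query):
    """Return (score, title) pairs sorted from most to least similar."""
    vectors = tfidf_vectors(titles + [query])
    query_vec = vectors[-1]
    scored = [(cosine(v, query_vec), t) for v, t in zip(vectors[:-1], titles)]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Hypothetical job titles, invented for this example.
titles = [
    "Software engineer",
    "Human resources manager seeking new role",
    "Aspiring human resources specialist",
]
for score, title in rank_candidates(titles, "aspiring human resources"):
    print(f"{score:.3f}  {title}")
```

Titles that share no words with the search phrase score zero. That is the main limitation of purely lexical TF-IDF matching, and one reason dense embeddings such as SBERT are worth comparing against it.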

More about me:

As a data professional, I employ the latest data techniques to uncover insights. I am skilled in:

  • Research: With my science mindset, I know how to bring in external data to enrich the analysis
  • Reporting: My years as a researcher and consultant have taught me how to communicate effectively
  • Data Manipulation: Need help preparing the data? I'll utilize Pandas, DuckDB, SQL queries, and Polars
  • Visualization: Want some figures? I can use Python (Matplotlib, Seaborn), Tableau, R (ggplot2), or Excel
  • Business Intelligence Tools: Did I mention Tableau? Let's talk about how I can build dashboards for you
  • Python for Data Analysis and Machine Learning: Scikit-learn, TensorFlow, and Jupyter are my home
  • Predictive Modeling: Using Hyperopt and Optuna, I have tuned countless algorithms on varied datasets
  • Deep Learning: How about neural networks? I have used TensorFlow to analyze satellite imagery
  • GIS: Want to know "where"? I use my geospatial skills in Esri's ArcGIS Pro, QGIS, and PostGIS to solve it
  • Statistics: Want to know "why"? I use Python's Statsmodels to figure that out
  • Natural Language Processing: I am skilled in NLP and can help you perform text or sentiment analysis

You can find me on LinkedIn.

Pinned repositories:

  1. potential_talents (Public)

    Using NLP techniques (word and sentence embedding tools like SBERT and Learning-to-Rank systems like RankNet and LambdaRank) to rank candidates.

    Jupyter Notebook

  2. sustainability_insights (Public)

    A data analysis project that derived insights from an emissions dataset sourced from Climate TRACE.

    Jupyter Notebook 1

  3. classifire (Public)

    Wildfire Prediction Model: Samuel Alter's BrainStation 2023 Data Science Capstone Project

    Jupyter Notebook 1

  4. term_deposit_marketing (Public)

    Predicting which customers will most likely purchase a type of financial product, achieving a time savings of over 93%.

    Jupyter Notebook 1

  5. happy_customers (Public)

    Predicting whether a customer is happy based on the results from a survey.

    Jupyter Notebook 1