Skip to content
View shaypal5's full-sized avatar
🐢
Working away...
🐢
Working away...

Organizations

@DataHackIL @NLPH @python-cachier

Block or report shaypal5

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shaypal5/README.md

Shay Palachy-Affek, AI & Data Leader, Lecturer, Nonprofit Builder, and OSS Maintainer

My Personal Website Medium LinkedIn

Hey there! I'm Shay. I lead AI & Data at the Adanim Institute, applying AI, data, and operational methods to social welfare systems. Alongside that work, I advise organizations on data science strategy, AI implementation, team building, and applied ML systems.

I co-founded and help lead DataHack, a nonprofit promoting diversity in data science and AI in Israel and supporting social-good data work through programs such as DataCoach, DataNights, DataTalks, and Kaggle-IL. I also teach Deep Learning, Text Mining, and Data Visualization at Tel Aviv University's Business School, and Intro to Machine Learning at The Academic College of Tel Aviv-Yaffo.

My open-source work spans Python data and ML tooling, AI workflow utilities, Hebrew NLP/OCR/audio dataset infrastructure, and coding-agent support tools.

Established Open Source Projects

Project Description Stars Downloads Forks Issues PRs
cachier Persistent caching decorators for Python functions Stars Downloads Forks Issues Pull Requests
pdpipe Composable pipelines for pandas DataFrames Stars Downloads Forks Issues Pull Requests
pulearn Positive-unlabeled learning with Python Stars Downloads Forks Issues Pull Requests
skift scikit-learn wrappers for Python fastText Stars Downloads Forks Issues Pull Requests
birch Hierarchical config for Python packages Stars Downloads Forks Issues Pull Requests
awesome-twitter-data Twitter/X datasets and resources Stars Forks Issues Pull Requests

Recent and Experimental Projects

Project Description Stars Forks Issues PRs
foldermix LLM-friendly folder packing CLI Stars Forks Issues Pull Requests
pr-agent-context GitHub Actions PR context for coding agents Stars Forks Issues Pull Requests
SynthBanshee Synthetic Hebrew audio dataset pipeline Stars Forks Issues Pull Requests
hocrgen Hebrew OCR dataset operations tooling Stars Forks Issues Pull Requests
leadforge Synthetic CRM and go-to-market datasets Stars Forks Issues Pull Requests
splendor Local-first, git-native knowledge compiler Stars Forks Issues Pull Requests

Recent Talks and Teaching

Selected Blog Posts

Pinned Loading

  1. pdpipe/pdpipe pdpipe/pdpipe Public

    Easy pipelines for pandas DataFrames.

    Jupyter Notebook 724 46

  2. python-cachier/cachier python-cachier/cachier Public

    Persistent, stale-free, local and cross-machine caching for Python functions.

    Python 654 72

  3. pulearn/pulearn pulearn/pulearn Public

    Positive-unlabeled learning with Python.

    Jupyter Notebook 250 35

  4. skift skift Public

    scikit-learn wrappers for Python fastText.

    Jupyter Notebook 233 23

  5. awesome-twitter-data awesome-twitter-data Public

    A list of Twitter datasets and related resources.

    1.1k 133

  6. birch birch Public

    Simple hierarchical configuration for Python packages.

    Python 14 10