List: Offline Reading | Curated by Elli Tzini

Dec 31, 2024

41 stories

Offline Reading
In
Data Engineer Things
by
Vu Trinh
I spent 8 hours learning Parquet. Here’s what I discoveredI finally sat down and learned about it.
Aug 24, 2024
2.3K
23
Aug 24, 2024
2.3K
23
In
The Pythoneers
by
Abhay Parashar
9 MindBlowing Python Features You Aren’t Using EnoughThat can be a game-changer.
Sep 16, 2024
2.4K
23
Sep 16, 2024
2.4K
23
In
AI Advances
by
Yuki Shizuya
Explainable Clustering — The Introduction of Recursive Embedding and Clustering and its ApplicationSpotify developed the simple but powerful explainable clustering method
Aug 23, 2024
443
2
Aug 23, 2024
443
2
In
TDS Archive
by
W Brett Kennedy
Achieve Better Classification Results with ClassificationThresholdTunerA python tool to tune and visualize the threshold choices for binary and multi-class classification problems
Sep 7, 2024
482
3
Sep 7, 2024
482
3
In
The Generator
by
Jim the AI Whisperer
My one-word AI prompt to induce deeper reasoning and more accurate output from ChatGPT: “RUMINATE”Slow down, genius: A simple hack for smarter AI responses
Aug 28, 2024
4.5K
55
Aug 28, 2024
4.5K
55
In
Level Up Coding
by
Dr. Ashish Bamania
The New ‘Adam-mini’ Optimizer Is Here To Cause A Breakthrough In AIA deep dive into how Optimizers work, their developmental history, and how the 'Adam-mini' optimizer enhances LLM training like never…
Jul 5, 2024
341
5
Jul 5, 2024
341
5
In
TDS Archive
by
Heiko Hotz
RAG vs Finetuning — Which Is the Best Tool to Boost Your LLM Application?The definitive guide for choosing the right method for your use case
Aug 24, 2023
3.3K
30
Aug 24, 2023
3.3K
30
Thomas Bury
Interpretable ML 3: What nobody tells you about SHAP valuesOur journey through the intricacies of interpretable machine learning continues! In this episode, we’ll venture beyond simplistic…
Feb 11, 2024
87
Feb 11, 2024
87
Duy Huynh
Build your own RAG and run it locally: Langchain + Ollama + StreamlitWith the rise of Large Language Models and its impressive capabilities, many fancy applications are being built on top of giant LLM…
Dec 5, 2023
986
15
Dec 5, 2023
986
15
Prathamesh Sonawane
XGBoost — How does this workI’ll cover everything there is to cover about XGBoost in this blog. Lmk if you think something is missing in the comments.
Dec 4, 2023
226
3
Dec 4, 2023
226
3
In
TDS Archive
by
Samuele Mazzanti
What’s Wrong With R-Squared (And How to Fix It)Even if you think you are using R-Squared out-of-sample, you are not. Here is why
Aug 7, 2024
1K
19
Aug 7, 2024
1K
19
In
TDS Archive
by
Jacky Kaub
The Biggest Weakness Of Boosting TreesWhy distribution drifts can really hurt your models
Feb 12, 2024
360
10
Feb 12, 2024
360
10
In
Data Science in your pocket
by
Mehul Gupta
Graph Neural Networks for BeginnersUnderstanding Graph Theory, Networkx, and GNNs basics
Jan 11, 2024
610
1
Jan 11, 2024
610
1
Devansh
A simple introduction to MIT’s Lottery Ticket HypothesisOne of Deep Learning’s most exciting ideas
Feb 5, 2024
163
1
Feb 5, 2024
163
1
In
Level Up Coding
by
Trey Huffine
Best of Level Up Coding: A Recap of 2023As we enter 2024, let’s look back at the most impactful articles in Level Up Coding from the past year.
Jan 2, 2024
532
9
Jan 2, 2024
532
9
In
TDS Archive
by
Matteo Courthoud
Understanding Group Sequential TestingHow to run valid experiments, with peeking and early stopping.
Dec 26, 2023
363
6
Dec 26, 2023
363
6
PiML Tutorials
Model Diagnostics: Prediction UncertaintyToday’s PiML tutorial is about quantification of prediction uncertainty for a pre-trained machine learning model. We take a deep dive into…
Oct 30, 2023
104
Oct 30, 2023
104
Rukshan Pramoditha
7 Types of Cross-Validation (CV) Techniques You Should Know as a Data Scientist in 2023With their Python implementations graphical visualizations
Oct 9, 2023
251
2
Oct 9, 2023
251
2
In
Optuna
by
Kohei Ozaki
LightGBM Tuner: New Optuna Integration for Hyperparameter OptimizationOptuna provides the automation of LightGBM hyperparameter tuning. Users can now enjoy hyperparameter tuning-free LightGBM!
Mar 3, 2020
438
12
Mar 3, 2020
438
12
In
TDS Archive
by
Bex T.
10 Confusing XGBoost Hyperparameters and How to Tune Them Like a Pro in 2023
Jun 11, 2023
776
4
Jun 11, 2023
776
4

Offline Reading

I spent 8 hours learning Parquet. Here’s what I discovered

I finally sat down and learned about it.

9 MindBlowing Python Features You Aren’t Using Enough

That can be a game-changer.

Explainable Clustering — The Introduction of Recursive Embedding and Clustering and its Application

Spotify developed the simple but powerful explainable clustering method

Achieve Better Classification Results with ClassificationThresholdTuner

A python tool to tune and visualize the threshold choices for binary and multi-class classification problems

My one-word AI prompt to induce deeper reasoning and more accurate output from ChatGPT: “RUMINATE”

Slow down, genius: A simple hack for smarter AI responses

The New ‘Adam-mini’ Optimizer Is Here To Cause A Breakthrough In AI

A deep dive into how Optimizers work, their developmental history, and how the 'Adam-mini' optimizer enhances LLM training like never…

RAG vs Finetuning — Which Is the Best Tool to Boost Your LLM Application?

The definitive guide for choosing the right method for your use case

Interpretable ML 3: What nobody tells you about SHAP values

Our journey through the intricacies of interpretable machine learning continues! In this episode, we’ll venture beyond simplistic…

Build your own RAG and run it locally: Langchain + Ollama + Streamlit

With the rise of Large Language Models and its impressive capabilities, many fancy applications are being built on top of giant LLM…

XGBoost — How does this work

I’ll cover everything there is to cover about XGBoost in this blog. Lmk if you think something is missing in the comments.

What’s Wrong With R-Squared (And How to Fix It)

Even if you think you are using R-Squared out-of-sample, you are not. Here is why

The Biggest Weakness Of Boosting Trees

Why distribution drifts can really hurt your models

Graph Neural Networks for Beginners

Understanding Graph Theory, Networkx, and GNNs basics

A simple introduction to MIT’s Lottery Ticket Hypothesis

One of Deep Learning’s most exciting ideas

Best of Level Up Coding: A Recap of 2023

As we enter 2024, let’s look back at the most impactful articles in Level Up Coding from the past year.

Understanding Group Sequential Testing

How to run valid experiments, with peeking and early stopping.

Model Diagnostics: Prediction Uncertainty

Today’s PiML tutorial is about quantification of prediction uncertainty for a pre-trained machine learning model. We take a deep dive into…

7 Types of Cross-Validation (CV) Techniques You Should Know as a Data Scientist in 2023

With their Python implementations graphical visualizations

LightGBM Tuner: New Optuna Integration for Hyperparameter Optimization

Optuna provides the automation of LightGBM hyperparameter tuning. Users can now enjoy hyperparameter tuning-free LightGBM!

10 Confusing XGBoost Hyperparameters and How to Tune Them Like a Pro in 2023

Elli Tzini

For Work

Experimentation

Coding