In Data Engineer Things, by Vu Trinh: "I spent 8 hours learning Parquet. Here’s what I discovered" (Aug 24, 2024). I finally sat down and learned about it.
In The Pythoneers, by Abhay Parashar: "9 MindBlowing Python Features You Aren’t Using Enough" (Sep 16, 2024). That can be a game-changer.
In AI Advances, by Yuki Shizuya: "Explainable Clustering — The Introduction of Recursive Embedding and Clustering and its Application" (Aug 23, 2024). Spotify developed the simple but powerful explainable clustering method
In TDS Archive, by W Brett Kennedy: "Achieve Better Classification Results with ClassificationThresholdTuner" (Sep 7, 2024). A python tool to tune and visualize the threshold choices for binary and multi-class classification problems
In The Generator, by Jim the AI Whisperer: "My one-word AI prompt to induce deeper reasoning and more accurate output from ChatGPT: “RUMINATE”" (Aug 28, 2024). Slow down, genius: A simple hack for smarter AI responses
In Level Up Coding, by Dr. Ashish Bamania: "The New ‘Adam-mini’ Optimizer Is Here To Cause A Breakthrough In AI" (Jul 5, 2024). A deep dive into how Optimizers work, their developmental history, and how the 'Adam-mini' optimizer enhances LLM training like never…
In TDS Archive, by Heiko Hotz: "RAG vs Finetuning — Which Is the Best Tool to Boost Your LLM Application?" (Aug 24, 2023). The definitive guide for choosing the right method for your use case
Thomas Bury: "Interpretable ML 3: What nobody tells you about SHAP values" (Feb 11, 2024). Our journey through the intricacies of interpretable machine learning continues! In this episode, we’ll venture beyond simplistic…
Duy Huynh: "Build your own RAG and run it locally: Langchain + Ollama + Streamlit" (Dec 5, 2023). With the rise of Large Language Models and its impressive capabilities, many fancy applications are being built on top of giant LLM…
Prathamesh Sonawane: "XGBoost — How does this work" (Dec 4, 2023). I’ll cover everything there is to cover about XGBoost in this blog. Lmk if you think something is missing in the comments.
In TDS Archive, by Samuele Mazzanti: "What’s Wrong With R-Squared (And How to Fix It)" (Aug 7, 2024). Even if you think you are using R-Squared out-of-sample, you are not. Here is why
In TDS Archive, by Jacky Kaub: "The Biggest Weakness Of Boosting Trees" (Feb 12, 2024). Why distribution drifts can really hurt your models
In Data Science in your pocket, by Mehul Gupta: "Graph Neural Networks for Beginners" (Jan 11, 2024). Understanding Graph Theory, Networkx, and GNNs basics
Devansh: "A simple introduction to MIT’s Lottery Ticket Hypothesis" (Feb 5, 2024). One of Deep Learning’s most exciting ideas
In Level Up Coding, by Trey Huffine: "Best of Level Up Coding: A Recap of 2023" (Jan 2, 2024). As we enter 2024, let’s look back at the most impactful articles in Level Up Coding from the past year.
In TDS Archive, by Matteo Courthoud: "Understanding Group Sequential Testing" (Dec 26, 2023). How to run valid experiments, with peeking and early stopping.
PiML Tutorials: "Model Diagnostics: Prediction Uncertainty" (Oct 30, 2023). Today’s PiML tutorial is about quantification of prediction uncertainty for a pre-trained machine learning model. We take a deep dive into…
Rukshan Pramoditha: "7 Types of Cross-Validation (CV) Techniques You Should Know as a Data Scientist in 2023" (Oct 9, 2023). With their Python implementations graphical visualizations
In Optuna, by Kohei Ozaki: "LightGBM Tuner: New Optuna Integration for Hyperparameter Optimization" (Mar 3, 2020). Optuna provides the automation of LightGBM hyperparameter tuning. Users can now enjoy hyperparameter tuning-free LightGBM!
In TDS Archive, by Bex T.: "10 Confusing XGBoost Hyperparameters and How to Tune Them Like a Pro in 2023" (Jun 11, 2023).