Concerns about the environmental impacts of Large Language Models (LLMs) are growing. Although detailed information about the actual costs of LLMs can be difficult to find, let’s attempt to gather some facts to understand the scale. Generated with ChatGPT-4oSince comprehensive data on ChatGPT-4 is not readily available, we can consider Llama 3.1 405B as an…
Understand missing data patterns (MCAR, MNAR, MAR) for better model performance with Missingno In an ideal world, we would like to work with datasets that are clean, complete and accurate. However, real-world data rarely meets our expectation. We often encounter datasets with noise, inconsistencies, outliers and missingness, which requires careful handling to get effective results.…
This November 30 marks the second anniversary of ChatGPT’s launch, an event that sent shockwaves through technology, society, and the economy. The space opened by this milestone has not always made it easy — or perhaps even possible — to separate reality from expectations. For example, this year Nvidia became the most valuable public company…
|LLM|INTERPRETABILITY|SPARSE AUTOENCODERS|XAI| A deep dive into LLM visualization and interpretation using sparse autoencoders Image created by the author using DALL-EAll things are subject to interpretation whichever interpretation prevails at a given time is a function of power and not truth. — Friedrich Nietzsche As AI systems grow in scale, it is increasingly difficult and pressing…
Decoding One-Hot Encoding: A Beginner’s Guide to Categorical Data | by Vyacheslav Efimov | Nov, 2024
Learning to transform categorical data into a format that a machine learning model can understand When studying machine learning, it is essential to understand the inner workings of the most basic algorithms. Doing so helps in understanding how algorithms operate in popular libraries and frameworks, how to debug them, choose better hyperparameters more easily, and…
Building a 28% more accurate multimodal image search engine with VLMs. Until recently, AI models were narrow in scope and limited to understanding either language or specific images, but rarely both. In this respect, general language models like GPTs were a HUGE leap since we went from specialized models to general yet much more powerful…
What working as a data scientist at various companies and industries over the past 6+ years has taught me of the future of data science and AI engineering GenAI and Large Language Models (LLMs) continue changing how we work and what work will mean in the future, especially for the data science domain, where in…
And how much I made my first year Photo by Amy Hirschi on UnsplashCongratulations, you have landed a data science position! You open your offer letter and … Well, you’re a bit disappointed. This is completely normal, at least for most companies, and especially if you are a junior or just starting out in the…
How to build ML products users love. 23 min read · Oct 14, 2024 Photo by Pavel Danilyuk: https://www.pexels.com/photo/a-robot-holding-a-flower-8438979/Data science offers rich opportunities to explore new concepts and demonstrate their viability, all towards building the ‘intelligence’ behind features and products. However, most machine learning (ML) projects fail! And this isn’t just…
Working with ODEs Physical systems can typically be modeled through differential equations, or equations including derivatives. Forces, hence Newton’s Laws, can be expressed as derivatives, as can Maxwell’s Equations, so differential equations can describe most physics problems. A differential equation describes how a system changes based on the system’s current state, in effect defining state…