Data Science – Page 7 – The Ai Innovation

Skip to content Skip to sidebar Skip to footer

Bayesian Data Science: The What, Why, and How | by Samvardhan Vishnoi | Apr, 2024

Data ScienceFebruary 22, 2025123Views 0Likes 0Comments

Choosing between frequentist and Bayesian approaches is the great debate of the last century, with a recent surge in Bayesian adoption in the sciences. Number of articles referring Bayesian statistics in sciencedirect.com (April 2024) — Graph by the authorWhat’s the difference? The philosophical difference is actually quite subtle, where some propose that the great bayesian…

(Un)Objective Machines: A Look at Historical Bias in Machine Learning | by Gretel Tan | Apr, 2024

Data ScienceFebruary 22, 2025129Views 0Likes 0Comments

A deep dive into biases in machine learning, with a focus on historical (or social) biases. Humans are biased. To anyone who has had to deal with bigoted individuals, unfair bosses, or oppressive systems — in other words, all of us — this is no surprise. We should thus welcome machine learning models which can…

Using Clustering Algorithms for Player Recruitment | by Pol Marin | Apr, 2024

Data ScienceFebruary 22, 202596Views 0Likes 0Comments

Sports Analytics Which players could help Fulham overcome their major flaws? Photo by Mario Klassen on UnsplashSome days ago, I was fortunate to be able to participate in a football analytics hackathon that was organized by xfb Analytics[1], Transfermarkt[2], and Football Forum Hungary[3]. As we recently received permissions to share our work, I decided to…

Feature Engineering with Microsoft Fabric and PySpark | by Roger Noble | Apr, 2024

Data ScienceFebruary 22, 2025115Views 0Likes 0Comments

Fabric Madness part 2 Image by author and ChatGPT. “Design an illustration, focusing on a basketball player in action, this time the theme is on using pyspark to generate features for machine leaning models in a graphic novel style” prompt. ChatGPT, 4, OpenAI, 4 April. 2024. https://chat.openai.com.A Huge thanks to Martim Chaves who co-authored this…

Need for Speed: cuDF Pandas vs. Pandas | by Thomas Reid | Apr, 2024

Data ScienceFebruary 22, 2025112Views 0Likes 0Comments

A comparative overview What is cuDF Pandas? If you’re a user of the Pandas library in Python, and you want or need to maximise your program run times, then you have a few options available to you. Most of these options revolve around the use of external libraries that supplant existing Pandas operations and are…

Computational Thinking is What You Need | by Louis Chan | Mar, 2024

Data ScienceFebruary 22, 202590Views 0Likes 0Comments

Missing puzzle piece to LLM Enterprise Augmentation Since early last year, when we led the development of an enterprise-level GenAI-as-a-service platform, we have understandably been bombarded with questions like “What are the art of possibles for …” or “Can LLM do …” In this blog post, we will dive into a critical skill that will…

How to Improve your RFM Model in BigQuery

Data ScienceFebruary 22, 2025114Views 0Likes 0Comments

Advanced strategies for better customer insights The RFM (Recency, Frequency, Monetary) model, with its simplicity and ease of implementation, remains a great tool for customer relationship management, offering valuable insights into customer behaviour. Building on the groundwork from my previous article “How to Create an RFM Model in BigQuery”, in this article, we will explore…

Understanding the Spare Mixture of Experts (SMoE) Layer in Mixtral | by Matthew Gunton | Mar, 2024

Data ScienceFebruary 22, 2025115Views 0Likes 0Comments

Let’s begin with the idea of an ‘expert’ in this context. Experts are feed-forward neural networks. We then connect them to our main model via gates that will route the signal to specific experts. You can imagine our neural network thinks of these experts as simply more complex neurons within a layer. Figure 1 from…

How Lucky is a Bowl of Lucky Charms? | by G. Jay Kerns | Mar, 2024

Data ScienceFebruary 22, 202596Views 0Likes 0Comments

tl;dr version: A team of students helped design and carry out an experiment to determine whether bowls of Lucky Charms are equally “lucky” over the course of a box of cereal. Turns out, not so much. We estimate a decrease of approximately 2.7 total charms per additional bowl on average. This corresponds to more than…

Using Causal Graphs to answer causal questions | by Ryan O’Sullivan | Jan, 2024

Data ScienceFebruary 22, 2025108Views 0Likes 0Comments

Causal AI, exploring the integration of causal reasoning into machine learning This article gives a practical introduction to the potential of causal graphs. It is aimed at anyone who wants to understand more about: What causal graphs are and how they work A worked case study in Python illustrating how to build causal graphs How…