How Meta-CoT enhances system 2 reasoning for complex AI challenges Image created by the author using Generative AI (Flux-pro)What makes a language model smart? Is it predicting the next word in a sentence ‒ or handling tough reasoning tasks that challenge even bright humans? Today’s Large Language Models (LLMs) create smooth text plus solve simple…
Practical insights for a data-driven approach to model optimization Photo by Joshua Sortino on UnsplashIt is said that in order for a machine learning model to be successful, you need to have good data. While this is true (and pretty much obvious), it is extremely difficult to define, build, and sustain good data. Let me…
Data Science Explore the power of regex and save time in data analysis Image created by Author using Meta AIData is rarely clean and never in the required structure!! Whether you are starting with data science or are an experienced professional — You won’t deny the above statement! In a data analyst’s career extracting actionable…
Polynomial Fit in Python with NumPy Ball Tracking and Trajectory PredictionIn a previous project I visualized the trajectory of a ball that I threw vertically into the air with a real-time position, velocity and acceleration plot. Extending upon this project, I wanted to calculate and visualize a trajectory prediction based on a simple physics model.…
Using Qwen2.5–7B-Instruct powered code agents to create a local, open source, multi-agentic RAG system Photo by Jaredd Craig on UnsplashLarge Language Models have shown impressive capabilities and they are still undergoing steady improvements with each new generation of models released. Applications such as chatbots and summarisation can directly exploit the language proficiency of LLMs as…
Discover how to set up an efficient MLflow environment to track your experiments, compare and choose the best model for deployment Training and fine-tuning various models is a basic task for every computer vision researcher. Even for easy ones, we do a hyper-parameter search to find the optimal way of training the model over our…
My experience using VSCode (GitHub Copilot) and Cursor (Claude 3.5 Sonnet) as a Data Scientist. Image artificially generated using FLUX.1 by Black Forest Labs (via Grok 2).As developers, we’re constantly searching for tools to enhance our productivity and make coding more enjoyable. I have been using Visual Studio Code (VSCode) for over six years, it…
Let’s dive into the most important libraries in R and Python to visualise data and create different charts, and what the pros and cons are Being a pro in certain programming languages is the goal of every aspiring data professional. Reaching a certain level in one of the countless languages is a critical milestone for…
If you have been a data scientist for a while, sooner or later you’ll notice that your day-to-day has shifted from a VSCode-loving, research paper-reading, git-version-committing data scientist to a collaboration-driving, project-scoping, stakeholder-managing, and strategy-setting individual. This shift will be gradual and almost unnoticeable but one that will require you to put on different hats…
Implementing Speculative and Contrastive Decoding Large Language models are comprised of billions of parameters (weights). For each word it generates, the model has to perform computationally expensive calculations across all of these parameters. Large Language models accept a sentence, or sequence of tokens, and generate a probability distribution of the next most likely token. Thus,…