Advanced techniques for beginners AI generated image using KandinskyIn this story, I would like to raise a discussion on how we transform data. Whether it’s a database, data warehouse or reporting solution we run data transformations based on data models but how do we organise them? I would like to talk about the modern data…
To start, I would choose an introduction/beginner course that I like the look of or recommendations from another person who I know has good Python skills. You might have heard me say in one of my previous posts that there is no such thing as the “right” course. While this is definitely true, some courses…
Running a multimodal LLaVA model, camera, and speech synthesis Image by Enoc Valenzuela, UnsplashModern large multimodal models (LMMs) can process not only text but also different types of data. Indeed, “a picture is worth a thousand words,” and this functionality can be crucial during the interaction with the real world. In this “weekend project,” I…
Meta’s open-source Seamless models: A deep dive into translation model architectures and a Python implementation guide using HuggingFace This post was co-authored with Rafael Guedes. The growth of an organization is not limited to its country boundaries. Some organizations only sell or operate on external markets. This globalization comes with several challenges, one being how…
And easy solutions that can immediately turn them around Photo by t Kaiser on UnsplashEvery data engineer wants to feel like they are constantly evolving as a professional and growing their technical skills. As data engineers we like to be challenged and feel we are progressing towards our end goal. This is the nature of…
In this case, assuming I am the owner of an ecommerce website. I would like to create a Chatbot, so my users can ask specific questions regarding anything about this website (price, product, service, shipping, etc.) as they are in the store. The Chatbot will be supplied with the “private knowledge” and ground its answers…
After a credit card? An insurance policy? Ever wondered about the three-digit number that shapes these decisions? Introduction Scores are used by a large number of industries to make decisions. Financial institutions and insurance providers are using scores to determine whether someone is right for credit or a policy. Some nations are even using social…
Learn how to ensure the quality of your embeddings, which can be essential for your machine-learning system. Creating quality embeddings is an essential part of most AI systems. Embeddings are the foundation on which an AI model can do its job, and creating high-quality embeddings is, therefore, an important element in making high-accuracy AI models.…
If you like or want to learn machine learning with scikit-learn, check out my tutorial series on this amazing package: Sklearn tutorial All images by author. Dummy models are very simplistic models that are meant to be used as a baseline to compare your actual models. A baseline is just some kind of reference…
The dataset used in Part 1 is simple and can be easily modeled with just a mixture of Gaussians. However, most real-world datasets are far more complex. In this part of the story, we will apply several synthetic data generators to some popular real-world datasets. Our primary focus is on comparing the distributions of maximum…