It’s no secret that supervised machine learning models need to be trained on high-quality labeled datasets. However, collecting enough high-quality labeled data can be a significant challenge, especially in situations where privacy and data availability are major concerns. Fortunately, this problem can be mitigated with synthetic data. Synthetic data is data that is artificially generated…
The emergence of Large Vision-Language Models (LVLMs) characterizes the intersection of visual perception and language processing. These models, which interpret visual data and generate corresponding textual descriptions, represent a significant leap towards enabling machines to see and describe the world around us with nuanced understanding akin to human perception. A notable challenge that impedes their…
Automation, machine learning and LLMs in the chip industry (source: chatGPT)I felt like one of those guys from Monsters Inc. You know, the ones in the big yellow hazmat suits. A necessary precaution! I was entering the most complex manufacturing environment in the world. One that requires so much precision that even microscopic particulates from…
Optical character recognition (OCR) software help convert non-editable document formats such as PDFs, images, or paper documents into machine-readable formats that are editable & searchable. OCR applications are commonly used to capture text from PDFs & images and convert the text into editable formats such as Word, Excel, or a plain text file. OCR is…
Diffusion models are a set of generative models that work by adding noise to the training data and then learn to recover the same by reversing the noising process. This process allows these models to achieve state-of-the-art image quality, making them one of the most significant developments in Machine Learning (ML) in the past few…
How to Create a Speech-to-Text-to-Speech Program Image by Mariia Shalabaieva from unsplashIt’s been exactly a decade since I started attending GeekCon (yes, a geeks’ conference 🙂) — a weekend-long hackathon-makeathon in which all projects must be useless and just-for-fun, and this year there was an exciting twist: all projects were required to incorporate some form…
Image by Unsplash
A month into 2024 and I’ve never seen my LinkedIn profile so busy with ‘Open to Work’ banners and posts. 2023 was already tough, with a lot of companies announcing layoffs, and what’s even sadder is that the layoffs were quite brutal. Some were told ‘You’re fired, see you later’,…
Understand Semantic Structures with Transformers and Topic Modeling We live in the age of big data. At this point it’s become a cliche to say that data is the oil of the 21st century but it really is so. Data collection practices have resulted in huge piles of data in just about everyone’s hands. Interpreting…
Image by Author
SQL has become a must-have language for any data professional.
Most of us use SQL in our daily work, and after writing many queries, we all get our own style and have our habits, both good and bad.
SQL is usually learned by use, and in most cases, people do…
A handy reference on migrating bookmarks, terminal enhancements, and AWS Cli settings Image generated by author using midjourneyI recently received a new 16-inch MacBook Pro with the latest Apple M3 chip for my work computer. I had heard rave reviews about the blazing-fast Apple M1 and M2 chips, so I was incredibly excited to get…