Skip to content Skip to sidebar Skip to footer

Training AI Models on CPU. Revisiting CPU for ML in an Era of GPU… | by Chaim Rand | Sep, 2024

Revisiting CPU for ML in an Era of GPU Scarcity Photo by Quino Al on UnsplashThe recent successes in AI are often attributed to the emergence and evolutions of the GPU. The GPU’s architecture, which typically includes thousands of multi-processors, high-speed memory, dedicated tensor cores, and more, is particularly well-suited to meet the intensive demands…

Read More

Tackle Complex LLM Decision-Making with Language Agent Tree Search (LATS) & GPT-4o | by Ozgur Guler | Aug, 2024

Enhancing LLM decision-making: integrating language agent tree search with GPT-4o for superior problem-solving Image by the author: midjourney — abstract puzzleLarge Language Models (LLMs) have demonstrated exceptional abilities in performing natural language tasks that involve complex reasoning. As a result, these models have evolved to function as agents capable of planning, strategising, and solving complex…

Read More

The Evolution of SQL. Unlocking the power of large language… | by 💡Mike Shakhomirov | Aug, 2024

Unlocking the power of large language models Photo by ZHENYU LUO on UnsplashIn this article, I will examine how large language models (LLMs) can convert natural language into SQL, making query writing more accessible to non-technical users. The discussion will include practical examples that showcase the ease of developing LLM-based solutions. We’ll also cover various…

Read More

LLM Personalization. User Persona based Personalization of… | by Debmalya Biswas | Aug, 2024

User Persona based Personalization of LLM generated Responses ChatGPT, or the underlying large language models (LLMs) today, are able to generate contextualized responses given a prompt. As a next step in the LLM evolution, we expect the responses to be more and more personalized with respect to the persona, conversation history, current conversation context and…

Read More

Productionizing a RAG App. Adding evaluation, automated data… | by Ed Izaguirre | Aug, 2024

Adding evaluation, automated data pulling, and other improvements. From Film Search to Rosebud 🌹. Image from Unsplash.Table of Contents Introduction Offline Evaluation Online Evaluation Automated Data Pulling with Prefect Summary Relevant Links A few months ago, I released the Film Search app, a Retrieval-Augmented Generation (RAG) application designed to recommend films based on user queries.…

Read More

Using LLMs to Query PubMed Knowledge Bases for BioMedical Research | by Jillian Rowe | Jul, 2024

AI for fun and profit! Photo by 🇸🇮 Janko Ferlič on UnsplashIn this article, we’ll explore how to leverage large language models (LLMs) to search and scientific papers from PubMed Open Access Subset, a free resource for accessing biomedical and life sciences literature. We’ll use Retrieval-Augmented Generation, RAG, to search our digital library. AWS Bedrock…

Read More