Video generation has rapidly become a focal point in artificial intelligence research, especially in generating temporally consistent, high-fidelity videos. This area involves creating video sequences that maintain visual coherence across…
Building a 28% more accurate multimodal image search engine with VLMs. Until recently, AI models were narrow in scope and limited to understanding either language or specific images, but rarely…
Understanding and analyzing long videos has been a significant challenge in AI, primarily due to the vast amount of data and computational resources required. Traditional Multimodal Large Language Models (MLLMs)…
Technologies
What working as a data scientist at various companies and industries over the past 6+ years has taught me about the future of data science and AI engineering. GenAI and…