The Future of AI: Optimism, Pessimism, and What Lies Ahead

Setting the stage. The sudden emergence of Large Language Models (LLMs) has sparked renewed interest in artificial intelligence. As usual, this has brought out both the “fanboys” who see AI as an Earth-shattering change and the “naysayers” who believe that the recent excitement around AI, as catalyzed by LLM makers like OpenAI, is just another […]

Common Enterprise GenerativeAI Use Cases With Examples

Overview As GenerativeAI has moved from research to consumer use to enterprise use, a series of enterprise use cases have come to the fore. These areas can benefit from GenAI now. Software development Generative AI is helping make software development more productive by automating code generation, documentation, and bug detection, allowing developers to focus on […]

The Rise of Compound AI Systems: Shifting Beyond Single Models

LLMs are not enough In 2023, Large Language Models (LLMs) took center stage in the AI world, showcasing their ability to perform general tasks through simple prompting. However, as we move through 2024, a significant shift is occurring in the AI landscape. State-of-the-art AI results are increasingly being achieved not by single, monolithic models, but […]

A Recap of Databricks Data & AI World Tour, 2024 – New York City

Introduction On Sept 17, Infinitive sent representatives to the Databricks Data & AI World Tour in New York City. This one-day conference is one of several conferences held in different locations around the globe where Databricks highlights their accomplishments and vision for the future. For those that attended the Databricks Data & AI Summit held […]

Infinitive Uses Retrieval Augmented Generation (RAG) F1 Scores to Guide Development

Summary Retrieval Augmented Generation (RAG) is a relatively new architecture which allows private data to be accessed through natural language queries using Large Language Model (LLM) technology. Applications like customer service have benefitted from this architecture. Swedish fintech company Klarna reports that it has implemented RAG for customer service with the RAG-based application doing the […]

Using Databricks-based RAG for Employee Onboarding at Infinitive

Summary Infinitive has finished the first phase of implementing a Retrieval Augmented Generation (RAG) application to help onboard new employees. Infinitive is a data and AI consultancy headquartered in Ashburn, Va – outside Washington, DC. Like most consulting firms, we are continually hiring new employees to meet our business growth. To help new employees understand […]

Advancing Autonomous AI Agents: How Agent Q is Revolutionizing AI Decision-Making

Summary. Artificial Intelligence (AI) has been making waves in recent years, especially with the advent of Large Language Models (LLMs) like ChatGPT and LLaMA. These models have shown incredible abilities in understanding and generating human-like text, which has opened new possibilities for their use in various fields, from customer service to complex problem-solving. However, despite […]

Is OpenAI in Trouble?

The clock is ticking. The combination of a rapid cash burn, the lack of a sustainable “moat”, and slow enterprise adoption of LLMs are causing people to question the mid-term viability of OpenAI. OpenAI’s many public controversies and leadership defections have only served to catalyze the concern for its future. Cash is king. OpenAI plans […]

Using Retrieval Augmented Generation (RAG) to Detect Cashback Fraud in Online Gaming

What is cashback fraud? Cashback fraud in online betting typically involves users exploiting promotional offers designed to refund a portion of their losses. Fraudsters might create multiple accounts to place opposing bets, ensuring that one account wins while the other loses, thereby manipulating the system to maximize the cashback rewards. This type of fraud undermines […]

OpenAI’s Project Strawberry: The Ghost of Q* (Q Star) Rises

Overview. Reuters is reporting that leaked information from inside OpenAI points to a major effort to incorporate long inference reasoning inside ChatGPT. The effort is code named Project Strawberry and seems to be very similar to the Q* (pronounced Q star) effort which was the rumored basis for Sam Altman’s very short-lived firing back in […]