Upcoming

AI Talks

November 1, 2024

Tech Hub Conference Room, San Francisco

Weekly discussions on the latest advancements in artificial intelligence and machine learning. Join us every Saturday at 6:00 PM for presentations, discussions, and networking with fellow AI enthusiasts. Each session focuses on a specific topic in AI, from foundational concepts to cutting-edge research. Sessions are led by experts in the field and include time for Q&A and open discussion.

Session 13: DeepSeek

February 1, 2025

DeepSeek-R1, an open-source large language model developed by DeepSeek AI Lab, has quickly gained traction due to its innovative use of pure Reinforcement Learning (RL) instead of Supervised Fine-Tuning (SFT), enabling autonomous reasoning improvements. With strong performance benchmarks, cost-effective training, and local deployment capabilities, DeepSeek-R1 presents a competitive alternative to larger commercial models while also facing challenges like jailbreaking vulnerabilities.

Generative UIAdaptive InterfacesAI-Driven Design

Session 12: Generative UI

January 25, 2025

Generative UI leverages AI to create dynamic, personalized user interfaces in real time, replacing traditional static designs with adaptive layouts that respond to user context and goals. While this approach enhances usability and scalability, it also introduces challenges like privacy concerns, infrastructure demands, and potential usability issues due to frequent UI changes.

TokenizationSubword TokenizationByte Pair Encoding (BPE)

Session 11: Tokenization

January 18, 2025

The 11th session of AI Talks explored Tokenization in NLP, covering word, subword, character, and sentence tokenization, along with their advantages and limitations. Special focus was given to Byte Pair Encoding (BPE) and SentencePiece, highlighting their role in handling out-of-vocabulary words and creating efficient vocabularies for diverse NLP applications.

Multi-Agent SystemsAI CollaborationAutomation

Session 10: Multi-Agent Frameworks, Let agents talk

January 10, 2025

In our 10th session of "AI Talks," we explored multi-agent frameworks, their structures, and their role in enabling agents to collaborate efficiently for automation and decision-making. We reviewed key frameworks like LangGraph, LlamaIndex Workflow, Eliza, and OpenAI Swarm, along with a supervisor-driven flow-based architecture, highlighting how these systems enhance AI capabilities by optimizing task distribution and coordination.

RAG evaluationAutomated metricsEthical considerations

Session 9: Evaluating the Generation Part of RAG Pipelines

January 4, 2025

The ninth session of *AI Talks* focused on evaluating the generation component of RAG pipelines, highlighting key challenges like hallucinations, coherence, and response latency while stressing the need to assess both retrieval and generation stages. Various evaluation methods, including automated metrics (BLEU, ROUGE, GPT-4-based scoring), human evaluation, A/B testing, and automation tools like OpenAI evals, were explored alongside ethical considerations for fairness and bias mitigation.

Retrieval QualityEvaluation MetricsRelevancy Analysis

Session 8: Evaluating RAG pipelines

December 28, 2024

RAG pipeline evaluation involves assessing retrieval and generation phases using traditional IR metrics (accuracy, precision, recall) and NLP evaluation methods (BLEU, ROUGE, METEOR). Evaluation is conducted through offline (static dataset) and online (live feedback) methods with similarity and relevancy score analysis used to ensure retrieval quality, leveraging statistical measures and LLM-based assessments for reliability and alignment.

Long-term memoryMemGPT frameworkLangChain memory

Session 7: Building stateful LLM applications

December 14, 2024

The discussion covered the distinction between short-term and long-term memory in LLMs, emphasizing their role in maintaining continuity and personalization. It explored the MemGPT framework’s memory management approach and LangChain’s categorization of long-term memory into Semantic, Episodic, and Procedural types for efficient information processing.

Multimodal RetrievalColPaliVision-Language Models

Session 6: Retrieval with Vision Language Model

December 7, 2024

The discussion covered Chain of Thought (CoT) reasoning, which enhances LLM problem-solving by breaking down complex queries into logical steps, and function calling, which enables LLMs to interact with external systems like APIs and databases. It concluded with an exploration of four function calling types, addressing challenges like latency, error handling, and security, reinforced through hands-on coding demonstrations.

Prompting TechniquesAI Model ParametersChain-of-Thought (CoT)

Session 5: Prompt Engineering

November 30, 2024

The promptingguide.ai documentation, as discussed in our AI Talks meeting, covers key parameters for controlling AI outputs, such as Temperature and Top P for randomness, frequency and presence penalties for repetition, and basic constraints like max length and stop sequences. It also emphasizes clear and precise prompt design, using separators and direct instructions, while highlighting few-shot prompting limitations and introducing chain-of-thought (CoT) prompting for complex reasoning tasks in large models.

AIRAGCoT

Session 4: Function Calling

November 22, 2024

AILLMInformation Retrieval

Session 3: RAG, Retrieval & Generation

November 15, 2024

The meeting covered Retrieval Augmented Generation (RAG) as a method to enhance Large Language Models (LLMs) with up-to-date knowledge by integrating retrieval and generation processes. Key aspects included data ingestion with quality control, retrieval using dense, sparse, and hybrid methods, and response generation optimized through prompt engineering and evaluation metrics for accuracy and contextual relevance.

AIRAG

Session 1: Retrieval Augmented Generation (RAG)

November 1, 2024

The meeting covered the necessity of RAG in addressing generative model limitations like outdated knowledge, hallucinations, and private data leaks, explaining how a retriever-generator framework enhances response accuracy. It also explored technical implementation, applications across different modalities, and challenges such as memory constraints and data ingestion, highlighting key tools like FAISS, Qdrant, LlamaIndex, and LangChain for effective RAG deployment.

AIRAG

Session 2: RAG, Data Ingestion

November 8, 2024

This session covered data preparation in RAG pipelines, emphasizing the ingestion phase for cleaning, organizing, and enriching data with metadata to enhance retrieval and embedding. Key topics included classical data cleaning methods, such as deduplication and text preprocessing, along with various chunking strategies like fixed-length, semantic, and dynamic chunking to optimize embedding efficiency while preserving context.

AI Talks

Related Content

Session 13: DeepSeek

Session 12: Generative UI

Session 11: Tokenization

Session 10: Multi-Agent Frameworks, Let agents talk

Session 9: Evaluating the Generation Part of RAG Pipelines

Session 8: Evaluating RAG pipelines

Session 7: Building stateful LLM applications

Session 6: Retrieval with Vision Language Model

Session 5: Prompt Engineering

Session 4: Function Calling

Session 3: RAG, Retrieval & Generation

Session 1: Retrieval Augmented Generation (RAG)

Session 2: RAG, Data Ingestion