Session 9: Evaluating the Generation Part of RAG Pipelines
The ninth session of *AI Talks* focused on evaluating the generation component of RAG pipelines. It highlighted key challenges such as hallucinations, coherence, and response latency, and stressed that both the retrieval and generation stages need to be assessed. The session explored a range of evaluation methods, including automated metrics (BLEU, ROUGE, GPT-4-based scoring), human evaluation, A/B testing, and automation tools such as OpenAI Evals, alongside ethical considerations for fairness and bias mitigation.
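As a concrete illustration of the automated metrics mentioned above, the sketch below scores a generated answer against a reference with BLEU and ROUGE. It is a minimal example, assuming the `nltk` and `rouge-score` Python packages; the sample strings are illustrative and were not part of the session.

```python
# Minimal sketch: BLEU and ROUGE for a single RAG answer.
# Assumes `pip install nltk rouge-score`; the reference/candidate
# strings are illustrative placeholders.
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction
from rouge_score import rouge_scorer

reference = "Paris is the capital of France."
candidate = "The capital of France is Paris."

# BLEU: n-gram precision of the candidate against the reference.
# Smoothing avoids zero scores when higher-order n-grams don't match.
bleu = sentence_bleu(
    [reference.split()],
    candidate.split(),
    smoothing_function=SmoothingFunction().method1,
)

# ROUGE: recall-oriented overlap; ROUGE-L uses the longest common subsequence.
scorer = rouge_scorer.RougeScorer(["rouge1", "rougeL"], use_stemmer=True)
rouge = scorer.score(reference, candidate)

print(f"BLEU: {bleu:.3f}")
print(f"ROUGE-1 F1: {rouge['rouge1'].fmeasure:.3f}")
print(f"ROUGE-L F1: {rouge['rougeL'].fmeasure:.3f}")
```

GPT-4-based scoring (often called LLM-as-judge) replaces fixed n-gram overlap with a model-graded rubric, which is closer to what tools like OpenAI Evals automate. The sketch below shows one plausible shape, assuming the `openai` Python client and an `OPENAI_API_KEY` in the environment; the `judge_faithfulness` helper, the 1-to-5 rubric, and the model name are assumptions for illustration, not the session's exact setup.

```python
# Hedged sketch: model-graded faithfulness scoring for a RAG answer.
# The judge model rates whether the answer is supported by the
# retrieved context; a low score flags a likely hallucination.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def judge_faithfulness(question: str, context: str, answer: str) -> str:
    # Rubric and model name are illustrative choices.
    prompt = (
        "Rate from 1 to 5 how faithfully the answer is supported by the "
        "context. Reply with the number only.\n\n"
        f"Question: {question}\nContext: {context}\nAnswer: {answer}"
    )
    resp = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}],
        temperature=0,  # deterministic grading
    )
    return resp.choices[0].message.content.strip()
```

In this sketch, a rating of 1 would surface an unsupported answer, tying the automated check back to the session's headline concern with hallucinations.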