← Graph

Building LLM powered applications in Ruby

talk 25 connections

Andrei Bondarev's wroclove.rb 2024 single-speaker talk. Frames generative AI as about to become a standard part of every tech stack (alongside databases, caches, queues, storage) per the a16z vision, with developers building on top of AI systems. Covers: what LLMs are and what they excel at (structuring unstructured data, summarization, classification, translation, content generation, Q&A), the shift from business logic in fat models/service objects to business logic in prompts, AI agents as autonomous LLM-powered programs using tools via function calling, and the reliability/generality trade-off (focused narrow agents are POC-ready; general-purpose reliable agents would equal AGI). Discusses slow adoption (fast-changing field, IP/copyright ambiguity, lack of tooling, risk — Air Canada chatbot lawsuit, GM chatbot agreeing to sell a car for $1), calls prompt engineering 'prompt alchemy', covers jailbreaking (Anthropic's many-shot jailbreaking paper) and hallucinations (GPT-4 trained up to April 2023). Introduces RAG: embed user query, similarity-search a vector store of proprietary docs, merge context into prompt. Explains vector embeddings (OpenAI's Ada model is 1536 dimensions), similarity metrics (Manhattan, Euclidean, cosine), evaluations via human up/down votes, LLM-as-critic using GPT-4, and the RAGAS method (faithfulness, context relevance, answer relevance). Argues against open chat bots due to prompt-injection risk — advocates closed control-panel UIs with narrowly-scoped agents. Live-codes a 'Nerds and Threads' e-commerce AI assistant using langchainrb with six tools (customer management, email service, inventory management, order management, payment gateway, shipping service) backed by SQLite, demonstrating order placement, returns, and inventory updates — each driven by a system prompt describing standard operating procedures. Closes on why Ruby (pragmatism, OOP principles, Sandi Metz's POODR, ability to port Python libs via ChatGPT) and open-source maintenance lessons (be responsive, friendly, helpful). Q&A explores generating executable Ruby code once vs. instructing the LLM per request.

type
talk
talk Building LLM powered applications in Ruby
about
Talk centres on langchainrb as the Ruby solution for LLM-powered apps.
talk Building LLM powered applications in Ruby
about
Generative AI concept
Introduces generative AI as the context for Ruby applications.
talk Building LLM powered applications in Ruby
about
Explains LLMs and their strengths.
talk Building LLM powered applications in Ruby
about
Transformers concept
Names Transformers as the underlying LLM architecture.
talk Building LLM powered applications in Ruby
about
AI Agent concept
Covers AI agents, function calling, and the focus/reliability trade-off.
talk Building LLM powered applications in Ruby
about
Explains naive RAG and advanced multi-index strategies.
talk Building LLM powered applications in Ruby
about
Describes embedding models and 1536-dimensional vector spaces.
talk Building LLM powered applications in Ruby
about
Vector Database concept
Vector DBs used as the similarity-search substrate for RAG.
talk Building LLM powered applications in Ruby
about
RAGAS concept
Introduced as a quantitative way to evaluate RAG systems.
talk Building LLM powered applications in Ruby
about
Prompt Alchemy concept
Andrei renames 'prompt engineering' to 'prompt alchemy' and argues it's not engineering.
talk Building LLM powered applications in Ruby
about
Jailbreaking concept
Covers jailbreaking techniques including many-shot jailbreaking.
talk Building LLM powered applications in Ruby
about
Hallucinations concept
Discusses hallucinations and knowledge cut-offs as motivation for RAG.
talk Building LLM powered applications in Ruby
about
Live-coding demo central to the talk.
talk Building LLM powered applications in Ruby
about
Names the 2017 Google paper that kicked off modern LLMs.
talk Building LLM powered applications in Ruby
about
References Anthropic's many-shot jailbreaking paper.
talk Building LLM powered applications in Ruby
about
Cites a16z's vision of generative AI as a core tech-stack component.
talk Building LLM powered applications in Ruby
about
Air Canada company
Air Canada chatbot lawsuit used as a cautionary example.
talk Building LLM powered applications in Ruby
about
General Motors company
GM chatbot agreeing to sell a car for $1 used as a cautionary example.
asked_at
Building LLM powered applications in Ruby talk
Audience question during the Q&A.
authored
Building LLM powered applications in Ruby talk
Andrei delivered this single-speaker talk.
from_talk
Building LLM powered applications in Ruby talk
Takeaway drawn from Andrei's proposed shift of business logic into prompts.
from_talk
Building LLM powered applications in Ruby talk
Andrei's central argument about agent reliability.
from_talk
Building LLM powered applications in Ruby talk
Andrei's practical recommendation to Ruby developers missing Python libraries.
from_talk
Building LLM powered applications in Ruby talk
Lessons Andrei shared from running langchainrb as open source.
talk Building LLM powered applications in Ruby
presented_at
Talk given at wroclove.rb 2024 in Wrocław.

Provenance

Created
2026-04-17 16:17 seed
Read by
77 extractions