on-premise)
→ Need to tune the latency to make the model faster
⚙ Customization & fine-tuning
→ No lock-in to a particular model
🔒 Security compliance & data residency / privacy
Semantic Kernel is an SDK that integrates Large Language Models (LLMs) from providers like OpenAI, Azure OpenAI, and Hugging Face with conventional programming languages like C#, Python, and Java. https://github.com/microsoft/semantic-kernel
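A minimal Python sketch of the typical Semantic Kernel flow: register a chat-completion service on a kernel, then invoke a prompt template. The SDK's API has changed across versions, so treat the exact class and method names below as assumptions based on the 1.x Python package; the model name is a placeholder.

```python
# Minimal sketch, assuming the 1.x-style semantic-kernel Python SDK
# (pip install semantic-kernel) and OPENAI_API_KEY set in the environment.
# Exact names may differ in other SDK versions.
import asyncio

from semantic_kernel import Kernel
from semantic_kernel.connectors.ai.open_ai import OpenAIChatCompletion
from semantic_kernel.functions import KernelArguments


async def main() -> None:
    kernel = Kernel()
    # Register an OpenAI chat model as the kernel's AI service.
    kernel.add_service(OpenAIChatCompletion(ai_model_id="gpt-4o-mini"))

    # Invoke a prompt template; {{$input}} is filled from KernelArguments.
    result = await kernel.invoke_prompt(
        "Summarize in one sentence: {{$input}}",
        arguments=KernelArguments(input="Semantic Kernel connects LLMs to code."),
    )
    print(result)


asyncio.run(main())
```

The same pattern works with the Azure OpenAI or Hugging Face connectors: only the registered service changes, not the calling code.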
vLLM: LLM serving for everyone. vLLM is fast with:
✅ State-of-the-art serving throughput
✅ Efficient management of attention key and value memory with PagedAttention
✅ Continuous batching of incoming requests
✅ Fast model execution with CUDA/HIP graphs
✅ Quantization: GPTQ, AWQ, SqueezeLLM, FP8 KV Cache
✅ Optimized CUDA kernels
https://github.com/vllm-project/vllm
[Figure: throughput comparison (higher is better)]
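To give a feel for the API, here is a minimal offline-inference sketch following vLLM's quickstart pattern. The model name facebook/opt-125m is just a small placeholder; swap in any model vLLM supports.

```python
# Minimal vLLM offline-inference sketch (pip install vllm).
# Runs best on a CUDA- or ROCm-capable GPU.
from vllm import LLM, SamplingParams

prompts = [
    "The capital of France is",
    "PagedAttention improves LLM serving by",
]
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# The engine batches these prompts internally (continuous batching) and
# manages the attention KV cache with PagedAttention.
llm = LLM(model="facebook/opt-125m")
outputs = llm.generate(prompts, sampling_params)

for output in outputs:
    print(output.prompt, "->", output.outputs[0].text)
```

For serving over HTTP instead of offline batches, the same engine is exposed via `vllm serve <model>`, which provides an OpenAI-compatible API endpoint.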