
Krakow - Elastic Red Hat Meetup

Shaaf Syed
May 15, 2025


The talk explores how to build and deploy a Retrieval Augmented Generation (RAG) application using OpenShift AI and Podman AI Lab. Shaaf introduces the Podman AI Lab extension, demonstrating how it enables local model testing before deployment to production environments. The session covers vector search fundamentals, showing how Elasticsearch handles vector embeddings for efficient information retrieval. He also discusses RAG use cases, showcasing how this approach enhances chatbot responses by combining LLMs with real-time data. Finally, he walks through a live demo of deploying a chatbot on OpenShift AI, integrating Elasticsearch for vector search using LangChain. Attendees will gain a practical understanding of vector-based applications and how to bring them to production.
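The retrieval half of RAG boils down to ranking stored embeddings by similarity to a query embedding. The sketch below illustrates that idea with cosine similarity over toy vectors; all names and vectors here are illustrative assumptions, not code from the demo, and in the actual setup the embeddings would come from a model and the search would run inside Elasticsearch.

```python
import math

# Toy document embeddings. In the real application these would be
# produced by an embedding model and stored in Elasticsearch
# dense_vector fields; here they are hand-picked for illustration.
DOCS = {
    "podman": [0.9, 0.1, 0.0],
    "elasticsearch": [0.1, 0.9, 0.2],
    "openshift": [0.2, 0.2, 0.9],
}

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieve(query_vec, k=2):
    """Return the k document keys most similar to the query vector."""
    ranked = sorted(DOCS, key=lambda d: cosine(query_vec, DOCS[d]), reverse=True)
    return ranked[:k]

# A query vector close to the "elasticsearch" embedding ranks it first:
print(retrieve([0.0, 1.0, 0.1]))  # -> ['elasticsearch', 'openshift']
```

The retrieved documents would then be stuffed into the LLM prompt as context, which is what lets the chatbot answer from real-time data rather than from its training set alone.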


Transcript

  1. Building and deploying AI-infused apps with Elasticsearch using Podman

    and OpenShift AI. Syed M Shaaf, Sr. Principal Developer Advocate, Red Hat
  2. Building and deploying AI-infused applications • Java developer,

    advocate, architect, engineer… • Open source enthusiast, contributor • InfoQ Java Technical Editor • Ask me about #Java, backends, architecture, containers… • Trainer, coach fosstodon.org/@shaaf sshaaf https://www.linkedin.com/in/shaaf/ shaaf.dev https://bsky.app/profile/shaaf.dev
  3. @shaaf.dev • Systems do not speak natural language, cannot translate,

    and lack context outside of system boundaries (e.g. sentiment). • Generating content is costly and sometimes hard. • Rapid data growth. • Rising expectations: customers demand instant, personalized solutions. • Inefficiency: manual processes increase costs and slow operations. • Skill gaps: limited expertise in AI adoption. Systems, Data, Networks and a Solution?
  4. @shaaf.dev Foundation models Learning without labels, adapt, tune, massive data

    appetite • Tasks ◦ Translation, summarization, writing, Q&A • “Attention Is All You Need”, Transformer architecture • Recognize, predict, and generate text • Trained on billions of words • Can also be tuned further An LLM predicts the next token based on its training data and statistical deduction Large Language Models
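The "predicts the next token from training data and statistics" idea can be sketched with a tiny bigram model. This is a deliberate simplification with made-up example text: a real LLM learns a transformer over billions of tokens, but in both cases the next token comes from a probability distribution estimated from training data.

```python
from collections import Counter, defaultdict

# Count which token follows which in a toy training text.
corpus = "the cat sat on the mat the cat ran".split()

successors = defaultdict(Counter)
for cur, nxt in zip(corpus, corpus[1:]):
    successors[cur][nxt] += 1

def predict_next(token):
    """Return the most frequent successor of `token` in training."""
    return successors[token].most_common(1)[0][0]

# "cat" follows "the" twice in the corpus, "mat" only once:
print(predict_next("the"))  # -> cat
```

An LLM does the same kind of statistical deduction, except the distribution is conditioned on the whole preceding context via attention rather than on a single previous token.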
  5. @shaaf.dev Tokens Tokenization: breaking down text into tokens, e.g. Byte

    Pair Encoding (BPE) or WordPiece; these handle diverse languages and manage vocabulary size efficiently. [12488, 6391, 4014, 316, 1001, 6602, 11, 889, 1236, 4128, 25, 3862, 181386, 364, 61064, 9862, 1299, 166700, 1340, 413, 12648, 1511, 1991, 20290, 15683, 290, 27899, 11643, 25, 93643, 248, 52622, 122, 279, 168191, 328, 9862, 22378, 2491, 2613, 316, 2454, 1273, 1340, 413, 73263, 4717, 25, 220, 7633, 19354, 29338, 15] https://platform.openai.com/tokenizer “Running”, “unpredictability” (word-based tokenization). Or: “run” “ning”; “un” “predict” “ability” (subword-based tokenization, used by many LLMs). “Building Large Language Models from scratch” - Sebastian Raschka
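The subword splits on the slide ("run" "ning"; "un" "predict" "ability") can be reproduced with a greedy longest-match tokenizer in the spirit of WordPiece. The vocabulary below is hand-picked for illustration; real vocabularies are learned from data (e.g. via BPE merges) and map each piece to an integer ID like the ones shown above.

```python
# Hand-picked subword vocabulary, purely for illustration.
VOCAB = {"un", "predict", "ability", "run", "ning", "ing", "a"}

def tokenize(word):
    """Split a word into the longest matching vocabulary pieces, left to right."""
    pieces, i = [], 0
    while i < len(word):
        for j in range(len(word), i, -1):  # try the longest substring first
            if word[i:j] in VOCAB:
                pieces.append(word[i:j])
                i = j
                break
        else:
            pieces.append(word[i])  # unknown character: emit as-is
            i += 1
    return pieces

print(tokenize("unpredictability"))  # -> ['un', 'predict', 'ability']
print(tokenize("running"))           # -> ['run', 'ning']
```

Subword splitting is what keeps vocabulary size bounded while still covering rare and novel words: any string can be expressed as known pieces, falling back to single characters in the worst case.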
  6. Thank you! Source for the demo https://github.com/sshaaf/gpt-java-chatbot Syed

    M Shaaf Developer Advocate Red Hat fosstodon.org/@shaaf sshaaf https://www.linkedin.com/in/shaaf/ shaaf.dev https://bsky.app/profile/shaaf.dev