Java + LLMs: A hands-on guide to building LLM Apps in Java with Jakarta

Syed M Shaaf
Developer Advocate @ Red Hat
Technical Editor @ InfoQ
Bazlur Rahman
Java Champion 🏆
Staff Software Developer at DNAstack

https://github.com/learnj-ai/llm-jakarta @bazlur.ca @shaaf.dev

What we will cover today
• Quick introduction
• Prompts
• Memory
• Tools
• RAG (Retrieval Augmented Generation)
• Model Context Protocol

Systems, Data, Networks and a Solution?
• Systems do not speak natural language, can't translate, and lack context outside of system boundaries (e.g. sentiment).
• Generating content is costly and sometimes hard.
• Rapid data growth.
• Rising expectations: customers demand instant, personalized solutions.
• Inefficiency: manual processes increase costs and slow operations.
• Skill gaps: limited expertise in AI adoption.

Understanding the journey that brought us here...
Expert System -> Machine learning -> Deep learning -> Foundation models
• No use of data; manually authored rules; brittle; labour intensive
• Data prep, feature eng.; supervised learning, unsupervised learning, classification
• Learning without labels, adapt, tune, massive data appetite

Large Language Models
Foundation models: learning without labels, adapt, tune, massive data appetite
• Tasks
  ◦ Translation, Summarization, Writing, Q&A
• “Attention Is All You Need”, Transformer architecture
• Recognize, predict, and generate text
• Trained on billions of tokens
• Can also be tuned further
An LLM predicts the next token based on its training data and statistical deduction.
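To make "predicts the next token based on its training data" concrete, here is a minimal sketch of a bigram counter: it records which word follows which in a tiny training text and predicts the most frequent follower. This is an illustration only. Real LLMs use Transformer networks over subword tokens, and the class name `BigramPredictor` and the training sentence are made up for this example.

```java
import java.util.HashMap;
import java.util.Map;

// Toy bigram model (hypothetical, for illustration): next-word prediction from raw counts.
class BigramPredictor {
    // For each word, how often each other word was seen directly after it.
    private final Map<String, Map<String, Integer>> counts = new HashMap<>();

    void train(String text) {
        String[] words = text.toLowerCase().trim().split("\\s+");
        for (int i = 0; i + 1 < words.length; i++) {
            counts.computeIfAbsent(words[i], k -> new HashMap<>())
                  .merge(words[i + 1], 1, Integer::sum);
        }
    }

    // Returns the most frequently observed follower, or null if the word was never seen.
    String predictNext(String word) {
        Map<String, Integer> followers = counts.get(word.toLowerCase());
        if (followers == null) {
            return null;
        }
        return followers.entrySet().stream()
                .max(Map.Entry.comparingByValue())
                .map(Map.Entry::getKey)
                .orElse(null);
    }

    public static void main(String[] args) {
        BigramPredictor model = new BigramPredictor();
        model.train("java is fun and java is fast and java is everywhere");
        System.out.println(model.predictNext("java")); // prints "is"
    }
}
```

The prediction is only as good as the counts behind it, which is one way to read the later slide about not mixing accuracy with truth.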

Tokens
Tokenization: breaking down text into tokens, e.g. with Byte Pair Encoding (BPE) or WordPiece, which handle diverse languages and manage vocabulary size efficiently.
[12488, 6391, 4014, 316, 1001, 6602, 11, 889, 1236, 4128, 25, 3862, 181386, 364, 61064, 9862, 1299, 166700, 1340, 413, 12648, 1511, 1991, 20290, 15683, 290, 27899, 11643, 25, 93643, 248, 52622, 122, 279, 168191, 328, 9862, 22378, 2491, 2613, 316, 2454, 1273, 1340, 413, 73263, 4717, 25, 220, 7633, 19354, 29338, 15]
https://platform.openai.com/tokenizer
- "Running", “unpredictability” (word-based tokenization)
- "run" "ning"; “un” “predict” “ability” (subword-based tokenization, used by many LLMs)
“Build a Large Language Model (From Scratch)” - Sebastian Raschka
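The subword split on this slide can be sketched with a greedy longest-match tokenizer. This is not BPE or WordPiece as implemented in real tokenizers. It is a simplified stand-in, and the five-entry vocabulary is a hypothetical one chosen to reproduce the slide's examples. Real vocabularies are learned from data and hold tens of thousands of entries.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Greedy longest-match subword tokenizer (illustrative sketch, not real BPE/WordPiece).
class ToySubwordTokenizer {
    // Hypothetical vocabulary, picked only to mirror the examples on the slide.
    private static final Set<String> VOCAB = Set.of("un", "predict", "ability", "run", "ning");

    static List<String> tokenize(String word) {
        List<String> tokens = new ArrayList<>();
        int start = 0;
        while (start < word.length()) {
            int end = word.length();
            // Shrink the candidate until it is in the vocabulary or only one character is left.
            while (end > start + 1 && !VOCAB.contains(word.substring(start, end))) {
                end--;
            }
            tokens.add(word.substring(start, end)); // unknown text falls back to single characters
            start = end;
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("unpredictability")); // prints [un, predict, ability]
        System.out.println(tokenize("running"));          // prints [run, ning]
    }
}
```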

Amazing things, stupid mistakes
“..Do not mix accuracy with truth..”

Truth is discrete, not continuous

LangChain4j

A simple chat bot
- Basic htmx
- Chat window
- Backend sends question to the LLM
- Streaming is also an option

What's an AI Service?
- AI Services, tailored for Java
- Similar to Spring Data JPA or Retrofit
- Handle the most common operations
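The comparison with Spring Data JPA and Retrofit suggests the underlying idea: you declare an interface and something else supplies the implementation. The sketch below shows that idea with a plain JDK dynamic proxy. It does not use LangChain4j and is not how LangChain4j is implemented. `Assistant`, `create`, and the echoing `model` function are all hypothetical stand-ins.

```java
import java.lang.reflect.Proxy;
import java.util.function.Function;

// Declarative "AI service" idea, sketched with a JDK dynamic proxy (illustration only).
class AiServiceSketch {
    // Hypothetical service interface: the application only declares what it wants.
    interface Assistant {
        String chat(String userMessage);
    }

    // Builds an implementation at runtime; 'model' stands in for a real LLM client.
    static <T> T create(Class<T> serviceType, Function<String, String> model) {
        Object proxy = Proxy.newProxyInstance(
                serviceType.getClassLoader(),
                new Class<?>[] {serviceType},
                (self, method, args) -> {
                    if (method.getDeclaringClass() == Object.class) {
                        // toString/hashCode/equals are delegated to the model object itself.
                        return method.invoke(model, args);
                    }
                    // Every service method forwards its first argument as the prompt.
                    return model.apply(String.valueOf(args[0]));
                });
        return serviceType.cast(proxy);
    }

    public static void main(String[] args) {
        Assistant assistant = create(Assistant.class, prompt -> "echo: " + prompt);
        System.out.println(assistant.chat("Hello")); // prints "echo: Hello"
    }
}
```

In a real application the function passed as `model` would call an LLM, and a library would also take care of prompts, memory, and tools on the way in and out.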

Prompts
System prompt
- Define the task
- Set the expectations
- Provide examples
User prompt
- Specific to the input
When to use system vs user?
What is a good prompt?
- E.g. structure your input and output (different LLMs behave differently)
** Try not to migrate prompts across models

Reasoning
- Chain of Thought
- TOT reasoning
- Tree of Thought (Thinking, Organizing, Translating)

Few-Shot, Zero-Shot
Zero-Shot
- No data collection needed
- Fast implementation
- Lower accuracy on complex tasks
Few-Shot
- Better accuracy with minimal examples
- Adaptable to niche tasks
- Sensitive to example quality/order
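The difference between the two styles shows up in how the prompt text is assembled. A minimal sketch, assuming a simple "Input:/Output:" layout (the layout, the class name `FewShotPrompt`, and the sample texts are illustrative choices, not something the talk prescribes): with an empty example list the result is a zero-shot prompt, and with examples it becomes few-shot.

```java
import java.util.List;

// Builds a zero-shot or few-shot prompt from a task description and optional examples.
class FewShotPrompt {
    // One worked example shown to the model before the real input (hypothetical type).
    record Example(String input, String output) {}

    static String build(String task, List<Example> examples, String input) {
        StringBuilder sb = new StringBuilder(task).append("\n\n");
        for (Example e : examples) {
            sb.append("Input: ").append(e.input()).append("\n")
              .append("Output: ").append(e.output()).append("\n\n");
        }
        // The prompt ends with an open "Output:" for the model to complete.
        sb.append("Input: ").append(input).append("\nOutput:");
        return sb.toString();
    }

    public static void main(String[] args) {
        String task = "Classify the sentiment as positive or negative.";
        // Zero-shot: no examples at all.
        System.out.println(build(task, List.of(), "Great talk"));
        System.out.println("---");
        // Few-shot: example order and quality can change the answer.
        System.out.println(build(task,
                List.of(new Example("I loved it", "positive"),
                        new Example("Waste of time", "negative")),
                "Great talk"));
    }
}
```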

Chat and Memory
• Eviction policy
• Persistence
• Special treatment of SystemMessage and Tools
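One possible eviction policy is a sliding window that drops the oldest messages but never the system message. The sketch below is a plain-Java illustration of that policy, not LangChain4j's chat memory classes. `WindowChatMemory`, `Message`, and the role strings are hypothetical names, and persistence and tool messages are left out.

```java
import java.util.ArrayList;
import java.util.List;

// Sliding-window chat memory that keeps system messages (illustrative sketch).
class WindowChatMemory {
    record Message(String role, String text) {}

    private final int maxMessages;
    private final List<Message> messages = new ArrayList<>();

    WindowChatMemory(int maxMessages) {
        if (maxMessages < 1) {
            throw new IllegalArgumentException("maxMessages must be at least 1");
        }
        this.maxMessages = maxMessages;
    }

    void add(Message message) {
        messages.add(message);
        // Evict the oldest non-system message until the window fits again.
        while (messages.size() > maxMessages) {
            int victim = -1;
            for (int i = 0; i < messages.size(); i++) {
                if (!"system".equals(messages.get(i).role())) {
                    victim = i;
                    break;
                }
            }
            if (victim < 0) {
                break; // only system messages are left, nothing to evict
            }
            messages.remove(victim);
        }
    }

    List<Message> messages() {
        return List.copyOf(messages);
    }

    public static void main(String[] args) {
        WindowChatMemory memory = new WindowChatMemory(3);
        memory.add(new Message("system", "Be brief."));
        memory.add(new Message("user", "Hi"));
        memory.add(new Message("assistant", "Hello"));
        memory.add(new Message("user", "What is RAG?"));
        // The first user message has been evicted; the system message is still first.
        memory.messages().forEach(m -> System.out.println(m.role() + ": " + m.text()));
    }
}
```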

Function calling / Tools
    @Tool
    double squareRoot(double x) {
        return Math.sqrt(x);
    }
- Call other services or functions to enhance the response.
- E.g. Web APIs, internal system requests
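When a model asks for a tool by name, the application has to find the matching method and invoke it. The following sketch shows that dispatch step with plain reflection. The `@Tool` annotation here is a locally defined, hypothetical marker (LangChain4j ships its own), and `callTool` is an assumption about how such a lookup could work, limited to tools that take a single `double`.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Method;

// Reflection-based tool dispatch (illustrative sketch, single numeric argument only).
class ToolDispatchSketch {
    // Hypothetical marker annotation defined for this example.
    @Retention(RetentionPolicy.RUNTIME)
    @Target(ElementType.METHOD)
    @interface Tool {}

    static class MathTools {
        @Tool
        public double squareRoot(double x) {
            return Math.sqrt(x);
        }
    }

    // Finds the @Tool method with the requested name and invokes it with the argument.
    static Object callTool(Object tools, String name, double argument) {
        for (Method m : tools.getClass().getDeclaredMethods()) {
            if (m.isAnnotationPresent(Tool.class) && m.getName().equals(name)) {
                try {
                    m.setAccessible(true);
                    return m.invoke(tools, argument);
                } catch (ReflectiveOperationException e) {
                    throw new IllegalStateException("Tool call failed: " + name, e);
                }
            }
        }
        throw new IllegalArgumentException("Unknown tool: " + name);
    }

    public static void main(String[] args) {
        // Pretend the model answered: call tool "squareRoot" with argument 16.
        System.out.println(callTool(new MathTools(), "squareRoot", 16.0)); // prints 4.0
    }
}
```

The tool result would then be sent back to the model so it can phrase the final answer.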

MCP - Model Context Protocol
- Standardized Context Format
- Improved Interoperability
- Richer Context Representation
- Enhanced Model Grounding
https://github.com/modelcontextprotocol

Demo use case
- *My* Books API
- Books MCP Server

Retrieval Augmented Generation
“feed relevant pieces of information (chunks) from your knowledge base to an LLM along with the user's query”...
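The quoted idea has two steps: pick the chunks that are relevant to the query, then put them into the prompt next to the question. Here is a minimal sketch of both, assuming a naive word-overlap score in place of embedding similarity. Production RAG setups usually rank chunks with an embedding model and a vector store. `ToyRag`, the prompt wording, and the sample chunks are made up for illustration.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

// Retrieve-then-augment in miniature (word overlap stands in for embedding similarity).
class ToyRag {
    // Counts how many distinct query words appear in the chunk.
    static long score(String query, String chunk) {
        Set<String> chunkWords = new HashSet<>(Arrays.asList(chunk.toLowerCase().split("\\W+")));
        return Arrays.stream(query.toLowerCase().split("\\W+"))
                .distinct()
                .filter(chunkWords::contains)
                .count();
    }

    // Returns the topK chunks with the highest score, best first.
    static List<String> retrieve(String query, List<String> chunks, int topK) {
        return chunks.stream()
                .sorted(Comparator.comparingLong((String c) -> score(query, c)).reversed())
                .limit(topK)
                .collect(Collectors.toList());
    }

    // Places the retrieved chunks in front of the user's question.
    static String augment(String query, List<String> relevantChunks) {
        return "Answer using only this context:\n"
                + String.join("\n", relevantChunks)
                + "\n\nQuestion: " + query;
    }

    public static void main(String[] args) {
        List<String> knowledgeBase = List.of(
                "Jakarta EE is a set of specifications for enterprise Java.",
                "htmx lets you build dynamic pages with HTML attributes.",
                "Tokens are the units an LLM reads.");
        String query = "What is Jakarta EE?";
        System.out.println(augment(query, retrieve(query, knowledgeBase, 1)));
    }
}
```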

Retrieval Augmented Generation
- What is the representation of the data?
- How do I want to split? Per document, chapter, sentence
- How many tokens do I want to end up with?
- How much overlap is there between segments?
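The size and overlap questions can be made concrete with a small splitter. The sketch below counts whitespace-separated words as a rough stand-in for tokens, which is an assumption, since real token counts depend on the model's tokenizer. `OverlapSplitter` and its parameters are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Fixed-size splitter with overlap between neighbouring segments (words approximate tokens).
class OverlapSplitter {
    static List<String> split(String text, int maxWords, int overlap) {
        if (maxWords <= 0 || overlap < 0 || overlap >= maxWords) {
            throw new IllegalArgumentException("need maxWords > overlap >= 0");
        }
        String[] words = text.trim().split("\\s+");
        List<String> segments = new ArrayList<>();
        int step = maxWords - overlap; // how far the window moves each time
        for (int start = 0; start < words.length; start += step) {
            int end = Math.min(start + maxWords, words.length);
            segments.add(String.join(" ", Arrays.copyOfRange(words, start, end)));
            if (end == words.length) {
                break; // the last window already reached the end of the text
            }
        }
        return segments;
    }

    public static void main(String[] args) {
        // Each segment repeats the last word of the previous one.
        System.out.println(split("a b c d e f g", 4, 1)); // prints [a b c d, d e f g]
    }
}
```

More overlap lowers the risk that a fact is cut in half at a segment boundary, at the cost of storing and embedding more text.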

Retrieval Augmented Generation
- Paragraph and Sentence splitters aim for semantic coherence but have variable chunk sizes.
- Character (especially recursive) and Word splitters offer size control but risk breaking semantic meaning.
- Line splitters are for specific line-oriented formats.
- Regex splitters provide maximum flexibility for known structures.
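The trade-off between coherence and size control is easy to see with a sentence splitter. This is a minimal sketch that splits on whitespace following `.`, `!`, or `?` using a regex. It is a naive rule that will also break after abbreviations such as "e.g.", and it is not the splitter any particular library uses. `SentenceSplitter` is a hypothetical name.

```java
import java.util.ArrayList;
import java.util.List;

// Naive regex-based sentence splitter: coherent chunks, but of unpredictable length.
class SentenceSplitter {
    static List<String> split(String text) {
        List<String> sentences = new ArrayList<>();
        // Split at whitespace that directly follows sentence-ending punctuation.
        for (String part : text.trim().split("(?<=[.!?])\\s+")) {
            if (!part.isEmpty()) {
                sentences.add(part);
            }
        }
        return sentences;
    }

    public static void main(String[] args) {
        String text = "RAG retrieves chunks. It then augments the prompt with them before calling the model.";
        for (String sentence : split(text)) {
            // Each chunk is a complete sentence, yet the lengths differ a lot.
            System.out.println(sentence.length() + " chars: " + sentence);
        }
    }
}
```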

What we covered...
• Quick introduction
• Prompts
• Memory
• Tools
• RAG (Retrieval Augmented Generation)
• Model Context Protocol

Thank you!

Syed M Shaaf
Developer Advocate, Red Hat
fosstodon.org/@shaaf
sshaaf
https://www.linkedin.com/in/shaaf/
shaaf.dev
https://bsky.app/profile/shaaf.dev

Bazlur Rahman
Java Champion 🏆
Empowering Developers through Speaking 🗣 Writing ✍ Mentoring 🤝 & Community Building 🌍
Published Author 📖
Contributing Editor at InfoQ and Foojay.IO
https://x.com/bazlur_rahman
rokon12
https://www.linkedin.com/in/bazlur/
https://bazlur.ca/
https://bsky.app/profile/bazlur.ca

Source for the demo: https://github.com/learnj-ai/llm-jakarta
LangChain4J: https://docs.langchain4j.dev/
