Vector and GraphRAG: Accuracy and Explainability in GenAI Applications

Vector and GraphRAG: Accuracy and Explainability in GenAI Applications
Jennifer Reif
[email protected] @JMHReif github.com/JMHReif jmhreif.com linkedin.com/in/jmhreif

Who is Jennifer Reif?
Developer Advocate, Neo4j
• Continuous learner
• Technical speaker
• Tech blogger, podcaster
• Other: geek

Negative AI stories
Even well-respected companies get it wrong
• Hallucinating non-existent policy, legal cases
• Chatbot produces Python
• Legally binding vehicle offer
• Harmful health advice
• Threatening users
• Inventing new language
• Illegal activities (insider trading + local health laws)

Standalone LLM
Often doesn’t work
• Design:
  • human-consumable output
  • creative variation (probabilistic answers)
• Problems:
  • too little detail, vague prompt
  • missing information (recent or private knowledge)
  • probabilistic ~= inconsistent

How do we avoid this?
Add as much context as possible
• Guides LLM to relevant ideas and content
• Focuses / narrows search area
• Adds to LLM knowledge
• Reduces margin of error
Photo by Ali Alauda on Unsplash

Demo - chatgpt.com

Who is Jennifer Reif?
Public info IS available…

Let’s try Jennifer’s username
Should be publicly findable???

Adding some context…
Pointing to a specific source as a guide

Giving it a reference produces better results
Now it knows where to look and what to look for

Layers of AI
• More layers = better result
• Complexity vs value

How to provide context
…that we already have!
• Can’t stuff prompt with everything
• Dynamic information
• Plug and play with existing data
• High-quality data

RAG architecture
• Retrieval
  • Data retrieved from external source
• Augmented
  • Augments response with facts
• Generation
  • Response in natural language
[Diagram labels: User, Prompt, LLM Chat API, Database Search, Database, Relevant Results / Documents, Prompt + Relevant Information, LLM API, Response; steps numbered 1-3]
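The three stages can be sketched in a few lines of Python. This is a minimal illustration, not the demo's code: the in-memory `DOCUMENTS` list stands in for the external database, word overlap stands in for a real search, and `echo_llm` is a hypothetical stub where a call to an actual LLM API would go.

```python
import re
from typing import Callable

# Hypothetical in-memory "database" standing in for the external source.
DOCUMENTS = [
    "Jennifer Reif is a Developer Advocate at Neo4j.",
    "Pinecone is a vector database.",
    "Neo4j is a graph database that also supports vector indexes.",
]

def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of words."""
    return set(re.findall(r"\w+", text.lower()))

def retrieve(question: str, documents: list[str], top_k: int = 2) -> list[str]:
    """Retrieval: rank documents by word overlap with the question."""
    question_words = tokenize(question)
    scored = [(len(question_words & tokenize(doc)), doc) for doc in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in scored[:top_k] if score > 0]

def augment(question: str, context: list[str]) -> str:
    """Augmented: combine the prompt with the relevant information."""
    facts = "\n".join(f"- {doc}" for doc in context)
    return f"Answer using only these facts:\n{facts}\n\nQuestion: {question}"

def rag_answer(question: str, llm: Callable[[str], str]) -> str:
    """Generation: send the augmented prompt to the LLM."""
    return llm(augment(question, retrieve(question, DOCUMENTS)))

def echo_llm(prompt: str) -> str:
    """Stub LLM (assumption): returns the prompt so the flow is visible."""
    return prompt

print(rag_answer("Who is Jennifer Reif?", echo_llm))
```

Swapping `retrieve` for a vector or graph query changes the retrieval source without touching the other two stages.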

Another layer
Agentic systems
• Multiple agents / tools
• LLM decides which to use (and order)
• Range: automation -> autonomous
https://www.anthropic.com/engineering/building-effective-agents

Retrieval source options
• Vector database
• Relational (+ vectors)
• NoSQL (+ vectors)
• Graph (+ vectors)
• Other: Directories / Websites / etc

Our demo
Book recommendations
• Plug LLM into our curated book+review data set
• Book descriptions -> User reviews

Vector dbs

Embeddings / Vectors
Convert data to a point in space
• Series of numbers
• 100s or 1000s of dimensions
• Dimension = interesting feature / characteristic

How do we search the vectors?
Similarity search
• Expensive queries (compare to every vector)
• Approximate nearest neighbor (k-ANN)
• Proximity in vector space
• Example: Library
  • Book classification - genre vs location of plot
• Smaller search set = smaller retrieval time!
Photo by Martin Adams on Unsplash
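To make the "compare to every vector" cost concrete, here is a minimal brute-force similarity search in plain Python. The three-dimensional vectors and their names are made up for illustration; real embeddings have hundreds or thousands of dimensions, and approximate nearest neighbor indexes exist precisely to avoid this full scan.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k(query: list[float], vectors: dict[str, list[float]], k: int = 2):
    """Exact search: score the query against every stored vector."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in vectors.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]

# Toy embeddings (assumption): each dimension is a made-up feature.
BOOK_VECTORS = {
    "space opera": [0.9, 0.1, 0.0],
    "hard sci-fi": [0.8, 0.3, 0.1],
    "cookbook": [0.0, 0.1, 0.9],
}

nearest = top_k([1.0, 0.2, 0.0], BOOK_VECTORS, k=2)
print([name for name, score in nearest])
```

The work grows linearly with the number of stored vectors, which is why narrowing the search set reduces retrieval time.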

Pinecone: sample data
Document{
  id='10609bf6-b358-449e-a4d1-8a6f2a2f805d',
  text='As always a Page turner. may me think about where we are in our tech evolution.',
  media='null',
  metadata={ rating=4.0, book_id=18505765, distance=0.58059925 },
  score=0.41940075159072876
}
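One way to read the sample record: its `distance` and `score` values add up to 1 within rounding, which is consistent with a score computed as 1 - distance. That is an observation about this one record, not a documented guarantee, and the helper name below is hypothetical.

```python
def score_from_distance(distance: float) -> float:
    """Convert a distance to a similarity score, assuming score = 1 - distance."""
    return 1.0 - distance

# Values copied from the sample record above.
distance = 0.58059925
score = 0.41940075159072876

# Check whether the two values agree to within rounding error.
difference = abs(score_from_distance(distance) - score)
print(difference < 1e-6)
```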

Demo! https://www.pinecone.io/learn/vector-database/

Where do vectors fall flat?
How do you…
• Limited metadata / connections
• Verify vector representations?
• Explain how it got to the answer?
• Ensure highest relevance / accuracy?

Graph dbs

What is a graph?
[Diagram: Jennifer, Jacob, ??]

What is a graph?
[Diagram: nodes Company, Jennifer, Jacob; relationships WORKED_FOR (x2)]

What is a graph?
[Diagram: nodes Company, Jennifer, Jacob, School; relationships WORKED_FOR (x2), ATTENDED (x2)]

What is a graph?
[Diagram: nodes Degree (x2), Company, Jennifer, Jacob, School; relationships ATTENDED (x2), WORKED_FOR (x2), ENROLLED_IN (x2)]

What is a graph?
[Diagram: nodes Adrian, Degree (x3), Company, Jennifer, Jacob, School; relationships ATTENDED (x2), WORKED_FOR (x2), ENROLLED_IN (x3), COMPLETED (x2)]

What is a graph?
[Diagram: node labels Person (x3), Degree (x3), Company, School; node names Edward Jones, Jacob, Jennifer, SIUE, Music, CMIS, CS, Adrian; relationships ATTENDED (x2), WORKED_FOR (x2), ENROLLED_IN (x3), COMPLETED (x2)]

What is a graph?
Answers through relationships
• How many coworkers shared classes/degrees?
• What are common degree journeys?
• How many alumni re-enroll for higher degrees?
• Who else went to a school and works for company?
[Diagram: same graph as on the previous slide]

Nodes (vertices)
Objects or entities
• Can have labels
• May have properties
[Diagram: nodes only; labels Person (x3), Degree (x3), Company, School; names Edward Jones, Jacob, Jennifer, SIUE, Music, CMIS, CS, Adrian]

Relationships (edges)
Connect entities
• Must have type (label)
• Must have direction
• May have properties
[Diagram: full graph with relationships ATTENDED, WORKED_FOR, ENROLLED_IN, COMPLETED]
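The node and relationship rules above can be sketched as a small in-memory property graph. The node names come from the slides, but the specific edges are illustrative assumptions (the transcript does not say who connects to what), and a graph database would express the final question as a query rather than a Python set intersection.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    name: str
    labels: set = field(default_factory=set)        # nodes can have labels
    properties: dict = field(default_factory=dict)  # nodes may have properties

@dataclass
class Relationship:
    start: Node                                     # direction: start -> end
    type: str                                       # every relationship has a type
    end: Node
    properties: dict = field(default_factory=dict)  # relationships may have properties

jennifer = Node("Jennifer", {"Person"})
jacob = Node("Jacob", {"Person"})
adrian = Node("Adrian", {"Person"})
edward_jones = Node("Edward Jones", {"Company"})
siue = Node("SIUE", {"School"})

# Illustrative edges only (assumption); not taken from the diagram.
relationships = [
    Relationship(jennifer, "WORKED_FOR", edward_jones),
    Relationship(jacob, "WORKED_FOR", edward_jones),
    Relationship(jennifer, "ATTENDED", siue),
    Relationship(jacob, "ATTENDED", siue),
    Relationship(adrian, "ATTENDED", siue),
]

def people_with(rel_type: str, target: Node) -> set:
    """Names of nodes with an outgoing rel_type relationship to target."""
    return {r.start.name for r in relationships
            if r.type == rel_type and r.end is target}

# "Who else went to a school and works for company?"
alumni_coworkers = people_with("ATTENDED", siue) & people_with("WORKED_FOR", edward_jones)
print(sorted(alumni_coworkers))
```

The answer comes from following typed, directed connections, so each result can be traced back to the edges that produced it.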

Graphs add context and meaning
Not just point-to-point, but HOW they connect

Neo4j: sample data
Book[
  id=18505765,
  title=Dark Matter (Star Carrier, #5),
  isbn=0062183990,
  isbn13=9780062183996,
  average_rating=3.93,
  authors=[ Author[name=Ian Douglas] ],
  reviewList=[
    Review[
      id=f78b825c122cc5b924957fafc7382bc1,
      text=As always a Page turner. may me think about where we are in our tech evolution.,
      book_id=18505765,
      rating=4
    ]
  ]
]

Vector vs Graph RAG:

How are they similar?
• Store and index data for efficient retrieval
• Query data with queries
• Semantic searches
• Retrieve metadata / related data

Index differences
Vector db vs Graph db
Vector db:
• Top unit of vector data
• Data upserted into index
• Delete index = delete data
Graph db:
• Search vs Semantic indexes
• Use explicit procedure in Cypher
• Delete index != delete data
https://glennas.wordpress.com/2011/03/13/understanding-graph-databases-marko-rodriguez/
https://docs.pinecone.io/guides/indexes/understanding-indexes

Storage practices
Vector db vs Graph db
Vector db:
• Prioritize stats
• Some metadata / connections
• Avoid large data types / values
Graph db:
• Prioritize relationships
• Value = metadata / connections
• Avoid large types as properties
[Sample records: the Pinecone Document and Neo4j Book shown on the sample data slides]

Metadata / source information
[Diagram labels: Structured + Unstructured, Knowledge graph, Unstructured, Vectors, Structured, Semi-structured]

Graphs = extra layer
• Accuracy:
  • extra context / related info in connections
• Verifiability:
  • check against understandable format
• Explainability:
  • trace path through graph for answer

Our demo
Book recommendations
• Vector embeddings on Review text
• Vector similarity search on Reviews
• Traverse graph from similar results
• Less needle-in-a-haystack
• More relevant!
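The demo flow above, similarity search on reviews followed by a graph traversal to the related book, can be approximated in memory. This is a sketch under stated assumptions, not the demo's Spring AI code: the two-dimensional embeddings are toy values, the second book and review are hypothetical, and a dictionary lookup on `book_id` stands in for traversing a relationship.

```python
import math

# Book from the Neo4j sample slide, plus one hypothetical book for contrast.
BOOKS = {
    18505765: {"title": "Dark Matter (Star Carrier, #5)", "authors": ["Ian Douglas"]},
    1: {"title": "Hypothetical Cookbook", "authors": ["Hypothetical Author"]},
}

# Reviews link to books through book_id. Embeddings are toy values (assumption).
REVIEWS = [
    {"text": "As always a Page turner. may me think about where we are in our tech evolution.",
     "book_id": 18505765, "rating": 4, "embedding": [0.9, 0.1]},
    {"text": "Hypothetical review about weeknight dinners.",
     "book_id": 1, "rating": 5, "embedding": [0.1, 0.9]},
]

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def similar_reviews(query_embedding, k=1):
    """Step 1: vector similarity search over the review embeddings."""
    ranked = sorted(REVIEWS,
                    key=lambda r: cosine_similarity(query_embedding, r["embedding"]),
                    reverse=True)
    return ranked[:k]

def recommend(query_embedding, k=1):
    """Step 2: follow each similar review to its book and authors."""
    results = []
    for review in similar_reviews(query_embedding, k):
        book = BOOKS[review["book_id"]]
        results.append({"title": book["title"], "authors": book["authors"],
                        "review": review["text"], "rating": review["rating"]})
    return results

recommendations = recommend([1.0, 0.0])
print(recommendations[0]["title"])
```

The LLM then receives the review together with the book title and authors, so the context it works from is both more complete and traceable to specific records.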

Demo! https://thenewstack.io/graphrag-101-increasing-genai-accuracy-and-completeness/

Resources
• GitHub repository (today’s code): github.com/JMHReif/vector-graph-rag
• Docs for Spring AI: https://docs.spring.io/spring-ai/reference/api/vectordbs.html
• GraphAcademy LLM courses: graphacademy.neo4j.com/categories/llms/
• Knowledge graph ebook: https://neo4j.com/whitepapers/developers-guide-how-to-build-knowledge-graph/
