レンズの下のLLM / LLM under the Lens

December 30, 2023

180

レンズの下のLLM / LLM under the Lens

Henry Cui

December 30, 2023

Tweet

More Decks by Henry Cui

See All by Henry Cui

プロダクション言語モデルの情報を盗む攻撃 / Stealing Part of a Production Language Model

1

200

Direct Preference Optimization

0

370

Diffusion Model with Perceptual Loss

0

390

Go with the Prompt Flow

0

160

0

210

ことのはの力で画像の異常検知 / Anomaly Detection by Language

0

560

驚愕の事実！LangChainが抱える問題 / Problems of LangChain

0

240

MLOps初心者がMLflowを触る / MLflow Brief Introduction

0

120

{{guidance}}のガイダンス / Guidance of guidance

0

160

Other Decks in Programming

See All in Programming

Quand Symfony, ApiPlatform, OpenAI et LangChain s'allient pour exploiter vos PDF : de la théorie à la production…

0

220

RailsGirls IZUMO スポンサーLT

0

200

はじめてのWeb API体験ー飲食店検索アプリを作ろうー

0

140

レベル1の開発生産性向上に取り組む − 日々の作業の効率化・自動化を通じた改善活動

0

300

Modern Angular with Signals and Signal Store:New Rules for Your Architecture @enterJS Advanced Angular Day 2025

0

270

SQLアンチパターン第2版データベースプログラミングで陥りがちな失敗とその対策 / Intro to SQL Antipatterns 2nd

11

1.3k

AI時代の『改訂新版良いコード／悪いコードで学ぶ設計入門』 / ai-good-code-bad-code

23

9.6k

A full stack side project webapp all in Kotlin (KotlinConf 2025)

0

150

Vibe Codingの幻想を超えて-生成AIを現場で使えるようにするまでの泥臭い話.ai

9

3.8k

TypeScriptでDXを上げろ！ Hono編

3

770

スタートアップの急成長を支えるプラットフォームエンジニアリングと組織戦略

1

7.3k

CDK引数設計道場100本ノック

2

480

Featured

See All Featured

Embracing the Ebb and Flow

86

4.8k

Unsuck your backbone

671

58k

Exploring the Power of Turbo Streams & Action Cable | RailsConf2023

34

5.9k

個人開発の失敗を避けるイケてる考え方 / tips for indie hackers

108

19k

30

14k

The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024

26

2.9k

Being A Developer After 40

90

590k

Mobile First: as difficult as doing things right

223

9.7k

Imperfection Machines: The Place of Print at Facebook

267

13k

Automating Front-end Workflow

1370

200k

Faster Mobile Websites

308

31k

How to Create Impact in a Changing Tech Landscape [PerfNow 2023]

53

2.9k

Transcript

レンズの下のLLM 機械学習の社会実装勉強会第30回 Henry 2023/12/30
LLM開発 ▪ LLM開発に必要な機能 • プロンプトエンジニアリングの繰り返し • 実験管理・性能評価・結果比較 ▪ これらの機能を達成する急成長のレポジトリ trulens
2
TruLens-Eval ▪ カルフォルニアにある会社TruEraのプロダクト • MLのMonitor. Debug. Test.にフォーカス ▪ TruLens-EvalはLLMの実験管理のために開発された ▪
TruLens-Explainは深層モデルの解釈性のために開発された ▪ 今日はTruLens-Evalに入門 3
TruLens-Evalを使う ▪ ライブラリインストール pip install trulens-eval==0.19.0 • 最新版の0.20.0では不明のエラーでimportできなかった ▪ シンプルなllm_app
• 2つの引数はサポートされてない ▪ Feedbackのカスタマイズ ▪ trulens-evalコマンドでstreamlitを開く • streamlit-javascriptが必要 4
まとめ ▪ TruLens-Evalの基本的な使い方 ▪ 余力ある方はLangChainなどとの組み合わせ 5