Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
[ACL 2026 Demo] Fast-MIA: Efficient and Scalabl...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Shotaro Ishihara
May 12, 2026
Research
37
0
Share
[ACL 2026 Demo] Fast-MIA: Efficient and Scalable Membership Inference for LLMs
https://arxiv.org/abs/2510.23074
https://github.com/Nikkei/fast-mia
Shotaro Ishihara
May 12, 2026
More Decks by Shotaro Ishihara
See All by Shotaro Ishihara
Fast-MIA: Efficient and Scalable Membership Inference for LLMs
upura
0
27
JAPAN AI CUP Prediction Tutorial
upura
2
1.1k
情報技術の社会実装に向けた応用と課題:ニュースメディアの事例から / appmech-jsce 2025
upura
0
380
日本語新聞記事を用いた大規模言語モデルの暗記定量化 / LLMC2025
upura
0
670
Quantifying Memorization in Continual Pre-training with Japanese General or Industry-Specific Corpora
upura
1
110
JOAI2025講評 / joai2025-review
upura
0
1.6k
AI エージェントを活用した研究再現性の自動定量評価 / scisci2025
upura
1
250
JSAI2025 企画セッション「人工知能とコンペティション」/ jsai2025-competition
upura
0
130
生成的推薦の人気バイアスの分析:暗記の観点から / JSAI2025
upura
0
400
Other Decks in Research
See All in Research
IEEE AIxVR 2026 Keynote Talk: "Beyond Visibility: Understanding Scenes and Humans under Challenging Conditions with Diverse Sensing"
miso2024
0
190
Harness Engineering and Al Agent
kzinmr
3
1.5k
データセンター事業者を取り巻く近年の状況とその中での研究開発動向、テストベッドへの貢献の可能性
kikuzo
1
130
機械学習で作った ポケモン対戦bot で 遊ぼう!
fufufukakaka
0
220
2026 東京科学大 情報通信系 研究室紹介 (大岡山)
icttitech
0
3.5k
ローテーション別のサイドアウト戦略 ~なぜあのローテは回らないのか?~
vball_panda
0
330
多様なデータを許容し学習し続ける模倣学習 / Advanced Imitation Learning for VLA
prinlab
0
200
LINEヤフー データサイエンス Meetup「三井物産コモディティ予測チャレンジ」の舞台裏-AlpacaTechパート
gamella
1
530
Aurora Serverless からAurora Serverless v2への課題と知見を論文から読み解く/Understanding the challenges and insights of moving from Aurora Serverless to Aurora Serverless v2 from a paper
bootjp
6
1.7k
さくらインターネット研究所テックトーク2026春、研究開発Gr.25年度成果26年度方針
kikuzo
0
140
羽田新ルート運用6年の検証
1manken
0
150
Research Engineerという仕事 / Research Engineering: Bridging Research and Business
chck
1
150
Featured
See All Featured
VelocityConf: Rendering Performance Case Studies
addyosmani
333
25k
Optimising Largest Contentful Paint
csswizardry
37
3.7k
Building a Modern Day E-commerce SEO Strategy
aleyda
45
9.1k
Building Applications with DynamoDB
mza
96
7k
Making Projects Easy
brettharned
120
6.6k
HU Berlin: Industrial-Strength Natural Language Processing with spaCy and Prodigy
inesmontani
PRO
0
390
Facilitating Awesome Meetings
lara
57
6.9k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
52
5.9k
The Anti-SEO Checklist Checklist. Pubcon Cyber Week
ryanjones
0
140
The Impact of AI in SEO - AI Overviews June 2024 Edition
aleyda
5
1.1k
Information Architects: The Missing Link in Design Systems
soysaucechin
0
940
AI in Enterprises - Java and Open Source to the Rescue
ivargrimstad
0
1.3k
Transcript
Hiromu Takahashi and Shotaro Ishihara ACL 2026 System Demonstrations Fast-MIA:
Efficient and Scalable Membership Inference for LLMs
uv run --with vllm python main.py \ --config config/sample.yaml 1.
High-throughput batch inference using vLLM (about 5 times faster individually) 2. Cross-method caching architecture (Reduce the total processing time for benchmarking multiple methods) https://github.com/Nikkei/fast-mia Fast-MIA: Efficient and Scalable 2 LLM LOSS vLLM backend batch inference Shared Cache Reuse across methods PPL/zlib Min-K% Prob DC-PDD Lowercase PAC ReCaLL Con-ReCall SaMIA ……
Membership Inference Attack (MIA) on LLMs 3 LLM Is this
text included? Text Pre-training Data • Calculate the log-likelihood, etc. • Various methods have been proposed.
Challenges in MIA on LLMs 4 LLM Is this text
included? Text Pre-training Data • Calculate the log-likelihood, etc. • Various methods have been proposed. 1. Growing computational demands for individual MIA methods. 2. Redundant computation across methods for benchmarking.
We introduce Fast-MIA 5 1. Growing computational demands for individual
MIA methods. 2. Redundant computation across methods for benchmarking. LLM LOSS vLLM backend batch inference Shared Cache Reuse across methods PPL/zlib Min-K% Prob DC-PDD Lowercase PAC ReCaLL Con-ReCall SaMIA …… 1. High-throughput batch inference using vLLM. 2. Cross-method caching architecture.
uv run --with vllm python main.py \ --config config/sample.yaml How
to Use: https://github.com/Nikkei/fast-mia 6 model: model_id: "huggyllama/llama-30b" data: data_path: "swj0419/WikiMIA" format: "huggingface" text_length: 32 methods: - type: "loss"
AUC Reproducibility and Speed 7 Left: Fast-MIA Right: Transformers-based implementations
Inference time (the number of inferences) The cache is working
8
uv run --with vllm python main.py \ --config config/sample.yaml 1.
High-throughput batch inference using vLLM (about 5 times faster individually) 2. Cross-method caching architecture (Reduce the total processing time for benchmarking multiple methods) https://github.com/Nikkei/fast-mia Contributions Welcome 9 LLM LOSS vLLM backend batch inference Shared Cache Reuse across methods PPL/zlib Min-K% Prob DC-PDD Lowercase PAC ReCaLL Con-ReCall SaMIA ……