Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Adversarial Filters of Dataset Biases
Search
Scatter Lab Inc.
September 04, 2020
Research
2.3k
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Adversarial Filters of Dataset Biases
Scatter Lab Inc.
September 04, 2020
More Decks by Scatter Lab Inc.
See All by Scatter Lab Inc.
zeta introduction
scatterlab
0
1.9k
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
scatterlab
0
4.4k
Sparse, Dense, and Attentional Representations for Text Retrieval
scatterlab
0
2.3k
Weight Poisoning Attacks on Pre-trained Models
scatterlab
0
2.2k
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
scatterlab
0
2.5k
Beyond Accuracy: Behavioral Testing of NLP Models with CheckList
scatterlab
0
2.3k
Open-Retrieval Conversational Question Answering
scatterlab
0
2.3k
What Can Neural Networks Reason About?
scatterlab
0
2.3k
Exploring the Limits of Transfer Learning with Unified Text-to-Text Transformer
scatterlab
0
2.3k
Other Decks in Research
See All in Research
(SIGQS17) Frasco-VS:フラグメントに基づく薬剤候補化合物選抜の量子アニーリングによる実現
keisukeyanagisawa
PRO
0
110
SoftMatcha 2: 1兆語規模コーパスの超高速かつ柔らかい検索
e869120_sub
6
3.5k
英語教育 “研究” のあり方:学術知とアウトリーチの緊張関係
terasawat
1
990
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
shunk031
4
1k
明日から使える!研究効率化ツール入門
matsui_528
13
7.3k
適応的スパムフィルタのための軽量な類似メッセージカウンタ / jsai2026-adaptive-spam-filter
monochromegane
0
3.6k
CVPR2026論文紹介_VLMにとって良いvision encoderとは何か?Rethinking Model Selection in VLM Through the Lens of Gromov-Wasserstein Distance
kobayashi31
1
110
Research Engineerという仕事 / Research Engineering: Bridging Research and Business
chck
1
210
COFFEE-Japan PROJECT Impact Report(海ノ向こうコーヒー)
ontheslope
0
1.9k
さくらインターネット研究所テックトーク2026春、研究開発Gr.25年度成果26年度方針
kikuzo
0
150
AY 2026 Guide to Academic Writing Using Generative AI - Workshop
ks91
PRO
0
120
IEEE AIxVR 2026 Keynote Talk: "Beyond Visibility: Understanding Scenes and Humans under Challenging Conditions with Diverse Sensing"
miso2024
0
200
Featured
See All Featured
How People are Using Generative and Agentic AI to Supercharge Their Products, Projects, Services and Value Streams Today
helenjbeal
1
210
Efficient Content Optimization with Google Search Console & Apps Script
katarinadahlin
PRO
1
620
Fashionably flexible responsive web design (full day workshop)
malarkey
408
66k
Jess Joyce - The Pitfalls of Following Frameworks
techseoconnect
PRO
1
170
First, design no harm
axbom
PRO
2
1.2k
The Invisible Side of Design
smashingmag
302
52k
Automating Front-end Workflow
addyosmani
1370
210k
Deep Space Network (abreviated)
tonyrice
0
170
What the history of the web can teach us about the future of AI
inesmontani
PRO
1
610
コードの90%をAIが書く世界で何が待っているのか / What awaits us in a world where 90% of the code is written by AI
rkaga
62
44k
Collaborative Software Design: How to facilitate domain modelling decisions
baasie
1
250
Making Projects Easy
brettharned
120
6.7k
Transcript
Adversarial Filters of Dataset Biases ࢿࠁ (ML Research Scientist, Pingpong)
ݾର ݾର 1. োҳ ߓ҃ 2. AFLite 1. द: WinoGrande
ؘఠࣇ 2. ੌ߈ചػ ঌҊ્ܻ 3. प 1. Synthetic Data 2. NLP 3. Vision
োҳ ߓ҃ োҳ ߓ҃
‘߮݃ ؘఠࣇীࢲ ֫ ࢿמਸ ׳ࢿ೮Ҋ ೧ ޙઁܳ ೧Ѿ೮Ҋ ݈ೡ ࣻ
ਸө?’ • In-distribution పझࣇীࢲח ੜೞ݅ Out-of-distribution adversarial sampleীח ডೠ അ࢚ • Input-Output рী ب ঋ Spurious correlation ࢤ҂ӝ ٸޙ • ܳ ೧Ѿೠ ؘఠࣇਸ ٜ݅যঠ दझమਸ ઁ۽ ಣоೡ ࣻ োҳ ߓ҃ High Performance = Problem Solved?
োҳо domain-specificೠ spurious ಁఢਸ ࠙ܨ ߂ ೞҊ ܳ ઁѢೞח
ߑध • োҳ domain-specificೠ धҗ ҙী ઓ • ঌҊ્ܻ ࢸ҅о Ҋ۰ೞ ޅೠ biasח ழߡ ࠛо োҳ ߓ҃ Previous Approaches
AFLite AFLite
• ޙীࢲ ݺࢎо оܻఃח ࢚ਸ ݏח ޙઁ • SOTA ഛب
ড 90% → ݽ؛ Spurious correlationਸ ਊೞח ѱ ইקө? • (3), (4)ח ߃ հ݈ җ ҙ۲ ਸ ഛܫ ֫ই Word association݅ਵ۽ ޙઁܳ ಽ ࣻ AFLite Winograd Schema Challenge (WSC)
• ࢎۈ ؘఠࣇਸ ٜ݅ݶ ۠ Annotation artifactী ೠ Biasܳ
ೖೞӝ য۰ • AFLite۽ ఠ݂ೠ WinoGrande ؘఠࣇ ݽ؛ ഛبب ծҊ ܲ ߮݃۽ Transferب ੜؽ AFLite WinoGrande Dataset
1. ؘఠ ੌࠗ݅ਵ۽ RoBERTa fine-tune 2. Splitਸ ׳ܻ ೞݶࢲ RoBERTa
feature۽ linear classifier ण 3. Split పझࣇীࢲ ߬٬݅ਵ۽ ਸ औѱ ਸ ࣻ ח పझ ೞҊ ੋझఢझ߹۽ ঔ࢚࠶ ࣇী ୶о 4. ৈ۞ linear classifierо ਸ ݏ൦ ࠺ਯ Thresholdܳ ֈח Ѫ Top-kѐܳ ୭ઙ ؘఠࣇীࢲ ઁ৻ 5. ઁ৻غח ѐࣻо kѐо উ غѢա ਗೞח ӝ ؘఠࣇ ؼ ٸ ө 2~4 ߈ࠂ AFLite AFLite in WinoGrande
• ױয ӓࢿ݅ਵ۽ ಽ ࣻ ח ޙઁܳ Ѧ۞ն • ח
ష ۨ߰ Biasۄӝࠁח ҳઑੋ Ѫ۽ lexical-level heuristicਵ۽ח Ѧ۞ղӝ ൨ٝ AFLite Filtered Examples
• AFLiteܳ ৈ۞ بݫੋਵ۽ ഛೞҊ model-agnosticೞѱ ੌ߈ച • Contributions: 1.
࢚݅ intractableೠ AFOptܳ AFLite۽ Ӕࢎೡ ࣻ ਸ ࠁੋ. (Skip) 2. Vision, NLP ࠙ঠ ৈ۞ ؘఠࣇীࢲ प೧ AFLite ਬബࢿਸ ّ߉ஜೠ. 3. Biasܳ হঙ ؘఠࣇਵ۽ णೠ ݽ؛ ੌ߈ചо ੜؽਸ पਵ۽ ࠁੋ. 4. AFLite۽ ఠ݂ೞݶ ؊ بੋ ߮݃ ؘఠࣇਸ ٜ݅ ࣻ ਸ ࠁੋ. AFLite Adversarial Filters of Dataset Biases
: any feature extractor : a family of classification models
Φ M AFLite AFLite (Generalized)
Experiments Experiments
Biasing Dataset • Class-specificೠ ੋҕ featureܳ ؘఠ 75%ী ੑ, աݠח
random feature ੑ • Biased sample ੌࠗח ۨ࠶ ߄Է Results • Linear classifier۽ب ֫ ࢿמ ׳ࢿ • AFLiteܳ ਊೞݶ ࢚धੋ ࢿמਵ۽ جই১ Experiments Synthetic Data
• प ࢚: SNLI annotation artifactܳ ೖೠ out-of-distribution ؘఠࣇ 3ઙ
• Non-entailment ޙઁ ਬഋ߹۽ Zero-shot పझ Experiments NLP: Out-of-distribution Generalization
AFLite۽ ఠ݂ೠ ؘఠࣇ ݽٚ ݽ؛ীࢲ ࢿמ ѱ ڄয Experiments In-distribution
Benchmark Re-estimation: SNLI
Experiments In-distribution Benchmark Re-estimation: MultiNLI & QNLI
• : ImageNet ؘఠࣇ 20%۽ णೠ EfficientNet-B7 feature • ImageNet-A۽
ಣоೞפ AFLite-filtered ؘఠࣇਵ۽ ण೮ਸ ٸ ࢿמ ؊ જ Φ Experiments Vision: Adversarial Image Classification
ImageNet dev setਸ ఠ݂ೞҊ ಣо೮ਸ ٸ ࢿמ ೞۅ ؊ ఀ
Experiments In-distribution Image Classification
ӝઓীب ࠁҊػ ౠ ನૉী ೠ Bias, ݽনࠁ х݅ਵ۽ ҳ࠙ೞח ޙઁ
١җ Ѿਸ эೣ Experiments Filtered Examples
• Adversarial Filtering SWAG: A Large-Scale Adversarial Dataset for Grounded
Commonsense Inference [EMNLP’18] HellaSwag: Can a Machine Really Finish Your Sentence? [ACL’19] • AFLite WinoGrande: An Adversarial Winograd Schema Challenge at Scale [arXiv’19] Adversarial Filters of Dataset Biases [ICML’20] References References
хࢎפ✌ ୶о ޙ ژח ҾӘೠ ݶ ઁٚ ইې োۅ۽
োۅ ࣁਃ! ࢿࠁ (ML Research Scientist, Pingpong)
[email protected]