Ivory - Data Modelling
Ambiata
October 20, 2014
Transcript
IVORY DATA MODELLING http://github.com/ambiata/ivory © Ambiata 2014
WHAT WE START WITH
WHAT WE NEED
Feature vectors (rows: instances, columns: features)

instance   gender  balance  purchases  zipcode  prop_online  num_accs
89340218   M       0.00     3          3001     1.00         1
48149407   -       634.83   16         4670     0.6875       3
18452274   F       15.12    2          -        0.50         1
07499337   M       33.56    2          -        1.00         1
62948721   F       98.34    12         3303     0.8333       4
93754723   -       523.81   23         2046     0.4782       1
00272446   F       1086.05  17         -        1.00         2
13374497   F       224.81   9          -        0.2222       1
31989993   M       78.21    2          2134     0.50         1
46474236   -       126.48   4          -        0.0          1
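As a rough illustration of the feature-vector layout, the sketch below turns per-entity feature values into fixed-order rows. The feature names come from the slide; the entity keys, the sample values, the `to_vector` helper, and the reading of "-" as a missing value (represented here as `None`) are assumptions made for the example, not Ivory's API.

```python
from typing import Dict, List, Optional, Sequence, Union

Value = Union[str, int, float]

# Column order taken from the slide.
FEATURES = ("gender", "balance", "purchases", "zipcode", "prop_online", "num_accs")


def to_vector(values: Dict[str, Value],
              features: Sequence[str] = FEATURES) -> List[Optional[Value]]:
    """Lay out one entity's feature values in a fixed column order.

    Features with no value for this entity become None
    (assumed to correspond to the "-" cells on the slide).
    """
    return [values.get(name) for name in features]


# Hypothetical extracted features, keyed by a made-up entity id.
extracted = {
    "entity-a": {"gender": "F", "balance": 15.12, "purchases": 2,
                 "prop_online": 0.5, "num_accs": 1},
    "entity-b": {"balance": 634.83, "purchases": 16, "zipcode": 4670,
                 "prop_online": 0.6875, "num_accs": 3},
}

# One row (feature vector) per entity, every row with the same columns.
matrix = {entity: to_vector(vals) for entity, vals in extracted.items()}
```

Fixing the column order up front means every row has the same width, so a missing fact shows up as an explicit gap rather than a shifted column.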
Ingest facts -> Ivory Repository -> Extract features
Fact ETL: Source data -> Entity resolution + attribution -> Factset -> Ingest facts -> Ivory Repository -> Extract features
WHAT’S A FACT?
WHAT’S A FEATURE?
FACT
• Atomic piece of information attributed to an entity
• 2 types: states and events
• Captured as close to the “source” as possible
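One way to picture a fact is as a small immutable record. The sketch below is only an illustration: the field names (`entity`, `attribute`, `value`, `as_of`, `fact_type`) and the sample values are assumptions for this example and are not taken from Ivory's actual schema.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum
from typing import Union


class FactType(Enum):
    STATE = "state"  # a value that holds until superseded, e.g. gender
    EVENT = "event"  # something that happened at a point in time, e.g. a purchase


@dataclass(frozen=True)
class Fact:
    """Hypothetical record for one atomic piece of information about an entity."""
    entity: str                    # who the fact is attributed to
    attribute: str                 # what is being recorded
    value: Union[str, int, float]  # the recorded value
    as_of: date                    # when the fact was observed
    fact_type: FactType


# Illustrative facts for a single made-up entity.
facts = [
    Fact("entity-a", "gender", "F", date(2014, 1, 1), FactType.STATE),
    Fact("entity-a", "purchase_amount", 34.50, date(2014, 3, 9), FactType.EVENT),
]
```

Keeping each record this small is what "atomic" suggests: one entity, one attribute, one value, one point in time.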
• State facts
  • Demographics, e.g. gender, DOB, zipcode
  • Account statuses
  • Subscription states
  • Snapshots, e.g. account balance at end of month
  • Segments
• Event facts
  • Purchases
  • Page views
  • Phone calls
  • Queries
FEATURE
• Attribute that describes one aspect of an entity
• Derived from facts
• Simplest feature is “latest value before ‘date’”
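The "latest value before 'date'" feature can be sketched as a filter followed by a max over timestamps. This is a minimal illustration, not Ivory's implementation: the `Fact` tuple, the `latest_before` helper, and the sample values are hypothetical, and treating "before" as strictly earlier than the cutoff is an assumption.

```python
from datetime import date
from typing import NamedTuple, Optional, Sequence, Union


class Fact(NamedTuple):
    """Hypothetical fact record used only for this example."""
    entity: str
    attribute: str
    value: Union[str, int, float]
    as_of: date


def latest_before(facts: Sequence[Fact], entity: str, attribute: str,
                  cutoff: date) -> Optional[Union[str, int, float]]:
    """Return the most recent value recorded strictly before `cutoff`, or None."""
    candidates = [
        f for f in facts
        if f.entity == entity and f.attribute == attribute and f.as_of < cutoff
    ]
    if not candidates:
        return None  # no fact known for this entity before the cutoff
    return max(candidates, key=lambda f: f.as_of).value


# Illustrative state facts: the entity's zipcode changes over time.
facts = [
    Fact("entity-a", "zipcode", 3001, date(2013, 5, 1)),
    Fact("entity-a", "zipcode", 4670, date(2014, 2, 1)),
]

zip_at_new_year = latest_before(facts, "entity-a", "zipcode", date(2014, 1, 1))
zip_mid_year = latest_before(facts, "entity-a", "zipcode", date(2014, 6, 1))
```

Filtering on the cutoff keeps facts recorded later from leaking into a feature computed for an earlier date.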
• Latest
• Days since latest, days since earliest
• Count, sum
• Mean, quantile, proportion
• Gradient, state changes
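A few of the aggregations listed above can be sketched over one entity's facts. The tuple layouts, the helper names (`event_features`, `state_changes`), the sample values, and the strict "before cutoff" window are all assumptions for illustration; quantile and gradient are left out to keep the example short.

```python
from datetime import date
from statistics import mean

# Hypothetical purchase events for one entity: (date, amount, channel).
purchases = [
    (date(2014, 1, 5), 20.0, "online"),
    (date(2014, 2, 10), 35.0, "store"),
    (date(2014, 3, 1), 5.0, "online"),
    (date(2014, 3, 20), 40.0, "online"),
]

# Hypothetical account-status state facts for the same entity: (date, status).
statuses = [
    (date(2014, 1, 1), "active"),
    (date(2014, 2, 1), "active"),
    (date(2014, 3, 1), "suspended"),
    (date(2014, 4, 1), "active"),
]


def event_features(events, cutoff):
    """Aggregate the events recorded strictly before `cutoff`."""
    window = sorted(e for e in events if e[0] < cutoff)
    if not window:
        return {"count": 0}
    amounts = [amount for _, amount, _ in window]
    online = sum(1 for _, _, channel in window if channel == "online")
    return {
        "count": len(window),
        "sum": sum(amounts),
        "mean": mean(amounts),
        "days_since_latest": (cutoff - window[-1][0]).days,
        "days_since_earliest": (cutoff - window[0][0]).days,
        "prop_online": online / len(window),
    }


def state_changes(states, cutoff):
    """Count how often the state value differs from the previous one before `cutoff`."""
    values = [value for day, value in sorted(states) if day < cutoff]
    return sum(1 for prev, cur in zip(values, values[1:]) if prev != cur)


features = event_features(purchases, date(2014, 4, 1))
changes = state_changes(statuses, date(2014, 4, 1))
```

Each function takes the cutoff date as a parameter, so the same facts can yield different feature values depending on the date the features are extracted for.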