Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
網路爬蟲與文字探勘工作坊
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
tlyu0419
November 15, 2021
Technology
550
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
網路爬蟲與文字探勘工作坊
tlyu0419
November 15, 2021
More Decks by tlyu0419
See All by tlyu0419
網路爬蟲與文字探勘 證券公司 App 評論分析的資料科學旅程
tlyu0419
0
130
網頁爬蟲技術於人力資源管理的應用
tlyu0419
0
360
Topic Modeling with Python: What do Customers Care about Digital Banking Apps?
tlyu0419
0
230
資料血緣: 營運機器/深度學習模型的秘密武器
tlyu0419
0
380
Mastering Feature Engineering: Mining the Hidden Salary Formula with CakeResume
tlyu0419
0
400
Spark_Task_Optimization_Journey_How_I_Increased_10x_Speed_by_Performance_Tuning
tlyu0419
0
400
Why we want to become PyCon TW volunteers
tlyu0419
0
230
Regular expression in Python - From zero to hero
tlyu0419
0
310
資料視覺化工作坊
tlyu0419
0
280
Other Decks in Technology
See All in Technology
SONiCの統計情報を取得したい
sonic
0
220
脆弱性対応、どこで線を引くか
rymiyamoto
1
410
IaC コードを資産へ:AWS CDK 社内ライブラリと横断展開 / aws-summit-japan-2026
gotok365
2
800
【Cyber-sec+】経営層を"動かす"ための考え方
hssh2_bin
0
190
生成 AI 実践ガイド (概略版) AIガバナンス編
asei
0
110
Kubernetesにおける学習基盤とLLMOpsの概要
ry
1
320
フィジカル版Github Onshapeの紹介
shiba_8ro
0
290
自分が詳しくない領域でAIを使う #プロヒス2026
konifar
11
3.4k
不要なレビューをAIにまかせて AIコーディングの環境改善を加速した
shoota
1
220
FPGAの開発コンペでZephyrを使ってみた
iotengineer22
0
120
【2026年版】 ベクトル検索とEmbedding最前線
mocobeta
14
3.8k
Oracle Cloud Infrastructure:2026年6月度サービス・アップデート
oracle4engineer
PRO
0
100
Featured
See All Featured
Making the Leap to Tech Lead
cromwellryan
135
9.9k
HU Berlin: Industrial-Strength Natural Language Processing with spaCy and Prodigy
inesmontani
PRO
0
410
SEO for Brand Visibility & Recognition
aleyda
0
4.6k
Java REST API Framework Comparison - PWX 2021
mraible
34
9.4k
How to train your dragon (web standard)
notwaldorf
97
6.7k
Easily Structure & Communicate Ideas using Wireframe
afnizarnur
194
17k
Jamie Indigo - Trashchat’s Guide to Black Boxes: Technical SEO Tactics for LLMs
techseoconnect
PRO
0
170
Deep Space Network (abreviated)
tonyrice
0
210
Google's AI Overviews - The New Search
badams
0
1k
Why You Should Never Use an ORM
jnunemaker
PRO
61
9.9k
Thoughts on Productivity
jonyablonski
76
5.2k
Unsuck your backbone
ammeep
672
58k
Transcript
None
None
None
None
None
None
None
None
None
• • • • •
• • • • • •
• • • •
None
• • • • • • • • •
None
None
Ans:
◼
◼
None
None
None
None
◼ ◼ ◼ ◼
None
None
None
• ➢ ➢ ➢ ➢ ➢
None
CONTENTS
None
None
None
None
Ref: LDA - How to grid search best topic models?
None
None
None
None
None
None
None
None
SOURCE_NAME SOURCE TARGET_NAME TARGET TIME TEXT Linda**** 1795**** **** 1000****
2020-01-01 19:53 **** 1000**** Tsai Ing-wen 4625**** 2019-11-19 15:24 ... **** 1000**** Tsai Ing-wen 4625**** 2019-11-13 20:37 Hsu**** 1000**** Tsai Ing-wen 4625**** 2019-11-19 18:59 Ingwen**** 1000**** Tsai Ing-wen 4625**** 2019-11-30 05:31 ... Faithé**** 1000**** Faithé**** 1000**** 2020-01-01 22:00 ... ... ... ... ... ...
None
None
None
• • •
None
None
• •
None
None
Study: Twitter Sentiment Mirrored Facebook’s Stock Price Today
CONTENTS
None
• •
None
None
None
None
None
None
• •
•
•
• •
• • •
• • • • • • •
• • • • • • •
None
None
None
• • • • • •
None
None
None
None
None
None
None
None
None
None
None
None
None
• •
• • • • • •
None
None
None
None
• • • • • • • • • •
None
None
None
None
None
None
None
None
(?
! ? ( 0.11 -> 0.28)
None
• • • • • • • • • •
What’s the next?
CONTENTS
None
•
None
None
None
None
• • • • • • • • • •
• • >“<
None
None
•
None
None
None
• >”<
None
None
None
None
沒錢、沒人的話可以「借」別人的模型 借完再拿來當目標變數 Train 自己的模型XD 解釋力最強,但需要花時間溝通>”< 小心別用壞別人的網站 有點吃翻譯的效度XD
None
0
What’s the next?
None
None
None
• • • • • • • • • •
• • >“<
None
None
None
/ ( ) ( )
None
None
None
CONTENTS
None
None
/
• • • • • • • • • •
• • • • •
None