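Verbs, adjectives, and adjectival nouns (形状詞) that occur at least 100 times in each year
- Preprocessing
  Remove supplementary symbols (補助記号) and sentences shorter than 20 words; remove duplicates with uniq
- Context window size
  10 words before and after
- Dimensionality reduction methods
  SVD, ICA

The target words were chosen in order to analyze words that undergo semantic change in the narrow sense: the aim is to capture gradual changes rather than abrupt changes such as those seen in nouns.
For ICA, the components are sorted in descending order of the absolute value of their skewness [Yamagiwa+, 2023].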
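The settings above can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the pipeline used in the slides: the function names (`preprocess`, `cooccurrence`, `svd_reduce`, `sort_by_abs_skewness`) are hypothetical, symbols are passed in as a plain set of tokens instead of being detected by part-of-speech tag, sentence length is counted after symbol removal, raw co-occurrence counts stand in for whatever weighting is actually applied, and the ICA step itself is left out (the last function would be applied to the output of any ICA implementation).

```python
import numpy as np

def preprocess(sentences, symbols, min_len=20):
    """Drop symbol tokens, drop sentences with fewer than min_len tokens,
    and remove exact duplicates (the role played by `uniq`)."""
    seen, kept = set(), []
    for tokens in sentences:
        tokens = tuple(t for t in tokens if t not in symbols)
        # Assumption: length is measured after symbols are removed.
        if len(tokens) < min_len or tokens in seen:
            continue
        seen.add(tokens)
        kept.append(list(tokens))
    return kept

def cooccurrence(sentences, vocab, window=10):
    """Count co-occurrences within `window` words before and after each word.
    Assumption: target words and context words share one vocabulary."""
    index = {w: i for i, w in enumerate(vocab)}
    counts = np.zeros((len(vocab), len(vocab)))
    for tokens in sentences:
        for i, w in enumerate(tokens):
            if w not in index:
                continue
            start = max(0, i - window)
            stop = min(len(tokens), i + window + 1)
            for j in range(start, stop):
                if j != i and tokens[j] in index:
                    counts[index[w], index[tokens[j]]] += 1
    return counts

def svd_reduce(matrix, dim):
    """Truncated SVD. Assumption: word vectors are taken as U * S."""
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :dim] * s[:dim]

def sort_by_abs_skewness(components):
    """Reorder columns (e.g. ICA components) by descending |skewness|.
    Assumes no column is constant, since skewness is undefined there."""
    centered = components - components.mean(axis=0)
    std = centered.std(axis=0)
    skew = (centered ** 3).mean(axis=0) / std ** 3
    order = np.argsort(-np.abs(skew))
    return components[:, order], skew[order]
```

A typical call order would be `preprocess`, then `cooccurrence` with `window=10`, then `svd_reduce` or an ICA routine, and finally `sort_by_abs_skewness` on the ICA output.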
in the Semantic Orientation of Words. LREC 2010.
- [Lazaridou+, 2021] Mind the Gap: Assessing Temporal Generalization in Neural Language Models. NeurIPS 2021.
- [Su+, 2022] Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change. EMNLP 2022.
- [Hamilton+, 2016] Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change. ACL 2016.
Contextualised Word Representations. ACL 2020.
- [Mikolov+, 2014] Distributed Representations of Words and Phrases and their Compositionality. NeurIPS 2013.
- [Kim+, 2014] Temporal Analysis of Language through Neural Language Models. ACL 2014 workshop.
- [Levy and Goldberg, 2014] Neural Word Embedding as Implicit Matrix Factorization. NeurIPS 2014.
2. 2023.
- [Yamagiwa+, 2023] Discovering Universal Geometry in Embeddings with ICA. EMNLP 2023.
- [松井秀俊 (Hidetoshi Matsui), 2020] 関数データ解析の概要とその方法 (Overview of Functional Data Analysis and Its Methods). Speaker Deck, 2020.