10 Years of Open Source: Navigating the Next AI Revolution

A lot has been happening in the field of AI and Natural Language Processing: there's endless excitement about new technologies, sobering post-hype hangovers and also uncertainty about where the field is heading next. In this talk, I'll share the most important lessons we've learned in 10 years of working on open-source software, the core philosophies that helped us adapt to an ever-changing AI landscape, and why open source and interoperability still win over black-box, proprietary APIs.

Ines Montani

August 28, 2024

Resources

spaCy: Industrial-Strength NLP

https://spacy.io

spaCy is a free, open-source library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text.

Prodigy: Radically efficient machine teaching

https://prodi.gy

Prodigy is a modern annotation tool for creating training data for machine learning models. It’s so efficient that data scientists can do the annotation themselves, enabling a new level of rapid iteration.

spacy-llm: Integrating LLMs into structured NLP pipelines

https://github.com/explosion/spacy-llm

spacy-llm features a modular system for fast prototyping and prompting, and turning unstructured responses into robust outputs for various NLP tasks, no training data required.
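The idea of prompting a model and turning its unstructured response into robust output can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the spacy-llm API: `build_prompt`, `parse_response`, `extract_entities` and `fake_model` are hypothetical names, and `fake_model` stands in for a real LLM call so that the model stays swappable.

```python
import json
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Entity:
    text: str
    label: str


def build_prompt(text: str, labels: List[str]) -> str:
    """Render a task-specific prompt that asks for JSON output."""
    return (
        "Extract entities from the text below. "
        f"Allowed labels: {', '.join(labels)}. "
        'Answer with a JSON list of {"text": ..., "label": ...} objects.\n\n'
        f"Text: {text}"
    )


def parse_response(response: str, text: str, labels: List[str]) -> List[Entity]:
    """Turn the raw response into validated entities.

    Malformed JSON, unknown labels and spans that do not occur in the
    input text are dropped instead of being passed downstream.
    """
    try:
        items = json.loads(response)
    except json.JSONDecodeError:
        return []
    if not isinstance(items, list):
        return []
    entities = []
    for item in items:
        if not isinstance(item, dict):
            continue
        span, label = item.get("text"), item.get("label")
        if isinstance(span, str) and span and span in text and label in labels:
            entities.append(Entity(span, label))
    return entities


def extract_entities(text: str, labels: List[str], model: Callable[[str], str]) -> List[Entity]:
    """Model-agnostic pipeline step: any callable from prompt to string works."""
    return parse_response(model(build_prompt(text, labels)), text, labels)


def fake_model(prompt: str) -> str:
    # Hypothetical stand-in for an LLM: one valid entity, one hallucinated one.
    return json.dumps([
        {"text": "Ines Montani", "label": "PERSON"},
        {"text": "Berlin", "label": "GPE"},
    ])


result = extract_entities("Ines Montani gave a talk about spaCy.", ["PERSON", "ORG"], fake_model)
print(result)
```

Because the parser validates every item against the input text and the allowed labels, the hallucinated "Berlin" entity is discarded and only the grounded entity reaches the rest of the pipeline.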

The AI Revolution Will Not Be Monopolized: How open-source beats economies of scale, even for LLMs

https://speakerdeck.com/inesmontani/the-ai-revolution-will-not-be-monopolized-how-open-source-beats-economies-of-scale-even-for-llms-qcon-london

Are we heading further into a black box era with larger and larger models, obscured behind APIs controlled by big tech monopolies? I don’t think so, and in this talk, I’ll show you why.

A practical guide to human-in-the-loop distillation

https://explosion.ai/blog/human-in-the-loop-distillation

This blog post presents practical solutions for using the latest state-of-the-art models in real-world applications and distilling their knowledge into smaller and faster components that you can run and maintain in-house.
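As a rough sketch of what such a distillation loop might look like: a large model suggests labels, a human corrects them, and a small model is trained on the reviewed data so it can run without the large model. Everything here is an assumption for illustration. `llm_suggest` is a hypothetical stand-in for an LLM call, the example texts and labels are made up, and the tiny Naive Bayes classifier is a swapped-in placeholder for whatever compact component you would actually train.

```python
import math
from collections import Counter, defaultdict


def llm_suggest(text):
    # Hypothetical stand-in for an LLM call: a crude rule that is sometimes wrong.
    return "AGRI" if "prices" in text else "ENERGY"


def review(text, suggestion, corrections):
    # Human-in-the-loop step: keep the suggestion unless a reviewer overrode it.
    return corrections.get(text, suggestion)


def train(examples):
    # Count words per label for a multinomial Naive Bayes classifier.
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    for text, label in examples:
        label_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocab = {word for counts in word_counts.values() for word in counts}
    return word_counts, label_counts, vocab


def predict(model, text):
    # Score each label with log prior plus add-one smoothed word likelihoods.
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    scores = {}
    for label in label_counts:
        n_words = sum(word_counts[label].values())
        score = math.log(label_counts[label] / total)
        for word in text.lower().split():
            score += math.log((word_counts[label][word] + 1) / (n_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


# Made-up example texts, for illustration only.
texts = [
    "wheat prices rise sharply",
    "crude oil output falls",
    "corn harvest beats forecast",
    "gas pipeline reopens",
]
# The reviewer fixes the one suggestion the large model got wrong.
corrections = {"corn harvest beats forecast": "AGRI"}

examples = [(t, review(t, llm_suggest(t), corrections)) for t in texts]
small_model = train(examples)

# At runtime only the small model is needed.
print(predict(small_model, "corn prices rise"))    # AGRI
print(predict(small_model, "oil pipeline falls"))  # ENERGY
```

The large model is only used while creating data. The component that ends up in production is small, fast and can be inspected and retrained in-house.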

How S&P Global is making markets more transparent with NLP, spaCy and Prodigy

https://explosion.ai/blog/sp-global-commodities

A case study on S&P Global’s efficient information extraction pipelines for real-time commodities trading insights in a high-security environment using human-in-the-loop distillation.

Let Them Write Code

https://speakerdeck.com/inesmontani/let-them-write-code-keynote-pycon-india-2019

A talk about the development philosophy and mindset that motivate the design of our tools, with practical tips for how to implement them in your code.

The Window-Knocking Machine Test

https://ines.io/blog/window-knocking-machine-test/

How will technology shape our world going forward? And what tools and products should we build? When imagining what the future could look like, it helps to look back in time and compare past visions to our reality today.

Transcript

  1. Alex Smith Developer Kim Miller Analyst GPT-4 API Modern scriptable

    annotation tool for machine learning developers prodigy.ai 900+ companies 10k+ users
  2. OUR DEVELOPMENT PHILOSOPHY “Let Them Write Code” spacy.fyi/ltwc

    Good tools help people do their work. You don’t have to do their work for them.
  3. OUR DEVELOPMENT PHILOSOPHY “Let Them Write Code” spacy.fyi/ltwc

    Good tools help people do their work. You don’t have to do their work for them. ["go", "swim"]
  4. OUR DEVELOPMENT PHILOSOPHY “Let Them Write Code” spacy.fyi/ltwc

    Good tools help people do their work. You don’t have to do their work for them. ["go", "swim"] spaCy
  5. OUR DEVELOPMENT PHILOSOPHY “Let Them Write Code” spacy.fyi/ltwc

    Good tools help people do their work. You don’t have to do their work for them. You can reinvent the wheel, but don’t try to reinvent the road. ["go", "swim"] spaCy
  6. ^ first commit to spaCy spaCy is first released spacy.io

    everyone gets excited about chat bots
  7. The Window-Knocking Machine Test ines.io/blog/window-knocking-machine-test Are you

    designing a window-knocking machine or an alarm clock? “knocker-uppers”
  8. Hello, I’m Toni’s virtual assistant and

    I help schedule appointments. Do you have time at 1pm on Monday? No, but Tuesday would work for me. Okay, please confirm: Tuesday at 1pm? 1pm is unideal but 3pm would work. Toni doesn’t have availability at 3pm but I could offer a slot at 4pm or 5:30pm. Which time zone is this by the way? I’m in CET.
  9. Hello, I’m Toni’s virtual assistant and

    I help schedule appointments. Do you have time at 1pm on Monday? No, but Tuesday would work for me. Okay, please confirm: Tuesday at 1pm? 1pm is unideal but 3pm would work. Toni doesn’t have availability at 3pm but I could offer a slot at 4pm or 5:30pm. Which time zone is this by the way? I’m in CET. Calendly
  10. Hello, I’m Toni’s virtual assistant and

    I help schedule appointments. Do you have time at 1pm on Monday? No, but Tuesday would work for me. Okay, please confirm: Tuesday at 1pm? 1pm is unideal but 3pm would work. Toni doesn’t have availability at 3pm but I could offer a slot at 4pm or 5:30pm. Which time zone is this by the way? I’m in CET. Calendly “window-knocking machine” “alarm clock”
  11. ^ first commit to spaCy spaCy is first released spacy.io

    everyone gets excited about chat bots
  12. ^ first commit to spaCy spaCy is first released spacy.io

    deep learning is widely adopted everyone gets excited about chat bots
  13. Software 1.0 📄 code 💾 program compiler

    Software 2.0 📊 data 🔮 model algorithm
  14. Software 1.0 📄 code 💾 program compiler

    Software 2.0 📊 data 🔮 model algorithm ✅ tests 📈 evaluation
  15. Software 1.0 📄 code 💾 program compiler

    Software 2.0 📊 data 🔮 model algorithm ✅ tests 📈 evaluation refactoring refactoring
  16. Software 1.0 📄 code 💾 program compiler

    Software 2.0 📊 data 🔮 model algorithm ✅ tests 📈 evaluation refactoring refactoring iteration iteration
  17. language model pre-training works ^ ^ Prodigy is first released

    prodigy.ai few-shot in-context learning works ^ ^
  19. Use cases in industry generative tasks

    📖 single/multi-doc summarization 🧮 reasoning ✅ problem solving ✍ paraphrasing 🖼 style transfer ⁉ question answering predictive tasks 🔖 entity recognition 🔗 relation extraction 👫 coreference resolution 🧬 grammar & morphology 🎯 semantic parsing 💬 discourse structure 📚 text classification
  20. Use cases in industry generative tasks

    📖 single/multi-doc summarization 🧮 reasoning ✅ problem solving ✍ paraphrasing 🖼 style transfer ⁉ question answering predictive tasks 🔖 entity recognition 🔗 relation extraction 👫 coreference resolution 🧬 grammar & morphology 🎯 semantic parsing 💬 discourse structure 📚 text classification structured data many industry problems have remained the same, they just changed in scale
  21. human-facing systems machine-facing models ChatGPT GPT-4 most important differentiation

    is product, not just technology AI products are more than just a model
  22. human-facing systems machine-facing models ChatGPT GPT-4 UI / UX marketing

    customization most important differentiation is product, not just technology AI products are more than just a model
  23. human-facing systems machine-facing models ChatGPT GPT-4 UI / UX marketing

    customization most important differentiation is product, not just technology swappable components based on research, impacts are quantifiable AI products are more than just a model
  24. human-facing systems machine-facing models ChatGPT GPT-4 UI / UX marketing

    customization speed accuracy latency cost most important differentiation is product, not just technology swappable components based on research, impacts are quantifiable AI products are more than just a model
  25. human-facing systems machine-facing models ChatGPT GPT-4 UI / UX marketing

    customization speed accuracy latency cost But what about the data? most important differentiation is product, not just technology swappable components based on research, impacts are quantifiable AI products are more than just a model
  26. human-facing systems machine-facing models ChatGPT GPT-4 UI / UX marketing

    customization speed accuracy latency cost But what about the data? User data is an advantage for product, not the foundation for machine-facing tasks. most important differentiation is product, not just technology swappable components based on research, impacts are quantifiable AI products are more than just a model
  27. human-facing systems machine-facing models ChatGPT GPT-4 UI / UX marketing

    customization speed accuracy latency cost But what about the data? User data is an advantage for product, not the foundation for machine-facing tasks. You don’t need specific data to gain general knowledge. most important differentiation is product, not just technology swappable components based on research, impacts are quantifiable AI products are more than just a model
  28. task-specific output 💬 prompt 📖 text LLM prompt model &

    transform output to structured data spacy.io/usage/large-language-models spacy-llm
  29. task-specific output 💬 prompt 📖 text LLM prompt model &

    transform output to structured data spacy.io/usage/large-language-models spacy-llm config.cfg Structured Data {} LLM Text
  30. task-specific output 💬 prompt 📖 text LLM prompt model &

    transform output to structured data spacy.io/usage/large-language-models unified, model-agnostic API spacy-llm config.cfg Structured Data {} LLM Text
  31. task-specific output 💬 prompt 📖 text LLM prompt model &

    transform output to structured data spacy.io/usage/large-language-models unified, model-agnostic API spacy-llm config.cfg Structured Data {} LLM Text entity recognition entity linking text classification relation extraction and more…
  32. spacy-llm is first released github.com/explosion/spacy-llm spaCy v3 is first released

    in-context learning gains traction LLMs and Generative AI fully hit the mainstream ChatGPT ⏺ ⏺ ⏺
  33. Economies of scale output costs

    OpenAI Google access to talent, compute etc.
  34. Economies of scale output costs

    OpenAI Google access to talent, compute etc. API request batching
  35. Economies of scale output costs

    OpenAI Google high traffic 💧 💧 💧 💧 💧 💧 💧 💧 low traffic batch 💧 💧 💧 💧 💧 💧 💧 💧 … access to talent, compute etc. API request batching
  36. Economies of scale output costs

    OpenAI Google you 🤠 high traffic 💧 💧 💧 💧 💧 💧 💧 💧 low traffic batch 💧 💧 💧 💧 💧 💧 💧 💧 … access to talent, compute etc. API request batching
  37. Case Study: S&P Global • real-time

    commodities trading insights by extracting structured attributes 99% F-score 6mb model size 16k+ words/second explosion.ai/blog/sp-global-commodities
  38. Case Study: S&P Global • real-time

    commodities trading insights by extracting structured attributes • high-security environment 99% F-score 6mb model size 16k+ words/second explosion.ai/blog/sp-global-commodities
  39. Case Study: S&P Global • real-time

    commodities trading insights by extracting structured attributes • high-security environment • used LLM during annotation 99% F-score 6mb model size 16k+ words/second explosion.ai/blog/sp-global-commodities
  40. Case Study: S&P Global • real-time

    commodities trading insights by extracting structured attributes • high-security environment • used LLM during annotation • 10× data development speedup with humans and model in the loop 99% F-score 6mb model size 16k+ words/second explosion.ai/blog/sp-global-commodities
  41. Case Study: S&P Global • real-time

    commodities trading insights by extracting structured attributes • high-security environment • used LLM during annotation • 10× data development speedup with humans and model in the loop • 8 market pipelines in production 99% F-score 6mb model size 16k+ words/second explosion.ai/blog/sp-global-commodities
  43. What’s the total services revenue from 2023? $2,923,531

    How many clients is that in total? 29 ⏺ ⏺ ⏺ ? ines.io/blog/window-knocking-machine-test
  44. What’s the total services revenue from 2023? $2,923,531

    How many clients is that in total? 29 ⏺ ⏺ ⏺ 🔮 LLM 📚 database 🤖 agents ⚙ query Retrieval-Augmented Generation ? ines.io/blog/window-knocking-machine-test
  45. 2023 Year Services Type ACME Inc. FooBar GmbH NLPCorp XKCD

    Ltd. Python AG 432,032 82,000 1,500 193,000 91,320 $2,625,032 Clients (28) Revenue What’s the total services revenue from 2023? $2,923,531 How many clients is that in total? 29 ⏺ ⏺ ⏺ 🔮 LLM 📚 database 🤖 agents ⚙ query Retrieval-Augmented Generation ? ines.io/blog/window-knocking-machine-test
  46. 2023 Year Services Type ACME Inc. FooBar GmbH NLPCorp XKCD

    Ltd. Python AG 432,032 82,000 1,500 193,000 91,320 $2,625,032 Clients (28) Revenue AI still needs product decisions! Kim Miller Analyst What’s the total services revenue from 2023? $2,923,531 How many clients is that in total? 29 ⏺ ⏺ ⏺ 🔮 LLM 📚 database 🤖 agents ⚙ query Retrieval-Augmented Generation ? ines.io/blog/window-knocking-machine-test
  47. Explosion goes back to independent-minded and self-sufficient explosion.ai/blog/back-to-our-roots

    human-in-the-loop distillation is promising prodigy.fyi/distillation everyone is excited about chat bots again
  48. Explosion goes back to independent-minded and self-sufficient explosion.ai/blog/back-to-our-roots

    human-in-the-loop distillation is promising prodigy.fyi/distillation everyone is excited about chat bots again What’s next?
  49. Adoption Cycle rules and conditional logic linear models

    applied workflow applied workflow combine new techniques with established workflows
  50. Adoption Cycle rules and conditional logic deep learning

    linear models applied workflow applied workflow combine new techniques with established workflows
  51. Adoption Cycle rules and conditional logic deep learning

    linear models chat bots applied workflow applied workflow combine new techniques with established workflows
  52. Adoption Cycle rules and conditional logic deep learning

    linear models chat bots applied workflow applied workflow applied workflow combine new techniques with established workflows
  53. Adoption Cycle rules and conditional logic deep learning

    transfer learning linear models chat bots applied workflow applied workflow applied workflow combine new techniques with established workflows
  54. Adoption Cycle rules and conditional logic deep learning

    transfer learning linear models chat bots transformers applied workflow applied workflow applied workflow combine new techniques with established workflows
  55. Adoption Cycle rules and conditional logic deep learning

    transfer learning linear models chat bots transformers applied workflow applied workflow applied workflow applied workflow combine new techniques with established workflows
  56. Adoption Cycle rules and conditional logic deep learning

    transfer learning in-context learning linear models chat bots transformers applied workflow applied workflow applied workflow applied workflow combine new techniques with established workflows
  57. Adoption Cycle rules and conditional logic deep learning

    transfer learning in-context learning linear models chat bots LLMs and GenAI transformers applied workflow applied workflow applied workflow applied workflow combine new techniques with established workflows
  58. Adoption Cycle rules and conditional logic deep learning

    transfer learning in-context learning linear models chat bots LLMs and GenAI transformers applied workflow applied workflow applied workflow applied workflow applied workflow combine new techniques with established workflows
  59. Summary NAVIGATING AI & NLP

    Think beyond chat bots or human-shaped tasks. You don’t want to build a “window-knocking machine”.
  60. Summary NAVIGATING AI & NLP

    Think beyond chat bots or human-shaped tasks. You don’t want to build a “window-knocking machine”. Structured Data {} Focus on your application. Consider what it really needs and let your data guide you.
  61. Summary NAVIGATING AI & NLP

    Think beyond chat bots or human-shaped tasks. You don’t want to build a “window-knocking machine”. Stay ambitious. Don’t compromise on best practices, efficiency and privacy. Structured Data {} Focus on your application. Consider what it really needs and let your data guide you.
  62. Summary NAVIGATING AI & NLP

    Think beyond chat bots or human-shaped tasks. You don’t want to build a “window-knocking machine”. Stay ambitious. Don’t compromise on best practices, efficiency and privacy. LLM Keep filling up your toolbox. Know the techniques you have available and apply the best ones to get the job done. Structured Data {} Focus on your application. Consider what it really needs and let your data guide you.
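
Slides 43 to 46 of the transcript contrast a chat answer ($2,923,531 from 29 clients) with a filtered table view showing $2,625,032 from 28 clients. One way to read this is that the numbers should come from a deterministic query over structured data rather than from generated text. The sketch below illustrates that with Python's built-in `sqlite3` module. It is an illustration only: the table schema and the `revenue_summary` helper are hypothetical, and it contains just the five client rows visible in the transcript, with names and amounts paired in the order they are listed (an assumption). The result therefore covers those five rows and not the full 28 clients on the slide.

```python
import sqlite3

# The five rows visible in the transcript. Pairing names with amounts in the
# order they are listed is an assumption; the slide's full table has 28 clients.
ROWS = [
    ("ACME Inc.", 2023, "Services", 432_032),
    ("FooBar GmbH", 2023, "Services", 82_000),
    ("NLPCorp", 2023, "Services", 1_500),
    ("XKCD Ltd.", 2023, "Services", 193_000),
    ("Python AG", 2023, "Services", 91_320),
]


def build_db(rows):
    """Create an in-memory database with a hypothetical revenue table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE revenue (client TEXT, year INTEGER, type TEXT, amount INTEGER)"
    )
    conn.executemany("INSERT INTO revenue VALUES (?, ?, ?, ?)", rows)
    return conn


def revenue_summary(conn, year, revenue_type):
    """Return (number of clients, total revenue) for the given filters."""
    row = conn.execute(
        "SELECT COUNT(DISTINCT client), COALESCE(SUM(amount), 0) "
        "FROM revenue WHERE year = ? AND type = ?",
        (year, revenue_type),
    ).fetchone()
    return row[0], row[1]


conn = build_db(ROWS)
clients, total = revenue_summary(conn, 2023, "Services")
print(f"{clients} clients, ${total:,}")  # 5 clients, $799,852
```

In a setup like this, a language model could still help translate the question into the filters, but the answer is computed and can be checked against the same table the analyst sees.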