of words. 1 token ~= 4 chars in English 1 token ~= ¾ words 100 tokens ~= 75 words Or 1-2 sentence ~= 30 tokens 1 paragraph ~= 100 tokens 1,500 words ~= 2048 tokens
like a library, while fine-tuned models are specialized experts, trained on specific tasks for higher accuracy. The ability to “ask questions” is something not present in a foundational model. Lama2 by Meta has released both foundational and fine tuned models. https://medium.com/mantisnlp/supervised-fine-tuning-customizing-llms-a2c1edbf22c3
to simplify the creation of applications using large language models (LLMs). It provides a standard interface for chains, lots of integrations with other tools, and end-to-end chains for common applications.
analysis task using a few-shot prompt. Text: (lawrence bounces) all over the stage, dancing, running, sweating, mopping his face and generally displaying the wacky talent that brought him fame in the first place. Sentiment: positive Text: despite all evidence to the contrary, this clunker has somehow managed to pose as an actual feature movie, the kind that charges full admission and gets hyped on tv and purports to amuse small children and ostensible adults. Sentiment: negative Text: for the first time in years, de niro digs deep emotionally, perhaps because he's been stirred by the powerful work of his co-stars. Sentiment: positive Text: i'll bet the video game is a lot more fun than the film. Sentiment: Few-shot Prompt
context, we can try giving the instruction directly. Please label the sentiment towards the movie of the given movie review. The sentiment label should be "positive" or "negative". Text: i'll bet the video game is a lot more fun than the film. Sentiment: Describe what is quantum physics to a 6-year-old. Instruction Prompting
time into PineconeDB. However in our case we had several thousand vectors generated for our content so we opted for their upsert API which allows for 100 vectors at a time.
a quiz in json. You are a product marketer targeting a Gen Z audience. Create exciting and fresh advertising copy for products and their simple description. Keep copy under a few sentences long. Let’s build a quiz app.
Safety Settings. • Top K • Top P • Stop Sequence https://www.marktechpost.com/2023/09/14/meet-next-gpt-an-end-to-end-general-purpose-any-to-any-multimodal-
which machine learning models can then process. These embeddings are like a dictionary that helps the model understand the meaning of words by placing them in a mathematical space where similar words are located near each other. https://rpradeepmenon.medium.com/introduction-to-large-language-models-and-the-transformer-architecture-534408ed7e61
OpenAI wins on multi-lingual mapping (Islamiat is in Urdu) ❏ Some LLM models work better with some subjects. ❏ Objective is to get around 90% accuracy in mappings! ❏ Right now we are using it to recommend SLOs, however in the future we intend to fully automate.