subhasmita sahoo’s Post

View profile for subhasmita sahoo, graphic

Senior SWE | Generative AI @ Google Search

ChAI Talk #3: LLMs and Tokens (Word Chunks) ☕️🧩 ☕️ 𝗧𝗼𝗱𝗮𝘆'𝘀 𝘀𝗶𝗽: LLMs Learn New Words Just Like Us! 🧠💡 Remember learning 'play' + 'ground' = 'playground'? LLMs do the same with 'tokens' (word parts). Why use tokens? 1. Speed: Process language faster 2. Smarts: Understand new words easily How it works: • LLM recognizes parts: 'play', 'ing', 'er', 'sing' from known words ('playing', 'singer') • LLM combines parts: Learns new word 'play' + 'er' = 'player', 'sing' + 'ing' = 'singing'. Result: Huge vocab from fewer parts! 🤯 𝗠𝗶𝗻𝗱-𝗯𝗼𝗴𝗴𝗹𝗶𝗻𝗴 𝗳𝗮𝗰𝘁: GPT-3 covers all English with a 50,000-token vocabulary, from 𝘵𝘦𝘢 to 𝘱𝘯𝘦𝘶𝘮𝘰𝘯𝘰𝘶𝘭𝘵𝘳𝘢𝘮𝘪𝘤𝘳𝘰𝘴𝘤𝘰𝘱𝘪𝘤𝘴𝘪𝘭𝘪𝘤𝘰𝘷𝘰𝘭𝘤𝘢𝘯𝘰𝘤𝘰𝘯𝘪𝘰𝘴𝘪𝘴! 👇 𝗬𝗼𝘂𝗿 𝘁𝘂𝗿𝗻: Play around with https://lnkd.in/e3cw3Gmc to see how words break into tokens! Share your experience below! #ChAI #ChAITalk #ChAITalk03 #LLM #NLP #ML #AI #OpenAI #GPT3

To view or add a comment, sign in

Explore topics