🎉 KAN: Kolmogorov-Arnold Networks ⚡️TL;DR: While MLPs have fixed activation functions on nodes ("neurons"), KANs have learnable activation functions on edges ("weights"). KANs have no linear weights at all -- every weight parameter is replaced by a univariate function parametrized as a spline. 👉 Paper Discussion by First Author, Ziming Liu : https://lnkd.in/gYwKMHfw pip install pykan 👉 Abstract: Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer Perceptrons (MLPs). While MLPs have fixed activation functions on nodes ("neurons"), KANs have learnable activation functions on edges ("weights"). KANs have no linear weights at all -- every weight parameter is replaced by a univariate function parametrized as a spline. We show that this seemingly simple change makes KANs outperform MLPs in terms of accuracy and interpretability. For accuracy, much smaller KANs can achieve comparable or better accuracy than much larger MLPs in data fitting and PDE solving. Theoretically and empirically, KANs possess faster neural scaling laws than MLPs. For interpretability, KANs can be intuitively visualized and can easily interact with human users. Through two examples in mathematics and physics, KANs are shown to be useful collaborators helping scientists (re)discover mathematical and physical laws. In summary, KANs are promising alternatives for MLPs, opening opportunities for further improving today's deep learning models which rely heavily on MLPs. ⚙️ Code: https://lnkd.in/ghpeJjyU 🦋 Docs: https://lnkd.in/gTBdBfYh 📄 Paper: https://lnkd.in/gcwniZuS 💡 Note: Checkout the “Author’s Note” section on GitHub ReadMe. https://lnkd.in/g7F3HPr7 Also see: Althernate implementation of KAN on GitHub 👉 efficient-kan: https://lnkd.in/gjGEEkpX 👉 fourier-kan: https://lnkd.in/gbjh5x5i #KAN #paper #code #docs #python #research #physics #pinn #ml
Sugato Ray’s Post
More Relevant Posts
-
Multilevel Thresholding is employed as a preprocessing step in computer vision for object detection tasks. When the image's diversity and light scattering are not excessive, this method can be used as an alternative to Otsu, offering a lower computational cost. I recorded a tutorial on this (Persian Language) https://lnkd.in/dFb2gxJS
تکنیک آستانه گذاری چیست؟ آستانه گذاری چند لایه چیست؟ (1) | Thresholding
https://www.youtube.com/
To view or add a comment, sign in
-
For those who couldn't make it to SF in September for the AI Conference, check out the video link for the talk I gave on Graph RAG here 👇🏽. https://lnkd.in/gKR7AS4T The original BlackRock/Nvidia Hybrid RAG paper mentions triples (likely knowledge graphs using an RDF data model), and it's also light on implementation details of graph retrieval. Recently, I've been experimenting with strategies that utilize lightweight prompting frameworks and a reranker to combine results from graph + vector RAG, and this approach works surprisingly well even when using Kùzu's property graph data model and text2Cypher to retrieve from the graph. Give the video a watch first, and then check out the code here: https://lnkd.in/gaAWjqYv Happy to discuss more with anyone who has thoughts/comments on this methodology!
A couple months ago, Kùzu Inc. was at The AI Conference in #SanFrancisco, where we presented an overview, history and terminology of Graph RAG and Hybrid RAG (which combines vector + graph retrieval). For those who could not attend the session live, the recording has been kindly made available here by the organizers 👇🏽. https://lnkd.in/gajbbKUn In the talk, we unpacked some of the core ideas behind the recent paper "Hybrid RAG" by researchers at BlackRock and Nvidia, that combines vector and graph retrieval. We showcased a proposed workflow that can be built using Kùzu and the property graph data model. We discussed how and why a graph is beneficial, because it's able to provide additional context to an LLM for generating an answer. We also present a framework of thinking when conceptualizing and explaining the workflow for Graph/Hybrid RAG based on the BlackRock paper. For those who like to get their hands dirty with code, here's our GitHub repo that walks through the key steps using a combination of easy to use, open source, permissively licensed tools and frameworks: https://lnkd.in/gisiSieA If you're interested in getting started with using graphs, we encourage you to explore these ideas and try out the code! #graph #rag #kuzu #lancedb
Prashanth Rao, Kuzu: Unpacking Graph RAG: An overview of history, terminologies and examples
https://www.youtube.com/
To view or add a comment, sign in
-
A Paradigm Shift: MoRAs Role in Advancing Parameter-Efficient Fine-Tuning Techniques Parameter-efficient fine-tuning
To view or add a comment, sign in
-
This paper studies the performance limits of a popular #text #SemCom system named #DeepSC in the presence of (#multi-#interferer) #radio #frequency #interference (#RFI). By introducing a principled probabilistic framework for #semantic #communication (#SemCom), the authors show that DeepSC produces semantically irrelevant sentences as the power of (#multi-#interferer) RFI gets very large. They also derive DeepSC’s practical limits and a #lower #bound on its outage probability under multi-interferer #RFI, and propose a (#generic) #lifelong DL-based #interference-#resistant #and #robust (#IR #2 ) SemCom system. They corroborate the derived limits with simulations and computer experiments, which also affirm the vulnerability of DeepSC to a #wireless #attack using RFI. ---Tilahun M. Getu, Walid Saad, Georges Kaddoum, Mehdi Bennis More details can be found at this link: https://lnkd.in/ga2BbSAt
Performance Limits of a Deep Learning-Enabled Text Semantic Communication under Interference
ieeexplore.ieee.org
To view or add a comment, sign in
-
Can KAN Fit TAN?: My Latest Medium Article In this article, I delve into Kolmogorov-Arnold Network (KAN), and try to provide alternative viewpoint. Further, explain a novel Math experiment to appreciate strengths of KAN. Key Highlights: --> Understanding KAN --> KAN vs MLP --> Understanding KAN's parameters --> Implementing novel experiment with Radial Tan hyperbolic function, using KAN & PyTorch Check out my latest article on Medium and share your thoughts! How KAN can revolutionize AI/ML landscape? Read the full article here: [https://lnkd.in/gG_pXMGi] #MachineLearning #DataScience #Mathematics #Programming #Research
Can KAN Fit TAN?
link.medium.com
To view or add a comment, sign in
-
🚀𝗧𝗵𝗲 𝗲𝘃𝗼𝗹𝘂𝘁𝗶𝗼𝗻 𝗼𝗳 𝗴𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝘃𝗲 𝗔𝗜 has reached a new frontier with 𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 𝗔𝘂𝗴𝗺𝗲𝗻𝘁𝗲𝗱 𝗚𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝗼𝗻 (𝗥𝗔𝗚). By pairing large language models with real-time data retrieval, RAG allows for more accurate, relevant, and 𝗱𝘆𝗻𝗮𝗺𝗶𝗰 𝗰𝗼𝗻𝘁𝗲𝗻𝘁 𝗰𝗿𝗲𝗮𝘁𝗶𝗼𝗻—transforming the way marketing teams operate. Instead of relying on static, pre-trained knowledge, RAG empowers AI to generate content that is timely and personalized to specific campaigns and audiences. ✨At GoodBards, we're proud to integrate RAG into our platform, allowing marketing teams to harness this cutting-edge technology for 𝗮𝗴𝗶𝗹𝗲, 𝗱𝗮𝘁𝗮-𝗱𝗿𝗶𝘃𝗲𝗻 𝗰𝗮𝗺𝗽𝗮𝗶𝗴𝗻𝘀. What's more, our very own CTO, Cédrick Lunven, is one of the leading java developers in this space. He recently shared the stage at 𝗗𝗲𝘃𝗼𝘅𝘅 𝟮𝟬𝟮𝟰 in Antwerp with Google Cloud's Guillaume Laforge, where they discussed the latest innovations in AI and cloud technology. ✨ 💡 Cedrick's expertise in RAG and AI-driven marketing is shaping the future of how businesses can leverage these tools to stay ahead. If you're ready to take your marketing to the next level, Good Bards is here to make it happen! 💡 To see Cedrick in action check out the recording of the session: https://lnkd.in/dim-PnDp For a Good Bards demo, simply Contact Us on www.goodbards.com #GoodBards #GenerativeAI #RAG #MarketingInnovation #Devoxx2024 #AIForMarketing #AgileMarketing
From naive to advanced RAG: the complete guide by Cédrick Lunven, Guillaume Laforge
https://www.youtube.com/
To view or add a comment, sign in
-
It’s amazing how quickly things change! Less than two years ago, if you had asked me about RAG, I would have thought of a traffic light metaphor—something about the status of a project. 🟢🟡🔴 Fast forward to today, and RAG—𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 𝗔𝘂𝗴𝗺𝗲𝗻𝘁𝗲𝗱 𝗚𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝗼𝗻—has completely transformed how marketing teams leverage AI for content generation. RAG doesn’t just improve 𝗮𝗰𝗰𝘂𝗿𝗮𝗰𝘆 𝗮𝗻𝗱 𝗰𝗼𝗻𝘁𝗲𝘅𝘁; it also opens new doors for 𝘀𝗽𝗲𝗲𝗱 𝗮𝗻𝗱 𝗮𝘂𝘁𝗼𝗺𝗮𝘁𝗶𝗼𝗻, something I’ve seen firsthand with how GoodBards has integrated this technology. While open-source platforms still require a lot of manual effort, Good Bards’ solution brings the power of automation to marketers, enabling teams to move faster and more efficiently. If you’re still exploring how to make AI work for your campaigns, I’d highly recommend diving deeper into this tech and seeing what Good Bards has done with it! 🚀 And for my tech-savvy connections, click the link below to see our CTO Cédrick Lunven, who spoke on the subject, alongside Guillaume Laforge, at Devoxx Belgium this month #RAG #RetrievalAugmentedGeneration #GoodBards #MarketingInnovation #AIForMarketing #Automation
🚀𝗧𝗵𝗲 𝗲𝘃𝗼𝗹𝘂𝘁𝗶𝗼𝗻 𝗼𝗳 𝗴𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝘃𝗲 𝗔𝗜 has reached a new frontier with 𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 𝗔𝘂𝗴𝗺𝗲𝗻𝘁𝗲𝗱 𝗚𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝗼𝗻 (𝗥𝗔𝗚). By pairing large language models with real-time data retrieval, RAG allows for more accurate, relevant, and 𝗱𝘆𝗻𝗮𝗺𝗶𝗰 𝗰𝗼𝗻𝘁𝗲𝗻𝘁 𝗰𝗿𝗲𝗮𝘁𝗶𝗼𝗻—transforming the way marketing teams operate. Instead of relying on static, pre-trained knowledge, RAG empowers AI to generate content that is timely and personalized to specific campaigns and audiences. ✨At GoodBards, we're proud to integrate RAG into our platform, allowing marketing teams to harness this cutting-edge technology for 𝗮𝗴𝗶𝗹𝗲, 𝗱𝗮𝘁𝗮-𝗱𝗿𝗶𝘃𝗲𝗻 𝗰𝗮𝗺𝗽𝗮𝗶𝗴𝗻𝘀. What's more, our very own CTO, Cédrick Lunven, is one of the leading java developers in this space. He recently shared the stage at 𝗗𝗲𝘃𝗼𝘅𝘅 𝟮𝟬𝟮𝟰 in Antwerp with Google Cloud's Guillaume Laforge, where they discussed the latest innovations in AI and cloud technology. ✨ 💡 Cedrick's expertise in RAG and AI-driven marketing is shaping the future of how businesses can leverage these tools to stay ahead. If you're ready to take your marketing to the next level, Good Bards is here to make it happen! 💡 To see Cedrick in action check out the recording of the session: https://lnkd.in/dim-PnDp For a Good Bards demo, simply Contact Us on www.goodbards.com #GoodBards #GenerativeAI #RAG #MarketingInnovation #Devoxx2024 #AIForMarketing #AgileMarketing
From naive to advanced RAG: the complete guide by Cédrick Lunven, Guillaume Laforge
https://www.youtube.com/
To view or add a comment, sign in
-
Can a network achieve infinite accuracy with a fixed width? Paper from Ziming Liu, Yixuan Wang, Sachin Vaidya, Fabian Ruehle, James Halverson, Marin Soljačić, Thomas Y. Hou, Max Tegmark. Introduce Kolmogorov-Arnold Networks (KAN) as alternative to MLPs, where activation functions have been moved from nodes (neurons) to edges (weights) and this functions are learnable. https://lnkd.in/dDD_Texv
KAN: Kolmogorov-Arnold Networks
arxiv.org
To view or add a comment, sign in
-
Here's my summary of the recent cool KAN (Kolmogorov-Arnold Networks) paper.
Yes, we KAN!
gonzoml.substack.com
To view or add a comment, sign in