Andrew Gamino-Cheong’s Post

CTO & Co-Founder at Trustible

8mo

Deciding which LLM to use is hard. Trying to factor in the legal, ethical, or risk considerations of each model is even harder. A lot of this information is hidden deep in 50+ page technical reports, and often the key information necessary to determine whether a model is appropriate to use a task isn't disclosed. We're looking to help solve that problem by releasing our first ever Model Transparency Ratings. We analyzed the public documentation of the top 21 LLMs against the requirements of General Purpose AI Models from the EU AI Act and created a scoring criteria to identify which models may be riskier to use for AI products in the EU. ------- A few insights from our first set of ratings: 1️⃣ Models have been getting LESS transparent over time. Many providers like Meta, Cohere, and Mistral are less transparent about their newer models than their older ones. 2️⃣ Very few LLM providers disclose their data sources. I'd argue that understanding the data sources is MORE important to understanding the risks of an AI system than the model architecture, but we see decent model transparency, and poor data transparency. 3️⃣ "Open Source LLMs" do generally better, but we had to separate the idea of 'open weights' and 'open data' to better distinguish the risks and expectations. ------- Check out our ratings at: aimodelratings.com To read more about Trustible's methodology, check out our blog post here: https://lnkd.in/gShTuTGx We'll be expanding our ratings to include both new models, and include additional risk criteria. Share what you'd want to see in future versions of our ratings in the comments!

Inside Trustible’s Methodology for Model Transparency Ratings

trustible.ai

3 Comments

Eric Scott Lavin

Education Innovation Leader | K-100 Learning Strategist | EdTech Investor & Entrepreneur

8mo

Yusuf Ahmad

2 Reactions

Elena Gurevich

AI Policy-Curious Attorney | Owner @ EG Legal Services | Director of Development at Center for Art Law

8mo

Alisa S.

1 Reaction

Denis Peskoff

Postdoctoral Fellow in Computer Science

8mo

Joseph Barrow curious about your thoughts

See more comments

To view or add a comment, sign in

More Relevant Posts

VendEx Solutions

485 followers
7mo
Report this post
VendEx is thrilled to announce we've received a patent for our VendEx Identifier (VID), which standardizes the categorization and identification of all data, establishing provenance and streamlining data transactions. Data provenance is a critical factor in the control of IP as generative AI accelerates the demand for data in training large language models. https://buff.ly/3JynxUT #dataidentifier #standardidentifier #dataprovenance #generativeAI

VendEx Solutions Patents the Standard Identifier for Data

prweb.com

1 Comment
Like Comment
To view or add a comment, sign in
PigeonLine - Research-AI

420 followers
8mo
Report this post
https://lnkd.in/eJpqGePr The real power of AI we believe lies in the ability of public servants to cautiously learn how to work with, interrogate, and negotiate with AI-generated data, suggestions, and recommendations.

AI could help automate around 84% of repetitive service transactions across government

turing.ac.uk
Like Comment
To view or add a comment, sign in
Darren Beardsley

Americas Tax AI Leader
4mo
Report this post
The integration of AI into financial services, tax-related functions included, necessitates that we embrace responsibility frameworks as a core of how we innovate. To me, effective governance is the best way to get ahead of potential hiccups and ensure regulatory compliance while exploring new tech. Financial service leaders should implement enterprise-level controls and clearly delineate roles to keep organizational policies up to date with AI standards. To be even more accountable, FIs should establish processes to streamline reporting to regulators and maintain all records and logs, especially automated ones. Monitoring and documentation have always been crucial for tax services, and that hasn’t changed with the introduction of AI. For more insights on integrating these agile yet robust pillars of responsible AI, I highly recommend this article from Khalid Khan, PhD and Dietrich Chen: https://lnkd.in/gdhYqSPM

Four actions to pioneer responsible AI in any industry

ey.com
Like Comment
To view or add a comment, sign in
Michele I. Kelsey

Chief Data Officer
7mo Edited
Report this post
Really exciting news from VendEx Solutions! This is defining the next phase of digitizing the data business not only for financial markets, but for all data.

VendEx Solutions

485 followers
7mo

VendEx is thrilled to announce we've received a patent for our VendEx Identifier (VID), which standardizes the categorization and identification of all data, establishing provenance and streamlining data transactions. Data provenance is a critical factor in the control of IP as generative AI accelerates the demand for data in training large language models. https://buff.ly/3JynxUT #dataidentifier #standardidentifier #dataprovenance #generativeAI

VendEx Solutions Patents the Standard Identifier for Data

prweb.com
Like Comment
To view or add a comment, sign in
Fuad Khan

Building expertise on AI Governance and Project Researcher at Centre for Collaborative Research CCR
4mo
Report this post
Researchers from the Centre for the Governance of AI wrote a piece on visibility measures for AI agents to reduce risks. Three measures: "agent identifiers", "Real-time monitoring" and "Activity logs" are suggested by them. From product or service interaction point of view, "agent identifiers" is a pre interaction, "Real-time monitoring" is during interaction and "Activity logs" is a post interaction mechanisms, which can be deployed to avoid any undesired incidents. These visibility measures are essentially transparency pillar under the larger governance umbrella to mitigate risks as a result of AI agents deployment. While this is a fantastic write up and guidance for transparency, next is perhaps accountability pillar of governance that needs to be taken into consideration. The authors did not shy away from this point as they stated "information (obtained through such measures) is insufficient for reducing risks. We also need effective processes to allow relevant actors to use and act on the information......"

Visibility into AI Agents | GovAI Blog

governance.ai
Like Comment
To view or add a comment, sign in
Scott Rodger

Associate Solicitor - Senior Competition and regulated markets lawyer at Shepherd and Wedderburn
8mo
Report this post
CMA Update on AI Foundation Models: This is a really interesting read / statement of intent from the Competition and Markets Authority on AI Foundation Models and the concerns it has identified as this technology develops, particularly around the scope of large incumbent techcos to entrench existing market power. It also provides a useful summary of recent movements and changes in the market as it has evolved over the past months. Paras 53-54 also a clear warning for those who use AI tools to provide customer service etc. - the CMA will hold you liable for consumer harm incurred from the output of those tools, and will be very willing to use its new enforcement powers under the Digital Markets, Competition and Consumers Bill (when that becomes law): "We are ready to use these new powers to raise standards in the market and, if necessary, to tackle firms that do not play by the rules through enforcement action."

AI Foundation Models: Update paper

gov.uk

1 Comment
Like Comment
To view or add a comment, sign in
Avishay (AJ) Segal, MBA

Author | AI-Enthusiast | External Think Tank | HBR Advisory Council |
1mo Edited
Report this post
𝐈𝐒 𝐀𝐈 𝐀 𝐏𝐀𝐒𝐒𝐈𝐍𝐆 𝐅𝐀𝐍𝐂𝐘? Are AI companies really plateauing, that is, "atrophying" or our perceived notions of progress or more accurately, lack of, true? The average adoption rate of AI globally stands at 26%-35% (regional variance), India is the only country on the planet whose average adoption rate stands at 30%-40%. That means that the greater majority of the world either has never heard of AI or has no idea what it does. 𝐈𝐬 𝐀𝐈 𝐏𝐥𝐚𝐭𝐞𝐚𝐮𝐢𝐧𝐠? What the papers consider as "plateauing", just means that the rate at which features and tools are launched has diminished and is now becoming more standardised. Even o1-Preview is relevant as of September 2021; I don't know if you noticed, but that's more than 3 years ago. And if that is the level of reasoning we see, that also means the Computer Use by Claude isn't really new, it was just launched recently. 𝐀𝐫𝐞 𝐖𝐞 𝐚𝐭 𝐭𝐡𝐞 𝐅𝐨𝐫𝐞𝐟𝐫𝐨𝐧𝐭 𝐨𝐫 𝐣𝐮𝐬𝐭 𝐃𝐨𝐧'𝐭 𝐊𝐧𝐨𝐰 𝐀𝐧𝐲 𝐁𝐞𝐭𝐭𝐞𝐫? Even the concept behind it isn't new: Remotely controlling computers has been the staple of ITES companies since the 1960s, when the original developments started as early as the late 19th century by the likes of Nikola Tesla. The current iteration though, is the innovation. And even had this innovation been considered "old", for the early adopters it isn't. 𝐓𝐡𝐞 𝐀𝐝𝐨𝐩𝐭𝐢𝐨𝐧 𝐑𝐚𝐭𝐞 𝐚𝐦𝐨𝐧𝐠 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬𝐞𝐬 𝐢𝐬 𝐋𝐨𝐰𝐞𝐫 𝐭𝐡𝐚𝐧 𝐀𝐦𝐨𝐧𝐠 𝐈𝐧𝐝𝐢𝐯𝐢𝐝𝐮𝐚𝐥𝐬 Businesses, for the most part, are not even within the early adopters. Many businesses find resistance either from the executives, who can't yet pinpoint the ROI of using AI, or the employees who are afraid of losing their jobs if their companies do decide to adopt AI tomorrow. It would actually be nice to see launches which are not every 6 weeks, as that creates major confusion for companies which are looking to adopt AI tech for their business and have to rethink their strategy every quarter. #ai #adoption #claude
Sasha Krecinic

Investor at Headline
1mo

Is AI progress starting to 'slow down'? Have we 'tapped out' all the data? Short Answer: lol, no 😂 Long Answer: AI's momentum is parallelized now, meaning there are several viable pathways to build on (both on the research and scaling side). Some models underperform expectations, and some models aren't released due to competitive concerns or safety and alignment concerns. You cannot judge progress based on consumer-facing product releases. The best way to track the frontier is through research developments. As Ilya said, "Scaling the right thing matters more now than ever," hence the focus on parallelization. Both OpenAI and Anthropic have said they have a clear line of sight on where to build for the next 18-24 months. Beyond that, it is hard to plan because the frontier is actually moving so quickly (except for things like large infrastructure projects which can have long lead times). Why are people saying this then? These comments are usually taken out of context. They often reflect one small part of the picture and can be misleading. A quick litmus test is to ask if they know what Arxiv is (https://lnkd.in/gZWd7gwY) or what the most recent research paper they read was. If they can't answer, it's unlikely they are tracking the broader AI landscape. "Not much has happened since ChatGPT" — what do you say to this? Some of the biggest developments have occurred in the last two months and came sooner than many in the field expected. Here are a few examples: - Test time training/compute: https://lnkd.in/gSeFqG4b - Real-time voice API: https://lnkd.in/gK8bKeEK - Computer Use: https://lnkd.in/gn_8f222 Each of these has the potential to transform industries. The thing that stumps most people in this industry is how little coverage these developments have gotten. So when someone says AI progress is "losing steam," ask them what research papers they read to form this opinion... 🙃
6 Comments
Like Comment
To view or add a comment, sign in
Chelsea Gordon

Senior Associate at MinterEllison
3mo Edited
Report this post
Hats off to ASIC for performing this analysis. I think these findings highlight the importance of carefully assessing each use case to maximise value. Personally I find AI LLMs most useful when I ask them to provide brief answers to specific questions.

AI worse than humans in every way at summarising information, government trial finds

https://www.themandarin.com.au
Like Comment
To view or add a comment, sign in
Simon Cooper

Partner- Deloitte Digital I Customer | Technology | Transformation I Author
5mo
Report this post
A year on, the points in our article on how Governments can safely embrace AI broadly stand including around the increased awareness of innovation within clearer guardrails and government's exploring the use cases for productivity and enhanced service delivery. Tom Burton Martin Stewart-Weeks #genai https://lnkd.in/gTUevxZm

How government can safely embrace AI

afr.com
Like Comment
To view or add a comment, sign in

2,006 followers

122 Posts

View Profile Connect

Andrew Gamino-Cheong’s Post

More Relevant Posts

Explore topics