𝗨𝗻𝗶𝗻𝘁𝗲𝗻𝗱𝗲𝗱 𝗖𝗼𝗻𝘀𝗲𝗾𝘂𝗲𝗻𝗰𝗲𝘀 𝗶𝗻 𝗗𝗲𝗽𝗹𝗼𝘆𝗶𝗻𝗴 𝗔𝗜 𝗠𝗼𝗱𝗲𝗹𝘀 𝘞𝘩𝘢𝘵 𝘚𝘤𝘩𝘦𝘮𝘪𝘯𝘨 𝘪𝘴 𝘢𝘯𝘥 𝘏𝘰𝘸 𝘵𝘰 𝘍𝘪𝘹 𝘪𝘵? Agentic AI systems are now capable of using deception to achieve their goals. As foundation models grow smarter, this raises critical questions about AI safety. Misaligned #AI behaviors can lead to harmful and unethical consequences, making it essential for organizations to adopt proactive #safety measures. One key solution? 🔺 AI Red Teaming 🔺 This approach involves monitoring models and detecting unwanted actions before deployment. To learn more about how AI Red Teaming helps mitigate risks and build safer AI systems, read here 👉 https://lnkd.in/dFs4tBKt
About us
ActiveFence is a Trust and Safety provider for online platforms, protecting platforms and their users from malicious behavior and content. Trust and Safety teams of all sizes rely on ActiveFence to keep their users safe from the widest spectrum of online harms, unwanted content, and malicious behavior, including child safety and exploitation, disinformation, hate speech, terror, nudity, fraud, and more. We offer a full stack of capabilities with our deep intelligence research, AI-driven harmful content detection, and online content moderation platform. Protecting over three billion users globally every day in over 100 languages, ActiveFence lets people interact and thrive online.
- Website: https://www.activefence.com/
- Industry: Software Development
- Company size: 201-500 employees
- Headquarters: New York
- Type: Privately Held
- Founded: 2018
Locations
- Primary: New York, US
- New York, NY, US
Updates
-
ActiveFence reposted this
Much of my time these days is spent (or rather, invested!) in advising our partners and customers on what the UK's Online Safety Act means for them. There’s a wealth of information available, but it can be overwhelming if you’re not accustomed to navigating content alignment and online safety issues regularly. Under the Act, fines of up to £18 million or 10% of qualifying worldwide revenue (whichever is greater) can be imposed on Online Service Providers that fail to understand their responsibilities and act accordingly. If you fall into one of the following categories, the Act requires you to take measures to assess your exposure to, detect, and counter illegal content: 💡 User-to-User Services (U2U Services) – Platforms that enable users to interact with one another, such as: - Social media platforms - Online forums - Video-sharing websites - Consumer cloud storage and file-sharing platforms - Dating apps - Instant messaging services 💡 Search Services – Services with search engine functionality, allowing users to search multiple websites or databases. Illegal content includes, but is not limited to: - Terrorism - Child Sexual Exploitation and Abuse (CSEA) offenses - Grooming - CSAM images - CSAM URLs - Hate - Harassment - Stalking, threats, and abuse - Intimate image abuse and sexual exploitation - Human trafficking - Fraud - Proceeds of crime - Animal cruelty - Self-harm - State-sponsored interference The first step is to complete comprehensive risk assessments for illegal content, identifying potential exposure to these issues and estimating their impact. The deadline for this critical task is March 2025. If this applies to you, make sure to visit https://lnkd.in/e_z89REQ and send it to your safety/compliance team. This should be taken very seriously.
ActiveFence and I are here to answer any questions and to support your process, from assessment to implementation of the required safety measures to protect your platform, community, and business.
-
🎉 𝟮𝟱𝗞 𝘀𝘁𝗿𝗼𝗻𝗴! 𝗧𝗵𝗮𝗻𝗸 𝘆𝗼𝘂 𝗳𝗼𝗿 𝗯𝗲𝗶𝗻𝗴 𝗽𝗮𝗿𝘁 𝗼𝗳 𝗼𝘂𝗿 𝗷𝗼𝘂𝗿𝗻𝗲𝘆 🎉 We’re thrilled to share that ActiveFence has hit a major milestone: 25,000 followers here on LinkedIn! From insightful research to cutting-edge solutions, we strive to lead the conversation around Trust and Safety, and it’s your engagement, support, and feedback that fuel our mission. 𝗘𝘃𝗲𝗿𝘆 𝗳𝗼𝗹𝗹𝗼𝘄 𝗿𝗲𝗽𝗿𝗲𝘀𝗲𝗻𝘁𝘀 𝗮 𝘀𝗵𝗮𝗿𝗲𝗱 𝗯𝗲𝗹𝗶𝗲𝗳 𝗶𝗻 𝗺𝗮𝗸𝗶𝗻𝗴 𝘁𝗵𝗲 𝗼𝗻𝗹𝗶𝗻𝗲 𝘄𝗼𝗿𝗹𝗱 𝘀𝗮𝗳𝗲𝗿. Here’s to more milestones ahead, and 𝘁𝗵𝗮𝗻𝗸 𝘆𝗼𝘂 for being on this journey with us. 💙 #TrustandSafety #Community #OnlineSafety
-
𝗔𝗜 𝗦𝗮𝗳𝗲𝘁𝘆 𝗥𝗼𝘂𝗻𝗱𝘁𝗮𝗯𝗹𝗲𝘀: 𝗛𝗶𝗴𝗵𝗹𝗶𝗴𝗵𝘁𝘀 𝗳𝗿𝗼𝗺 𝗡𝗬𝗖 & 𝗟𝗼𝗻𝗱𝗼𝗻 ✨ Last week, we hosted AI Safety Roundtables in two iconic locations. These intimate gatherings brought together industry leaders, innovators, and experts to discuss the most pressing challenges and opportunities in AI Content Safety. Highlights included: 🔹 Iftach Orr, our CTO, presenting the latest AI Safety Briefing. 🔹 A fireside chat in NYC with Tomer Poran, our Chief Evangelist, and John Liu, Head of Product for Amazon Web Services (AWS) Bedrock. 🔹 Lively Q&A sessions and thought-provoking discussions that left us inspired to continue driving these crucial conversations forward. We’re grateful to everyone who joined us and shared their insights. Stay tuned for details about upcoming events. #AISafety #GenerativeAI #TrustandSafety #Innovation
-
🚨 New ActiveFence Alert 🚨 An interview with a Russian Foreign Minister got millions of views and tons of praise, especially from American women. Was this admiration genuine, or part of a larger campaign? Stay ahead of the game: read this ActiveFence Alert to learn how to identify and respond effectively to trending narratives like this one 👉 https://lnkd.in/d2DmCnmq
-
✅ 𝗬𝗼𝘂𝗿 𝗨𝗹𝘁𝗶𝗺𝗮𝘁𝗲 𝗧𝗿𝘂𝘀𝘁 𝗮𝗻𝗱 𝗦𝗮𝗳𝗲𝘁𝘆 𝗩𝗲𝗻𝗱𝗼𝗿 𝗖𝗵𝗲𝗰𝗸𝗹𝗶𝘀𝘁: 𝗦𝗶𝗺𝗽𝗹𝗶𝗳𝘆, 𝗘𝘃𝗮𝗹𝘂𝗮𝘁𝗲, 𝗣𝗿𝗼𝘁𝗲𝗰𝘁 Navigating the world of Trust and Safety solutions can be overwhelming with so many factors to consider: detection, automation, moderation, and compliance. But don't worry! Our new checklist ✅ gives you a clear, actionable framework to confidently evaluate vendors and choose the best solution for your platform. Ready to make your platform safer and more trustworthy? Download now 👉 https://lnkd.in/daP9kZ5p
-
🚀 𝗪𝗲’𝗿𝗲 𝘁𝗵𝗿𝗶𝗹𝗹𝗲𝗱 𝘁𝗼 𝗯𝗲 𝗽𝗮𝗿𝘁 𝗼𝗳 𝗚𝗲𝗻𝗠𝗟 𝟮𝟬𝟮𝟰! 🌟 Join our EVP of Engineering, Avi Golan, at the ActiveFence booth to dive into the cutting-edge world of Generative AI. Discover how we’re leading the charge in ensuring AI advancements remain safe, responsible, and impactful. Let’s shape the future of AI, together. See you there! 👋 #GenML2024 #GenerativeAI #AISafety #TrustandSafety
-
𝗚𝗲𝗻𝗔𝗜 𝗚𝘂𝗮𝗿𝗱𝗿𝗮𝗶𝗹𝘀: 𝗕𝗿𝗲𝗮𝗸𝗳𝗮𝘀𝘁 & 𝗥𝗼𝘂𝗻𝗱𝘁𝗮𝗯𝗹𝗲 𝗗𝗶𝘀𝗰𝘂𝘀𝘀𝗶𝗼𝗻 | 𝗟𝗼𝗻𝗱𝗼𝗻 Join us for a Breakfast & Roundtable Discussion, where we’ll dive into the evolving landscape of trust, #safety, and policy in enterprise environments. As #Generative #AI technologies transform industries, the need for robust frameworks to secure AI systems has never been more critical. 📍 Where: Searcys at The Gherkin, 𝗟𝗼𝗻𝗱𝗼𝗻 🗓️ When: Thursday, December 12, 2024 | 8:30 AM GMT 𝗧𝗵𝗶𝘀 𝗶𝘀 𝗮𝗻 𝗶𝗻𝘃𝗶𝘁𝗲–𝗼𝗻𝗹𝘆 𝗲𝘃𝗲𝗻𝘁 𝘙𝘦𝘨𝘪𝘴𝘵𝘦𝘳 𝘺𝘰𝘶𝘳 𝘪𝘯𝘵𝘦𝘳𝘦𝘴𝘵 𝘩𝘦𝘳𝘦 👉 https://lnkd.in/dqY6Be8f
-
𝗥𝗲𝗮𝗹-𝗪𝗼𝗿𝗹𝗱 𝗜𝗺𝗽𝗮𝗰𝘁 𝗼𝗳 𝗖𝗵𝗶𝗻𝗲𝘀𝗲 𝗜𝗻𝗳𝗹𝘂𝗲𝗻𝗰𝗲 𝗶𝗻 𝗠𝗮𝗻𝗶𝗽𝘂𝗿 Chinese malign influence campaigns are not just abstract threats; they have tangible real-world consequences. In India’s northeastern state of Manipur, harmful narratives, often linked to China-affiliated accounts, have exacerbated existing tensions, fueling violence between communities. These narratives exploit grievances to promote division and target the BJP-led government, deepening the crisis. At ActiveFence, we’ve been closely tracking these narratives since the conflict escalated in May 2023. The violence and polarization in Manipur highlight the urgent need to stay ahead of such campaigns before they escalate. 👉 Contact us to learn more: https://lnkd.in/dRC9K3Gj Hayley Sweet
-
🎮 𝗚𝗮𝗺𝗶𝗻𝗴 𝘀𝗮𝗳𝗲𝘁𝘆 𝗶𝗻 𝘁𝗵𝗲 𝘀𝗽𝗼𝘁𝗹𝗶𝗴𝗵𝘁! Our AI-powered content moderation is featured in NBC News. We’re safeguarding the player experience with real-time, AI-driven solutions to combat in-game toxicity and ensure a safe, enjoyable environment for all. With the fastest response times, seamless implementation, and native partnerships, we create trusted, secure spaces for gamers worldwide. ⚡ Speed matters. 🤝 Trust matters. 🎮 Let’s keep gaming safe and thriving. To protect your players, contact us 👉 https://lnkd.in/dEYyghSi Check out the feature 👉 https://lnkd.in/dJSQnd5F