Tag
9 articles
Learn how to set up and use OpenAI's new Trusted Contact feature in ChatGPT, which helps connect users with support when discussing self-harm or mental health concerns.
This explainer article explains the basics of artificial intelligence and how different people in the AI field can have different opinions about developing safe AI technologies.
Learn how to set up a basic AI content safety environment using Python and popular libraries, including pattern matching and semantic analysis techniques.
Learn to build a basic AI safety monitoring dashboard that tracks and analyzes discussions about AI risks and safety measures in online communities.
Learn how to use AI tools like ChatGPT responsibly with best practices for accuracy, transparency, and ethical usage.
Learn to build a practical model specification framework that balances AI safety, user freedom, and accountability, similar to OpenAI's approach.
Learn to build a basic AI safety monitoring system that detects potentially harmful content, similar to OpenAI's teen safety policies. This beginner-friendly tutorial teaches you how to implement keyword detection, sentiment analysis, and safety violation handling.
Learn to build a basic agentic AI monitoring system that demonstrates the core concepts behind Cisco's DefenseClaw, which aims to make enterprise AI safer through orchestration.
Learn how to implement Chain-of-Thought prompting techniques using Hugging Face Transformers to guide language models toward more structured reasoning patterns, similar to OpenAI's CoT-Control research.