Recommendations
Explore
Contribute
About
Updates
Build
Fund
Research
Measure
AI & ML
Practice
Recommendations
Build
Fund
Research
Measure
AI & ML
Practice
Explore
Contribute
About
Updates
← Explore Research Questions
Research Question
How do we prevent gaming or manipulation of AI backup systems?
Related Goals
Hybrid human-AI systems that can provide legitimate backup mechanisms.
Related Capabilities
Handle challenges
Robustness
Ability to withstand changing contexts and less-than-ideal conditions.
Related Existing Resources
Research
Adversarial testing for Generative AI
Google’s guide defining adversarial testing as systematically evaluating ML models against malicious or inadvertently harmful input, covering explicit queries (containing policy-violating language) and implicit queries (seeming harmless but involving sensitive topics). The four-stage workflow inv...