Beyond the hype: The reality of AI security

Beyond the hype: The reality of AI security José Manuel
Ortega [email protected]

AI CONNECT 2025 • Introduction to OWASP LLM Top 10
• Ethical implications of AI security • Securing AI systems: Best practices and emerging technologies • The future of AI security

AI CONNECT 2025 Introduction to OWASP LLM Top 10 https://genai.owasp.org

AI CONNECT 2025 Introduction to OWASP LLM Top 10

AI CONNECT 2025 Prompt Injection

AI CONNECT 2025 Indirect Prompt Injection

AI CONNECT 2025 ChiperChat https://arxiv.org/pdf/2308.0 6463

AI CONNECT 2025 Jailbreak prompts • https://jailbreak-llm s.xinyueshen.me/

AI CONNECT 2025 Data Poisoning

AI CONNECT 2025 Adversarial Attacks

AI CONNECT 2025 Adversarial Attacks • Small Perturbations: Adversarial attacks
typically involve adding small, carefully crafted perturbations to the input data that are often imperceptible to humans. These subtle changes can trick the AI system into making wrong predictions or classifications. • Model Vulnerabilities: These attacks exploit specific weaknesses in the machine learning model, such as its inability to generalize well to new, unseen data or the sensitivity of the model to certain types of input. • Impact on Critical Systems: Adversarial attacks can have severe consequences when applied to AI systems in critical domains such as autonomous vehicles, facial recognition systems, medical diagnostics, and security systems.

AI CONNECT 2025

AI CONNECT 2025 Adversarial Attacks • 1. Prompt Injection •
2. Evasion Attacks • 3. Poisoning Attacks • 4. Model Inversion Attacks • 5. Model Stealing Attacks • 6. Membership Inference Attacks

AI CONNECT 2025 • Trust: Users must trust that AI
systems are unbiased and transparent. • Fairness: Avoiding discrimination and ensuring equal treatment across all demographics. • Accountability: Ensuring that actions taken by AI systems can be explained and justified. The Importance of Ethics in AI-Driven Cybersecurity

AI CONNECT 2025 Understanding Algorithmic Bias in AI Systems •
Data Bias: Incomplete or unrepresentative training data. • Algorithmic Bias: Flaws in the algorithm design. • Human Bias: Prejudices introduced by the developers.

AI CONNECT 2025 • False Positives/Negatives: Incorrectly identifying safe activities
as threats or missing actual threats. • Discrimination: Unequal treatment of certain user groups, potentially exposing them to higher risks. • Erosion of Trust: Users may lose confidence in AI systems that are perceived as biased. The Impact of Bias on Cybersecurity Outcomes

AI CONNECT 2025 Strategies to Mitigate Algorithmic Bias • Diverse
Training Data: Ensure the training data represents various demographics and scenarios. • Regular Audits: Conduct periodic reviews of AI systems to identify and correct biases. • Algorithm Transparency: Make the algorithm’s workings understandable and accessible to all stakeholders.

AI CONNECT 2025 • Explainable AI (XAI): Developing AI systems
that can explain their decisions in human terms. • Open Source Models: Sharing AI models publicly to allow scrutiny and improvement by the broader community. • Documentation: Providing detailed documentation on the AI system’s functioning and decision criteria. Ensuring Transparency in AI Algorithms

AI CONNECT 2025 • Ethical Frameworks: Implement frameworks that guide
the ethical use of AI in cybersecurity. • Stakeholder Involvement: Engage diverse stakeholders in developing and monitoring AI systems. • Continuous Improvement: Regularly update AI systems and policies to adapt to new ethical challenges and technological advancements. Balancing Security Needs with Ethical Concerns

AI CONNECT 2025 • Inclusive Design: Incorporate input from diverse
groups during AI system development. • Bias Mitigation Techniques: Use techniques like reweighting or re-sampling to reduce bias in training data. • Ethics Committees: Establish committees to oversee the ethical aspects of AI deployment. Best Practices for Ethical AI Implementation in Cybersecurity

AI CONNECT 2025 • Decision Review: Humans should review critical
decisions made by AI systems. • Exception Handling: Develop protocols for handling exceptions and anomalies that AI cannot address. • Ongoing Training: Train security personnel to understand and effectively oversee AI systems. The Role of Human Oversight in AI-Based Security Systems

AI CONNECT 2025 Securing AI systems: Best practices and emerging
technologies • 1. Adversarial Defense

technologies • 1. Adversarial Defense • Adversarial Training: This involves exposing AI models to adversarial examples during training to make them more robust. • Defensive Distillation: This technique involves training a model to be less sensitive to adversarial inputs by smoothing out the decision boundaries. • Robust Optimization: Utilizing optimization techniques that focus on creating models resilient to adversarial perturbations.

technologies • 1. Adversarial Defense • Explainable AI (XAI): By making AI systems more transparent and interpretable, we can better understand why a system is vulnerable to adversarial attacks and improve its robustness. • Certified Defenses: Researchers are developing methods that provide provable guarantees that a model is resistant to adversarial examples, leading to more secure AI systems.

technologies • 2. Secure Data Handling and Privacy Preservation • Data Encryption: Encrypt data both at rest and in transit to ensure that it cannot be accessed or tampered with by unauthorized actors. • Federated Learning: This technique allows models to be trained on decentralized data, ensuring that raw data is not shared across servers. This improves privacy by keeping sensitive data local. • Differential Privacy: A method that adds noise to datasets or model outputs to protect individual data points from being exposed.

technologies • 2. Secure Data Handling and Privacy Preservation • Homomorphic Encryption: A technology that allows computations to be performed on encrypted data without decrypting it first, ensuring privacy during processing. • Secure Multi-Party Computation (SMPC): This allows different parties to collaboratively perform computations on data without revealing the underlying data itself.

technologies • 3. AI Model Integrity and Trustworthiness

technologies • 3. AI Model Integrity and Trustworthiness • Regular Model Audits: Continuously auditing AI models to ensure they are working as expected, and recalibrating them when necessary to maintain their integrity. • Bias Detection: Implement methods to identify and mitigate bias in AI models, ensuring fairness and preventing discriminatory outcomes. • Secure Model Versioning: Track and control the versions of models deployed, making sure that no unauthorized changes can be made without proper validation.

technologies • 3. AI Model Integrity and Trustworthiness • Blockchain for Model Integrity: Blockchain can be used to create transparent, immutable records of AI model versions, ensuring their integrity and making it easier to track changes. • AI Testing Frameworks: New testing frameworks are emerging that can automatically assess AI models for vulnerabilities, biases, and overall performance.

AI CONNECT 2025 The future of AI security • Adversarial
Machine Learning • AI as a Security Tool • AI-Powered Privacy Protection • Regulations and Standards • Human-AI Collaboration

AI CONNECT 2025

AI CONNECT 2025 • github.com/greshake/llm-security • github.com/corca-ai/awesome-llm-security • github.com/facebookresearch/PurpleLlama •
github.com/protectai/llm-guard • github.com/cckuailong/awesome-gpt-security • github.com/jedi4ever/learning-llms-and-genai-for-dev -sec-ops • github.com/Hannibal046/Awesome-LLM

AI CONNECT 2025 • https://cloudsecurityalliance.org/artifacts/security-implicatio ns-of-chatgpt • https://www.nist.gov/itl/ai-risk-management-framework • https://blog.google/technology/safety-security/introducing-g
oogles-secure-ai-framework • https://owasp.org/www-project-top-10-for-large-language- model-applications/

Beyond the hype: The reality of AI security

Beyond the hype: The reality of AI security

More Decks by jmortegac

Other Decks in Technology

Featured

Transcript