NewsBytes Stage
    Hindi
    More
    In the news
    Narendra Modi
    Amit Shah
    Box Office Collection
    Bharatiya Janata Party (BJP)
    OTT releases
    Hindi
    NewsBytes Stage
    India
    Business
    World
    Politics
    Sports
    Technology
    Entertainment
    Auto
    Lifestyle
    Career
    Visual Stories
    Find Cricket Statistics

    Download Android App

    Follow us on
    • Facebook
    • Twitter
    • Linkedin
    Home / News / Technology News / 'Skeleton key' vulnerability found in AI tools, Microsoft urges caution
    Summarize
    Next Article
    'Skeleton key' vulnerability found in AI tools, Microsoft urges caution
    Guardrails are bypassed by the user's deceptive claim of safety

    'Skeleton key' vulnerability found in AI tools, Microsoft urges caution

    By Dwaipayan Roy
    Jul 02, 2024
    12:39 pm

    What's the story

    AI companies are facing a new challenge, as users discover innovative ways to circumvent the security measures in place, to prevent chatbots from aiding in illegal activities.

    Earlier this year, a white hat hacker found a "Godmode" ChatGPT jailbreak that enabled the chatbot to assist in producing meth and napalm, an issue OpenAI promptly addressed.

    However, Microsoft Azure CTO Mark Russinovich recently acknowledged another jailbreaking technique known as "Skeleton Key."

    New technique

    'Skeleton Key' jailbreak: A multi-step strategy

    The "Skeleton Key" attack employs a multi-step strategy to manipulate the system into violating its operators' policies, heavily influenced by a user, and executing harmful instructions.

    In one case, a user asked the chatbot to list instructions for making a Molotov Cocktail under the false pretense of educational safety.

    Despite activating the chatbot's guardrails, they were bypassed by the user's deceptive claim of safety.

    Experiment results

    Jailbreak tests on leading chatbots

    Microsoft tested the "Skeleton Key" jailbreak on several advanced chatbots, including OpenAI's GPT-4o, Meta's Llama3, and Anthropic's Claude 3 Opus.

    Russinovich revealed that the jailbreak was successful on all models, leading him to suggest that "the jailbreak is an attack on the model itself."

    He further clarified that each model was tested across various risk and safety content categories, like explosives, bioweapons, political content, self-harm, racism, drugs, graphic sex and violence.

    Persistent threats

    Ongoing challenges for AI companies

    While developers are likely addressing the "Skeleton Key" jailbreak technique, other methods continue to pose significant threats.

    Adversarial attacks such as the Greedy Coordinate Gradient (BEAST) can still easily overcome guardrails established by companies like OpenAI.

    This persistent issue underscores that AI companies have a substantial amount of work ahead, to prevent their chatbots from spreading potentially harmful information.

    Facebook
    Whatsapp
    Twitter
    Linkedin
    Related News
    Latest
    Artificial Intelligence and Machine Learning
    Microsoft
    OpenAI

    Latest

    Who is India's most successful Test captain on England soil? Indian Cricket Team
    No duty cuts on British wine in India-UK trade deal United Kingdom
    Sneh Rana records career-best WODI returns against SL; Amanjot shines Indian Women's Cricket Team
    TVS's cheapest e-scooter to be launched soon: What we know TVS Motor Company

    Artificial Intelligence and Machine Learning

    Snapchat's on-device AI model changes user background, clothing in real-time Snapchat
    OpenAI's ex-chief scientist Ilya Sutskever launches new AI start-up OpenAI
    Google's Gemini API introduces context caching to optimize AI workflows Google
    Top AI chatbots echoing false narratives from Russian disinformation networks Russia

    Microsoft

    Microsoft unveils AI-driven 'Recall' feature for Windows: How it works Windows 11
    Microsoft launches Surface Pro all-purpose AI PC with Copilot key Satya Nadella
    Microsoft introduces Surface Laptop with an Arm-powered processor: Check features Wi-Fi 7
    Microsoft announces AI-driven Copilot+ PCs: Faster than MacBook Air M3? Qualcomm

    OpenAI

    OpenAI to alter paperwork, scrap controversial nondisparagement agreement with employees Sam Altman
    OpenAI's ChatGPT fails to meet EU data accuracy standards ChatGPT
    xAI, Elon Musk's AI startup, raises $6 billion in funding Elon Musk
    OpenAI sets up safety committee to evaluate AI models Sam Altman
    Indian Premier League (IPL) Celebrity Hollywood Bollywood UEFA Champions League Tennis Football Smartphones Cryptocurrency Upcoming Movies Premier League Cricket News Latest automobiles Latest Cars Upcoming Cars Latest Bikes Upcoming Tablets
    About Us Privacy Policy Terms & Conditions Contact Us Ethical Conduct Grievance Redressal News News Archive Topics Archive Download DevBytes Find Cricket Statistics
    Follow us on
    Facebook Twitter Linkedin
    All rights reserved © NewsBytes 2025