Hindi Tamil Telugu
    More
    In the news
    Narendra Modi
    Amit Shah
    Box Office Collection
    Bharatiya Janata Party (BJP)
    OTT releases
    Hindi Tamil Telugu
    User Placeholder

    Hi,

    Logout

    India
    Business
    World
    Politics
    Sports
    Technology
    Entertainment
    Auto
    Lifestyle
    Inspirational
    Career
    Bengaluru
    Delhi
    Mumbai

    Download Android App

    Follow us on
    • Facebook
    • Twitter
    • Linkedin
    Home / News / Technology News / Training AI on synthetic data: Is it a double-edged sword?
    Summarize
    Next Article
    Training AI on synthetic data: Is it a double-edged sword?
    OpenAI and Anthropic are testing a dual-model system

    Training AI on synthetic data: Is it a double-edged sword?

    By Mudit Dube
    Apr 11, 2024
    04:34 pm

    What's the story

    Artificial Intelligence (AI) companies are increasingly turning to synthetic data as a potential solution to the growing shortage of real-world data for training AI models.

    According to The New York Times, synthetic data could also address concerns over AI copyright infringement.

    Tech giants such as Anthropic, Google, and OpenAI are all striving toward generating high-quality synthetic data, an achievement yet to be realized.

    Habsburg AI

    Challenges faced by AI models based on synthetic data

    AI models that rely heavily on synthetic data have encountered significant challenges.

    Australian AI researcher and podcaster, Jathan Sadowski, coined the term "Habsburg AI" to describe a system that is "heavily trained on the outputs of other generative AIs," resulting in an "inbred mutant, likely with exaggerated, grotesque features."

    The issue was further identified as "Model Autophagy Disorder" or "MAD" by Richard Baraniuk from Rice University after observing malfunctions in their research model following just five generations of AI inbreeding.

    You're
    33%
    through

    Dual-model approach

    OpenAI and Anthropic test dual-model system

    OpenAI and Anthropic are experimenting with a two-model system for generating reliable synthetic data.

    The first model is responsible for producing the data, while the second verifies its accuracy.

    Anthropic has been open about its use of synthetic data, revealing that it uses a set of rules or "constitution" to train its dual-model system.

    The company's latest AI chatbot, Claude 3, has been trained on data "generated internally" and is claimed to be superior to Google Gemini and OpenAI's ChatGPT.

    You're
    66%
    through

    Double-edged sword

    Synthetic data could be a solution going forward

    Synthetic data is generated artificially to mimic real-world data for various purposes such as training AI algorithms.

    Hence, synthetic data offers advantages like privacy preservation, scalability, and copyright issues, the three main hurdles in training AI models, apart from the limited supply of powerful chips.

    But it also raises concerns regarding its accuracy and ethical implications. And an AI model is as good as the data it is trained on.

    Done!
    Facebook
    Whatsapp
    Twitter
    Linkedin
    Related News
    Latest
    OpenAI
    Google
    Artificial Intelligence and Machine Learning

    Latest

    Karan Johar launches podcast 'Live Your Best Life' on Audible Karan Johar
    US: FDA greenlights China's 1st painkiller to combat fentanyl overdoses United States of America
    Amul develops bioethanol from whey, plans ₹70cr plant in Gujarat Amul
    Xiaomi tops global wearable segment, beats Apple with 44% surge Xiaomi

    OpenAI

    Tinder parent teams up with OpenAI for ChatGPT integration Tinder
    Jeff Bezos, NVIDIA to back humanoid robot start-up Figure AI Jeff Bezos
    ChatGPT's Android widget makes AI more accessible than ever ChatGPT
    Microsoft's water consumption rises to 22bn liters amid AI boom Microsoft

    Google

    WhatsApp introduces new bottom navigation bar for Android users WhatsApp
    Google Podcasts to stop working from April 2: Know solution YouTube Music
    Want Pixel 8 for free? Participate in this Google contest Google Pixel 8
    Garena Free Fire MAX codes for today: How to redeem Garena Free Fire MAX

    Artificial Intelligence and Machine Learning

    Microsoft now working on AI chatbot for Xbox support Microsoft
    George Carlin's daughter warns of AI threat after settling lawsuit YouTube
    Amazon discontinues 'Just Walk Out' technology in US: Here's why Amazon
    Nicki Minaj, Billie Eilish, 200+ musicians warn against AI misuse Billie Eilish
    Indian Premier League (IPL) Celebrity Hollywood Bollywood UEFA Champions League Tennis Football Smartphones Cryptocurrency Upcoming Movies Premier League Cricket News Latest automobiles Latest Cars Upcoming Cars Latest Bikes Upcoming Tablets
    About Us Privacy Policy Terms & Conditions Contact Us Ethical Conduct Grievance Redressal News News Archive Topics Archive Download DevBytes Find Cricket Statistics
    Follow us on
    Facebook Twitter Linkedin
    All rights reserved © NewsBytes 2025