• Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Home Artificial Intelligence

OpenAI Released A Voice Cloning Model That Needs Only A 15-Sec Clip

Paul Balo by Paul Balo
March 31, 2024
in Artificial Intelligence
Share on FacebookShare on Twitter

OpenAI has introduced Voice Engine, a revolutionary text-to-voice generation platform. This technology can create lifelike synthetic voices based on a mere 15-second audio clip of a person’s voice, offering a plethora of possibilities across various industries. Simply put, OpenAI will now reproduce a voice just from a 15 second clip. Think about how this would potentially change the voice over game especially in the film and general media industries. In a sample that OpenAI provided via a blog post, it gave 15 second clips in English and Spanish and then the regenerated AI versions. It was hard to tell really and that’s the issue with AI – deepfakes. The first one below is the reference, 

Audio Player
https://tbwpfiles.s3.eu-west-2.amazonaws.com/wp-content/uploads/2024/03/01213540/age-of-learning-reference.mp3
00:00
00:00
00:00
Use Up/Down Arrow keys to increase or decrease volume.

 

Audio Player
https://tbwpfiles.s3.eu-west-2.amazonaws.com/wp-content/uploads/2024/03/01213540/age-of-learning-spanish-reference.mp3
00:00
00:00
00:00
Use Up/Down Arrow keys to increase or decrease volume.
Audio Player
https://tbwpfiles.s3.eu-west-2.amazonaws.com/wp-content/uploads/2024/03/01213540/age-of-learning-spanish-aprendizaje-compartido.mp3
00:00
00:00
00:00
Use Up/Down Arrow keys to increase or decrease volume.
Audio Player
https://tbwpfiles.s3.eu-west-2.amazonaws.com/wp-content/uploads/2024/03/01213540/age-of-learning-spanish-abc-mouse.mp3
00:00
00:00
00:00
Use Up/Down Arrow keys to increase or decrease volume.

Voice Engine enables the creation of AI-generated voices that can seamlessly read out text prompts in multiple languages, mirroring the accent and intonation of the original speaker. OpenAI emphasized the positive impact of Voice Engine, highlighting its potential applications in education, healthcare, communication, and beyond.

The platform has already garnered attention from leading companies, including Age of Learning, HeyGen, Dimagi, Livox, and Lifespan, who are utilizing Voice Engine to enhance their offerings. Age of Learning, for instance, is leveraging the technology to generate pre-scripted voice-over content and deliver personalized responses to students, powered by the advanced capabilities of GPT-4.

Voice Engine represents the culmination of extensive research and development efforts by OpenAI, with the model trained on a diverse dataset comprising licensed and publicly available data. Jeff Harris from OpenAI’s product team underscored the significance of this innovation, noting its integration into ChatGPT’s Read Aloud feature and the text-to-speech API.

While AI text-to-audio generation continues to evolve rapidly, ethical considerations remain paramount. OpenAI has implemented stringent usage policies to ensure responsible deployment of Voice Engine. Partners are required to obtain explicit consent from original speakers, refrain from impersonation, and disclose the AI-generated nature of the voices. Additionally, watermarking and active monitoring mechanisms are employed to track the usage of audio clips.

As the technology landscape evolves, OpenAI advocates for proactive measures to mitigate potential risks associated with AI voice technologies. This includes phasing out voice-based authentication for sensitive transactions, implementing robust policies to safeguard individuals’ voices, raising awareness about AI deepfakes, and developing systems to track AI-generated content.

With Voice Engine poised to redefine the boundaries of synthetic voice generation, OpenAI continues to lead the charge in driving innovation while prioritizing ethical considerations and societal well-being.

Related Posts:

  • OpenAI Unveils Enhanced ChatGPT With Voice Commands And Image Interaction
    OpenAI Unveils Enhanced ChatGPT With Voice Commands…
  • cf121196-1-CHATGPT
    ChatGPT Launches Desktop Apps with Voice Mode
  • hero-image (2)
    ChatGPT's Vision-Enhanced Voice Mode Is Now…
  • DSC04144_processed
    Samsung Debuts Voice Cloning AI That Can Respond To Calls
  • Microsoft-datacenter-cold-aisle-server-racks-for-the-AMD-MI300X
    Microsoft Prepares for OpenAI's GPT-5 Launch
  • 454439424_1017593256482398_3651231210910483627_n
    WhatsApp Adds Multi-Language Voice Message Transcription
  • Audio_Models_wallpaper_16.9
    OpenAI Launches New Audio Models for Agentic Workflows
  • 679a7510-bd6f-11ef-bfff-14a3e532bc4f
    WhatsApp Adds Voice and Image Support to ChatGPT Chats

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Tags: AIartificial intelligenceopenaivoicevoice cloningvoice engine
Paul Balo

Paul Balo

Paul Balo is the founder of TechBooky and a highly skilled wireless communications professional with a strong background in cloud computing, offering extensive experience in designing, implementing, and managing wireless communication systems.

BROWSE BY CATEGORIES

Select Category

    Receive top tech news directly in your inbox

    subscription from
    Loading

    Freshly Squeezed

    • Microsoft Reveals Rejected Start Menu Redesigns May 13, 2025
    • SeerBit & Spectranet Launch ExpressPay for Internet Subscriptions May 13, 2025
    • Truecaller Filters Verified Business Messages May 12, 2025
    • ChatGPT Deep Research Now Links to GitHub Repos May 12, 2025
    • Microsoft Offers Guide to Fix Windows Blue Screen Errors May 12, 2025
    • We’ve Invested $10b in Nigeria so Far – MTN May 12, 2025

    Browse Archives

    May 2025
    MTWTFSS
     1234
    567891011
    12131415161718
    19202122232425
    262728293031 
    « Apr    

    Quick Links

    • About TechBooky
    • Advertise Here
    • Contact us
    • Submit Article
    • Privacy Policy

    Recent News

    Microsoft Reveals Rejected Start Menu Redesigns

    Microsoft Reveals Rejected Start Menu Redesigns

    May 13, 2025
    SeerBit & Spectranet Launch ExpressPay for Internet Subscriptions

    SeerBit & Spectranet Launch ExpressPay for Internet Subscriptions

    May 13, 2025
    Truecaller Filters Verified Business Messages

    Truecaller Filters Verified Business Messages

    May 12, 2025
    ChatGPT Deep Research Now Links to GitHub Repos

    ChatGPT Deep Research Now Links to GitHub Repos

    May 12, 2025
    Microsoft Offers Guide to Fix Windows Blue Screen Errors

    Microsoft Offers Guide to Fix Windows Blue Screen Errors

    May 12, 2025
    The NCC Commissioned MTNN To Lease Spectrums From NTEL And Renew Its 3G Spectrum

    We’ve Invested $10b in Nigeria so Far – MTN

    May 12, 2025
    • Login

    © 2021 Design By Tech Booky Elite

    Generic selectors
    Exact matches only
    Search in title
    Search in content
    Post Type Selectors
    • African
    • Artificial Intelligence
    • Gadgets
    • Metaverse
    • Tips
    • About TechBooky
    • Advertise Here
    • Submit Article
    • Contact us

    © 2021 Design By Tech Booky Elite

    Discover more from TechBooky

    Subscribe now to keep reading and get access to the full archive.

    Continue reading

    We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.Ok