OpenAI seeks to improve AI with broader training data

DIGITAL BUSINESS MAGAZINE, 11/2023

Artificial intelligence research company OpenAI announced a new initiative this week, called Data Partnerships, aimed at diversifying and expanding the data used to train AI models. Through the program, OpenAI plans to collaborate with third-party organizations to build new public and private datasets for AI training.

OpenAI aims for fairer, more accurate models through better data

According to OpenAI, the goal is to create fairer, more accurate, and more beneficial models by exposing them to a broader range of data that better reflects diverse languages, cultures, and subject matters. Current AI datasets tend to suffer from issues such as Western-centrism, a lack of diversity, and the inclusion of toxic or biased content.

“To ultimately make [AI] that is safe and beneficial to all of humanity, we’d like AI models to deeply understand all subject matters, industries, cultures, and languages, which requires as broad a training data set as possible,” OpenAI said in a blog post announcing the program.

Partner datasets could broaden model understanding across modalities

By working with partners to collect large-scale datasets across modalities like text, images, audio, and video, OpenAI hopes to improve model understanding beyond what can easily be scraped from the internet today. The company says it will work to remove any sensitive or personal information and will offer options for keeping datasets private.

OpenAI has already partnered with organizations like the Icelandic government, Free Law Project, and Miðeind ehf on early versions of the program. However, some experts have expressed skepticism about whether the effort will succeed in minimizing the deep-rooted biases that have affected AI models thus far.

“Overall, we are seeking partners who want to help us teach AI to understand our world in order to be maximally helpful to everyone,” OpenAI said.

Diversifying AI training data to improve GPT-4

While diversifying AI training data is essential, the program also clearly stands to benefit OpenAI models like GPT-4 commercially. This perceived dual motivation, along with OpenAI’s lack of compensation for data partners, has drawn some criticism in light of accusations that the company has used data without permission.

Greater transparency around OpenAI’s dataset collection, bias mitigation efforts, and commercial interests will be key to gauging the impact of Data Partnerships on the AI landscape overall. But the program signifies an awareness that improving future AI requires starting with better, more representative data.

Featured Image Credit: Photo by Andrew Neel; Pexels

Radek Zielinski

Radek Zielinski is an experienced technology and financial journalist with a passion for cybersecurity and futurology.
