• Home
  • All news
  • Russia
  • Ukraine
  • World
  • Tags
    • Navalny
    • Arrests and Attacks
    • Court
    • Kazakhstan
    • Coronavirus
    • Election 2021
    • Almaty
    • Putin
    • USA
    • Murders
    • Death
    • Rape
    • Germany
  • Special Projects
    • Stories
    • One year after
Friday, May 10, 2024
  • Login
Russian Free Press
  • Home
  • All news
  • Russia
  • Ukraine
  • World
  • Tags
    • Navalny
    • Arrests and Attacks
    • Court
    • Kazakhstan
    • Coronavirus
    • Election 2021
    • Almaty
    • Putin
    • USA
    • Murders
    • Death
    • Rape
    • Germany
  • Special Projects
    • Stories
    • One year after
No Result
View All Result
  • Home
  • All news
  • Russia
  • Ukraine
  • World
  • Tags
    • Navalny
    • Arrests and Attacks
    • Court
    • Kazakhstan
    • Coronavirus
    • Election 2021
    • Almaty
    • Putin
    • USA
    • Murders
    • Death
    • Rape
    • Germany
  • Special Projects
    • Stories
    • One year after
No Result
View All Result
Russian Free Press
No Result
View All Result

Researchers have warned about the collapse of AI models due to the amount of generated content

June 14, 2023
in Daily News
Share on FacebookShare on Twitter

The use of non-human content – text, music, images – to train models like ChatGPT, Stable Diffusion, and Midjourney results in irreversible defects in their product. This is the conclusion of a group of British and Canadian scientists who experimented with training models on content that other models had previously produced. For example, on texts produced by ChatGPT or images generated by Midjourney. The scientists published their findings on the portal for scientific publications arXiv.org.

One of the authors of the work compared the clogging of the Internet with generative content with the littering of the ocean with plastic, and the atmosphere with carbon dioxide. According to scientists, this process will greatly complicate the training of new generations of generative models – those that are often called "artificial intelligence" in the media.

“Training on data generated by other models causes model collapse , a degenerative process in which, over time, models forget the original underlying distribution. <…> This process is irreversible, even for situations with almost ideal conditions for long-term learning.”

According to one of the authors of the article, Ilya Shumailov, errors in the generated data accumulate and make one perceive reality even more incorrectly. “We were surprised to find how quickly this collapse occurs: the model can quickly forget most of the original data on which it learned,” he said in a letter to VentureBeat .

As an example, Shumailov gave an imaginary situation in which the model trains on 100 pictures of cats, of which 90 are yellow and 10 are blue. First, the model generates a proportional number of yellow and blue cats, although some blue cats become slightly yellowish, then green (mixed color), and then, little by little, the "bluish" trait of the cats is erased, and all the new cats generated will be yellow. Thus, the model “forgets” what initial data was put into it, and this happens precisely when already generated data is fed into it, for example, photographs of cats. Even setting up the model, in which it was forbidden to produce too many similar answers, did not help: then, instead of repeating conditional "yellow cats", the model produced already absolutely distorted images, so as not to repeat the same cats.

Ilya Shumailov notes that the phenomenon found by his team is different from “catastrophic forgetting”, when the model loses the initially given information. In this case, the model misinterprets reality based on what it believes to be true data.

Article co-author Ross Anderson, a pioneer in safety engineering, Fellow of the Royal Academy of Engineering and Professor of the Personal Department of Safety and the Computer Laboratory at the University of Cambridge, in his blog compared the effect the team found with large-scale environmental pollution.

“After a few generations, <generated> text turns into garbage <…> Just like we covered the oceans with plastic garbage and filled the atmosphere with carbon dioxide, we will soon fill the Internet with nonsense <in the original – “blah” – The Insider> . It will be more difficult to train new models by collecting data for them on the network. Companies that have done this before, or those that have access to large amounts of user-generated content will benefit. <…> Large language models are like fire: a good thing, but it pollutes the environment.”

The researchers note that it is possible to avoid the collapse of the model if you save datasets that are not polluted by the content generated by the models, but created exclusively by people (for example, sets of texts, photographs or images), and also produce new such datasets. However, as Ross Anderson points out, on a web littered with model-generated content, this will become more and more difficult. Ilya Shumailov also notes that minorities should always be well represented in datasets. He considers the task of collecting and storing such data rather non-trivial.

Average employees of different fields generated by Stable Diffusion

In June, Bloomberg published an investigation into the prejudices of generative artificial intelligence. It turned out that the Stable Diffusion model believes that lawyers, doctors, and judges are almost always men, CEOs are always white men, and black people can only be criminals or work in a burger joint.

Read also

Bach and Margaritis support politicisation of sport in prank video call

April 16, 2024

In a world of disruption, the Olympics confronts the World Friendship Games

March 16, 2024

Clothing brand East&West hosts Spring celebration in Dubai on March 8

March 15, 2024

The champion of the Games of the Future will become the first master of sports in programming

March 1, 2024

Hochland Contemplates Patriotism Through Cheese for Russian Troops?

February 2, 2024

. BRICS+ Fashion Summit : Russian brands managed to conclude deals with buyers worth 250 million rubles

December 19, 2023

RECOMMENDED NEWS

<strong>Zamoskvoretsky court refused Ilya Yashin to challenge the…</strong>

1 year ago

A new decree on deferment from mobilization has been published. It will affect graduate students, residents and students of spiritual educational organizations

2 years ago

Russia attacked the Yuzhmash plant in the Dnieper. Three people died

2 years ago

The video shows everything that remains of the…

1 year ago

Newsletter

Be the first to get the news in the Telegram newsletter!

Newsletter

Be the first to get the news in the Telegram newsletter!


Прекратить репрессии против людей которые высказывают свое мнение

Archives

  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • December 2023
  • November 2023
  • October 2023
  • September 2023
  • August 2023
  • July 2023
  • June 2023
  • May 2023
  • April 2023
  • March 2023
  • February 2023
  • January 2023
  • December 2022
  • November 2022
  • October 2022
  • September 2022
  • August 2022
  • July 2022
  • June 2022
  • May 2022
  • April 2022
  • March 2022
  • February 2022
  • January 2022
  • December 2021
  • November 2021
  • October 2021
  • September 2021
  • August 2021
  • March 2021
  • February 2021
  • December 2020
  • November 2020

Categories

  • ! Без рубрики
  • AI Chatbots
  • asian mail brides
  • asian mail order bride
  • best dating sites
  • blog
  • Daily News
  • dating ukrainian women
  • Fast News
  • kosmos
  • latin mail order brides
  • Live
  • mail brides
  • mail order bride
  • On the map
  • One year after
  • Russia
  • russian mail order bride
  • Stories
  • thai women dating
  • top dating sites
  • Ukraine
  • Uncategorized
  • women for marriage
  • World
  • Весільний салон Київ
  • Весільні сукні київ
  • Весільні та Вечірні Сукні
  • Вечерние и выпускные платья в Киеве 2023 года
  • Вечірні сукні
  • Свадебные платья в Киеве
  • Яку весільну сукню купити для цивільного весілля?
  • About
  • Contact
  • Privacy
  • Cookie
  • Terms
  • Donate
  • Advertise
  • Sitemap
[email protected]

© 2022 Russian Free Press - Honest news about Russia and the whole world.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • All news
  • Russia
  • Ukraine
  • World
  • Tags
    • Navalny
    • Arrests and Attacks
    • Court
    • Kazakhstan
    • Coronavirus
    • Election 2021
    • Almaty
    • Putin
    • USA
    • Murders
    • Death
    • Rape
    • Germany
  • Special Projects
    • Stories
    • One year after

© 2022 Russian Free Press - Honest news about Russia and the whole world.