ChatGPT And Other Language AIs Are Nothing Without Humans — A Sociologist Explains How Countless Hidden People Make The Magic

The media frenzy surrounding ChatGPT and other large language model artificial intelligence systems spans a range of themes, from the prosaic – large language models could replace conventional web search – to the concerning – AI will eliminate many jobs – and the overwrought – AI poses an extinction-level threat to humanity.

All of these themes have a common denominator: large language models herald artificial intelligence that will supersede humanity.

But large language models, for all their complexity, are actually really dumb. And despite the name “artificial intelligence,” they’re completely dependent on human knowledge and labor. They can’t reliably generate new knowledge, of course, but there’s more to it than that.

ChatGPT can’t learn, improve or even stay up to date without humans giving it new content and telling it how to interpret that content, not to mention programming the model and building, maintaining and powering its hardware. To understand why, you first have to understand how ChatGPT and similar models work, and the role humans play in making them work.

How ChatGPT works

Large language models like ChatGPT work, broadly, by predicting what characters, words and sentences should follow one another in sequence based on training data sets. In the case of ChatGPT, the training data set contains immense quantities of public text scraped from the internet.

Imagine I trained a language model on the following set of sentences: Bears are large, furry animals. Bears have claws. Bears are secretly robots. Bears have noses. Bears are secretly robots. Bears sometimes eat fish. Bears are secretly robots.

The model would be more inclined to tell me that bears are secretly robots than anything else, because that sequence of words appears most frequently in its training data set. This is obviously a problem for models trained on fallible and inconsistent data sets – which is all of them, even academic literature.
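The bear example can be sketched as a toy frequency counter in Python. This is only an illustration of "the most frequent sequence wins", not a description of how ChatGPT is actually built; the function names `continuation_counts` and `most_likely_continuation` are invented for this sketch.

```python
from collections import Counter

# Toy training set: the example sentences from the paragraph above.
TRAINING_SENTENCES = [
    "Bears are large, furry animals.",
    "Bears have claws.",
    "Bears are secretly robots.",
    "Bears have noses.",
    "Bears are secretly robots.",
    "Bears sometimes eat fish.",
    "Bears are secretly robots.",
]


def continuation_counts(sentences, prompt):
    """Count how often each continuation follows `prompt` in the training set."""
    counts = Counter()
    for sentence in sentences:
        if sentence.startswith(prompt):
            continuation = sentence[len(prompt):].strip()
            counts[continuation] += 1
    return counts


def most_likely_continuation(sentences, prompt):
    """Return the continuation seen most often after `prompt`, or None."""
    counts = continuation_counts(sentences, prompt)
    if not counts:
        return None
    return counts.most_common(1)[0][0]


if __name__ == "__main__":
    # The most frequent continuation wins, whether or not it is true.
    print(most_likely_continuation(TRAINING_SENTENCES, "Bears"))
    # prints: are secretly robots.
```

Nothing in the counter checks whether a sentence is accurate. It only checks how often the sentence appears.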

People write lots of different things about quantum physics, Joe Biden, healthy eating or the Jan. 6 insurrection, some more valid than others. How is the model supposed to know what to say about something, when people say lots of different things?

The need for feedback

This is where feedback comes in. If you use ChatGPT, you’ll notice that you have the option to rate responses as good or bad. If you rate them as bad, you’ll be asked to provide an example of what a good answer would contain. ChatGPT and other large language models learn what answers, what predicted sequences of text, are good and bad through feedback from users, the development team and contractors hired to label the output.

ChatGPT cannot compare, analyze or evaluate arguments or information on its own. It can only generate sequences of text similar to those that other people have used when comparing, analyzing or evaluating, and it prefers sequences similar to those it has previously been told are good answers.
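To make the role of feedback concrete, here is a minimal sketch, assuming a made-up scoring rule: each candidate answer gets its frequency in the training text plus the average of the human ratings it has received. Real systems use far more elaborate training procedures; the names `rerank_with_feedback`, `COUNTS` and `FEEDBACK`, and the ratings themselves, are hypothetical.

```python
# Hypothetical counts: how often each candidate continuation of "Bears"
# appeared in the toy training set from the bear example.
COUNTS = {
    "are large, furry animals.": 1,
    "have claws.": 1,
    "are secretly robots.": 3,
    "have noses.": 1,
    "sometimes eat fish.": 1,
}

# Hypothetical human ratings: +1 means "good answer", -1 means "bad answer".
FEEDBACK = {
    "are secretly robots.": [-1, -1, -1],
    "are large, furry animals.": [1, 1],
}


def rerank_with_feedback(counts, feedback):
    """Pick the candidate with the best combined score, or None if empty.

    Assumed scoring rule (illustrative only):
    relative frequency + mean human rating (0 when a candidate is unrated).
    """
    if not counts:
        return None
    total = sum(counts.values())
    scores = {}
    for candidate, count in counts.items():
        ratings = feedback.get(candidate, [])
        mean_rating = sum(ratings) / len(ratings) if ratings else 0.0
        scores[candidate] = count / total + mean_rating
    return max(scores, key=scores.get)


if __name__ == "__main__":
    # Without feedback, raw frequency decides.
    print(rerank_with_feedback(COUNTS, {}))        # are secretly robots.
    # With human ratings, the frequent but bad answer is pushed down.
    print(rerank_with_feedback(COUNTS, FEEDBACK))  # are large, furry animals.
```

The sketch never judges the answers itself. The only thing that moves "secretly robots" out of first place is the ratings that people supplied.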

Thus, when the model gives you a good answer, it’s drawing on a large amount of human labor that’s already gone into telling it what is and isn’t a good answer. There are many, many human workers hidden behind the screen, and they will always be needed if the model is to continue improving or to expand its content coverage.

A recent investigation published by journalists in Time magazine revealed that hundreds of Kenyan workers spent thousands of hours reading and labeling racist, sexist and disturbing writing, including graphic descriptions of sexual violence, from the darkest depths of the internet to teach ChatGPT not to copy such content.

They were paid no more than US$2 an hour, and many understandably reported experiencing psychological distress due to this work.

What ChatGPT can’t do

The importance of feedback can be seen directly in ChatGPT’s tendency to “hallucinate”; that is, confidently provide inaccurate answers. ChatGPT can’t give good answers on a topic without training, even if good information about that topic is widely available on the internet.

You can try this out yourself by asking ChatGPT about more and less obscure things. I’ve found it particularly effective to ask ChatGPT to summarize the plots of different fictional works because, it seems, the model has been more rigorously trained on nonfiction than fiction.

In my own testing, ChatGPT summarized the plot of J.R.R. Tolkien’s The Lord of the Rings, a very famous novel, with only a few mistakes. But its summaries of Gilbert and Sullivan’s The Pirates of Penzance and of Ursula K. Le Guin’s The Left Hand of Darkness – both slightly more niche but far from obscure – came close to playing Mad Libs with the character and place names. It doesn’t matter how good these works’ respective Wikipedia pages are. The model needs feedback, not just content.

Because large language models don’t actually understand or evaluate information, they depend on humans to do it for them. They are parasitic on human knowledge and labor. When new sources are added into their training data sets, they need new training on whether and how to build sentences based on those sources.

They can’t evaluate whether news reports are accurate or not. They can’t assess arguments or weigh trade-offs. They can’t even read an encyclopedia page and only make statements consistent with it, or accurately summarize the plot of a movie. They rely on human beings to do all these things for them.

Then they paraphrase and remix what humans have said, and rely on yet more human beings to tell them whether they’ve paraphrased and remixed well. If the common wisdom on some topic changes – for example, whether salt is bad for your heart or whether early breast cancer screenings are useful – they will need to be extensively retrained to incorporate the new consensus.

Many people behind the curtain

In short, far from being the harbingers of totally independent AI, large language models illustrate the total dependence of many AI systems, not only on their designers and maintainers but on their users. So if ChatGPT gives you a good or useful answer about something, remember to thank the thousands or millions of hidden people who wrote the words it crunched and who taught it what were good and bad answers.

Far from being an autonomous superintelligence, ChatGPT is, like all technologies, nothing without us.
