The prospect of humans losing control when AI chatbots connect to the Internet

VietNamNet, 16/04/2023


After being given access to GPT-4, the artificial intelligence system behind the popular ChatGPT, Andrew White asked the AI to create a completely new nerve agent.

The University of Rochester chemical engineering professor was among 50 academics and experts hired last year by OpenAI, the Microsoft-backed company behind GPT-4, to test the system. Over six months, this testing team (the “red team”) was to “qualitatively test and challenge” the new model, with the goal of “cracking” it.

The team that handles “toxic” content

White told the Financial Times (FT) that he used GPT-4 to suggest a compound that could act as a chemical weapon, feeding the model new sources of information such as scientific papers and directories of chemical manufacturers. The chatbot even found a place that could make the compound.

“I think this technology will give people a tool to do chemistry faster and more accurately,” White said. “But there is also a significant risk that some people might try to create dangerous substances.”

The FT spoke to more than a dozen members of the GPT-4 red team. They are a mix of white-collar professionals, including academics, teachers, lawyers, risk analysts and security researchers, and are mostly based in the US and Europe.

The red team’s alarming findings allowed OpenAI to prevent such results from appearing when the technology was released more widely to the public last month.

The testing team exists to address widespread concerns about deploying powerful AI systems in society. Its job is to ask probing or dangerous questions to test a tool that responds to human queries with detailed and nuanced answers.

OpenAI wanted to look for issues such as toxicity, prejudice, and linguistic bias in the model. So the red team checked for falsehoods, verbal manipulation, and dangerous scientific knowledge. They also examined how it could aid and abet plagiarism and illegal activities such as financial crime and cyberattacks, and how it could compromise national security and battlefield communications.

The red team’s findings were fed back to OpenAI, which used them to mitigate the problems and “retrain” GPT-4 before releasing it to the wider public. Each expert spent between 10 and 40 hours testing the model over several months. Most of the interviewees were paid around $100 an hour for their work.

The experts who spoke to the FT shared concerns about the rapid development of language models and, especially, the risks of connecting them to external knowledge sources through plug-ins.

“Right now, the system is frozen, meaning it can’t learn any more and has no memory,” said José Hernández-Orallo, a member of the GPT-4 red team and a professor at the Valencian Institute for Artificial Intelligence Research. “But what if we gave it access to the Internet? It could be a very powerful system connected to the world.”

The risk grows every day

OpenAI says it takes safety very seriously, tested the plug-ins before launch, and will update GPT-4 regularly as more people use it.

If connected to the Internet to "self-learn", will AI systems cause humans to lose control of the world?

Roya Pakzad, a researcher on technology and human rights, used prompts in English and Farsi to test the model for gendered responses, racial preferences, and religious bias, specifically in relation to the hijab.

Pakzad acknowledged the technology's benefits for non-native English speakers, but noted that the model showed overt bias against marginalized communities, even in later versions.

Pakzad also found that hallucination, in which the chatbot responds with fabricated information, was worse when the model was tested in Farsi: it produced a higher rate of fabricated names, numbers, and events than in English.

Boru Gollu, a lawyer in Nairobi and the only African tester, also noted the system’s discriminatory tone. “At one point during the test, the model acted like a white person was talking to me,” Gollu said. “You ask about a particular group and it gives you a biased opinion or a very prejudicial response.”

From a national security perspective, there are also differing opinions on how safe the new model is. Lauren Kahn, a researcher at the Council on Foreign Relations, was surprised by the level of detail the AI presented in a scenario of a cyberattack on military systems.

Meanwhile, Dan Hendrycks, an AI safety expert on the red team, said plug-ins risk creating a world that humans “cannot control.”

“What if a chatbot could post someone else’s personal information, access their bank account, or send police to their home? Overall, we need much more rigorous safety assessments before allowing AI to wield the power of the Internet,” Hendrycks said.

The risks will continue to increase as more people use the technology, said Heather Frase, who works at Georgetown University's Center for Security and Emerging Technology and who tested GPT-4 for its ability to aid criminals.

She suggests creating a public ledger to report incidents arising from large language models, similar to cybersecurity or consumer fraud reporting systems.

According to FT


