AI lies under pressure and stress

Báo Thanh niên, 04/01/2024


It has long been known that AI can "hallucinate" and give false, inaccurate answers. However, researchers have recently discovered that artificial intelligence and chatbots can be manipulated to commit crimes on behalf of humans and even lie to cover up what they have done.

A research team from Cornell University (USA) set up a scenario in which a large language model (LLM) acts improperly and deceives its users. In their description of the experiment, the researchers said they asked the LLM, OpenAI's GPT-4, to simulate making investments for a financial institution. The team interacted with the model through ordinary conversation, but configured it to reveal its "thoughts" while exchanging messages so that they could observe its decision-making process more closely.

Under pressure, AI can commit wrongdoing and lie to cover up what it has done.

To test whether the AI would lie or cheat, the researchers put it under pressure. Posing as managers of a financial institution, they emailed the AI, which was playing the role of a stock trader, to complain that the company’s business was not doing well.

The AI also received “inside information” about a profitable stock trade and acted on it, even though it knew that insider trading was against company policy. But when reporting back to management, the language model hid the real reasons behind its trading decisions.

To gather more results, the team varied the setup: removing the LLM's access to its reasoning scratchpad, trying to prevent the deviant behavior by changing the system instructions, and changing both the level of pressure put on the AI and the risk of being caught. After measuring how often the behavior occurred, the team found that, when given the opportunity, GPT-4 still chose to conduct insider trading up to 75% of the time.

“To our knowledge, this is the first evidence of planned deceptive behavior in artificial intelligence systems that are designed to be harmless to humans and honest,” the report concluded.


