The tool, which was previously introduced in August, uses a “thought train” similar to OpenAI’s recently released o1 model, which helps the algorithm make more reliable judgments.

The principle of this technique is to break down complex problems into smaller logical steps, thus having the potential to improve response accuracy in difficult fields such as science, programming and mathematics.

23981ce8 5113 4cea b167 ba4e87d782f8_26068ba5.jpg
Meta announces a series of new AI products. Photo: Bloomberg

Currently, AI models generally undergo reinforcement learning through human feedback, a process that is expensive, inefficient, and often requires human experts to accurately label data or verify answers to high-precision queries.

“We hope that as AI approaches superhuman levels, it will be able to self-check its own work,” said Jason Weston of Meta's research team.

Other companies like Google and Anthropic have also published research on the concept of Learning from Feedback Reinforcement AI (RLAIF). However, Meta is the only company to release these models to the public so far.

Additionally, Meta also released an update to its Segment Anything image recognition model, which speeds up the generation of data that can be used in new inorganic materials research.

Last week, the Facebook-owned giant partnered with Blumhouse Productions, a film production house known for horror franchises like “The Purge” and “Get Out,” to put its Movie Gen video-generating AI model to the test.

Movie Gen is said to be able to create realistic video and audio, competing with services from other industry-leading startups like OpenAI and ElevenLabs.

(According to Yahoo Tech)

Google builds 7 nuclear reactors to provide energy for AI race Google has just reached an agreement with startup Kairos Power to build 7 small nuclear reactors in the US to provide energy for data centers serving the AI ​​race.