Meta showcases AI for the metaverse and an alternative to traditional large language models

Meta has reported the results of its latest artificial intelligence research conducted under the FAIR (Fundamental AI Research) program. The company’s specialists have developed an AI model responsible for believable movements of virtual characters; a model that operates not on tokens (language units) but on concepts; and much more.

Image source: Google DeepMind / unsplash.com

The Meta Motivo model controls the movements of virtual humanoid characters performing complex tasks. It was trained with reinforcement learning on an unlabeled dataset of human body movements and can serve as an auxiliary system for designing characters’ movements and poses. “Meta Motivo is capable of performing a wide range of whole-body control tasks, including motion tracking and reaching a target pose, without any additional training or planning,” the company said.
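As a rough illustration of the zero-shot prompting such a behavioral foundation model enables, here is a minimal sketch; the interface and all names are hypothetical rather than Meta’s published API:

```python
import torch
import torch.nn as nn

# Hypothetical sketch (not Meta's actual API): one frozen policy, pretrained on
# unlabeled motion data, is conditioned on a task embedding z inferred from a
# target pose, so a new task needs no extra training or planning.
obs_dim, pose_dim, z_dim, act_dim = 64, 32, 16, 8
goal_encoder = nn.Linear(pose_dim, z_dim)        # target pose -> task embedding
policy = nn.Sequential(nn.Linear(obs_dim + z_dim, 128), nn.Tanh(),
                       nn.Linear(128, act_dim))  # stands in for the frozen policy

def act(observation, goal_pose):
    z = goal_encoder(goal_pose)                  # the "prompt" selecting a behavior
    return policy(torch.cat([observation, z], dim=-1))

action = act(torch.randn(1, obs_dim), torch.randn(1, pose_dim))
```

The point of the design is that the policy itself never changes: a new task is expressed purely as a conditioning vector, which is why no per-task training or planning is required.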

An important achievement was the creation of the Large Concept Model (LCM), an alternative to traditional large language models. Meta’s researchers observed that today’s advanced AI systems operate at the level of tokens (language units that typically represent a fragment of a word) and do not demonstrate explicit hierarchical reasoning. In the LCM, the reasoning mechanism is decoupled from the linguistic representation, much as a person first forms a sequence of concepts and only then puts it into words. When giving a series of talks on the same topic, for example, a speaker already has the sequence of concepts formed, but the wording may change from one event to the next.

When generating a response to a query, the LCM predicts a sequence not of tokens but of concepts, each represented as a whole sentence in a multimodal and multilingual embedding space. According to the developers, the LCM architecture becomes more computationally efficient than token-level models as the input context grows. In practice, this work should help improve the performance of language models across modalities (data formats) and when producing responses in any language.
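A minimal sketch of the concept-level idea, assuming an external sentence encoder/decoder that maps text to and from fixed-size embeddings (Meta’s paper pairs the model with its SONAR embedding space; the architecture and training objective below are simplified assumptions):

```python
import torch
import torch.nn as nn

# Sketch of concept-level autoregression: instead of predicting the next token
# id over a vocabulary, the model regresses the next *sentence embedding*.
class ConceptModel(nn.Module):
    def __init__(self, dim=1024, layers=8, heads=16):
        super().__init__()
        block = nn.TransformerEncoderLayer(dim, heads, batch_first=True)
        self.backbone = nn.TransformerEncoder(block, layers)
        self.head = nn.Linear(dim, dim)   # predicts the next concept vector

    def forward(self, concepts):          # concepts: (batch, seq, dim)
        n = concepts.size(1)
        causal = torch.triu(torch.full((n, n), float("-inf")), diagonal=1)
        return self.head(self.backbone(concepts, mask=causal))

model = ConceptModel()
doc = torch.randn(2, 12, 1024)            # 12 sentence embeddings per document
pred = model(doc[:, :-1])                 # predict concepts 2..12 from 1..11
loss = nn.functional.mse_loss(pred, doc[:, 1:])
loss.backward()
```

Here the next concept is regressed with a simple MSE loss; Meta’s paper also explores diffusion-based and quantized variants of this prediction step.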

Image source: Meta

The Meta Dynamic Byte Latent Transformer also offers an alternative to language tokens, but instead of expanding tokens into concepts it goes the other way, building a hierarchical model at the byte level. According to the developers, this improves efficiency on long sequences during both training and inference. The companion Meta Explore Theory-of-Mind tool is designed to instill social intelligence skills in AI models during training, to evaluate models on such tasks, and to fine-tune already trained AI systems. Explore Theory-of-Mind is not limited to a predefined set of interactions but generates its own scenarios.
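In the published work, patch boundaries come from a small byte-level model: a new patch starts where the next byte’s entropy spikes. The sketch below imitates that rule with a simple running byte-frequency model, so the scoring and the threshold are assumptions rather than the paper’s method:

```python
import math
from collections import Counter

def surprise(counts: Counter, byte: int, total: int) -> float:
    """Bits of surprise of the next byte under a Laplace-smoothed unigram model."""
    p = (counts[byte] + 1) / (total + 256)
    return -math.log2(p)

def dynamic_patches(data: bytes, threshold: float = 6.0) -> list[bytes]:
    """Group bytes into variable-size patches, splitting at surprising bytes."""
    counts, total = Counter(), 0
    patches, current = [], bytearray()
    for b in data:
        if current and surprise(counts, b, total) > threshold:
            patches.append(bytes(current))   # close the patch before a surprise
            current = bytearray()
        current.append(b)
        counts[b] += 1
        total += 1
    if current:
        patches.append(bytes(current))
    return patches

text = "the model reads raw bytes, so no tokenizer is needed".encode()
print(dynamic_patches(text))  # predictable runs merge into longer patches
```

Predictable stretches of bytes end up in long patches that cost the large model a single step, which is where the claimed efficiency gain on long sequences comes from.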

Meta’s Memory Layers at Scale technique aims to optimize the factual-memory mechanisms of large language models: as the number of parameters grows, working with factual memory requires ever more resources, and the new mechanism is designed to save them. The Meta Image Diversity Modeling project, carried out with the involvement of third-party experts, aims to prioritize AI-generated images that more accurately correspond to real-world objects; it is also meant to help developers create images with AI more safely and responsibly.
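As a rough sketch of the memory-layer idea: a large trainable key-value table is queried sparsely, so only a handful of slots are read and updated per token. The published work uses a product-key lookup to avoid scoring every key; this naive version scores them all for clarity:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MemoryLayer(nn.Module):
    """Toy trainable key-value memory intended as a drop-in FFN replacement."""
    def __init__(self, dim=512, n_slots=16384, topk=4):
        super().__init__()
        self.keys = nn.Parameter(torch.randn(n_slots, dim) * dim ** -0.5)
        self.values = nn.Embedding(n_slots, dim)  # only selected rows get gradients
        self.topk = topk

    def forward(self, x):                  # x: (batch, seq, dim)
        scores = x @ self.keys.t()         # similarity to every memory key
        w, idx = scores.topk(self.topk, dim=-1)
        w = F.softmax(w, dim=-1)           # weights over the selected slots
        v = self.values(idx)               # (batch, seq, topk, dim)
        return (w.unsqueeze(-1) * v).sum(dim=-2)

layer = MemoryLayer()
out = layer(torch.randn(2, 16, 512))       # same shape in and out, like an FFN
```

Because each token touches only `topk` slots, the table can grow very large without a matching growth in per-token compute, which is the resource saving the paragraph above describes.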

The Meta CLIP 1.2 model is a new version of the system designed to link text and visual data; it is also used to train other AI models. The Meta Video Seal tool applies watermarks to AI-generated video: the mark is invisible to the naked eye but can be detected to establish a video’s origin, and it survives editing, including blurring and re-encoding with various compression algorithms. Finally, Meta recalled its Flow Matching paradigm, which can be used to generate images, video, sound, and even three-dimensional structures, including protein molecules; the method models how samples flow from noise toward data and acts as an alternative to the diffusion mechanism.
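The core of Flow Matching fits in a few lines. Below is a minimal sketch using straight-line probability paths, one simple instance of the framework; the toy data and the tiny network are assumptions for illustration:

```python
import torch
import torch.nn as nn

# A network v(x, t) is trained to match the constant velocity (x1 - x0) of the
# straight line from a noise sample x0 to a data sample x1.
net = nn.Sequential(nn.Linear(3, 128), nn.SiLU(), nn.Linear(128, 2))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

for step in range(1000):
    x1 = torch.randn(256, 2) * 0.1 + 2.0     # stand-in "data" distribution
    x0 = torch.randn(256, 2)                 # noise samples
    t = torch.rand(256, 1)
    xt = (1 - t) * x0 + t * x1               # point on the linear path
    pred = net(torch.cat([xt, t], dim=-1))
    loss = ((pred - (x1 - x0)) ** 2).mean()  # regress the path's velocity
    opt.zero_grad()
    loss.backward()
    opt.step()

# Sampling: integrate dx/dt = v(x, t) from t = 0 (noise) to t = 1 (data).
x = torch.randn(256, 2)
for i in range(100):
    t = torch.full((256, 1), i / 100)
    x = x + net(torch.cat([x, t], dim=-1)) / 100
```

Unlike diffusion training, there is no noise schedule to invert: the network directly regresses the velocity field, and sampling is a single ODE integration from noise to data.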
