A new brain decoding method called mind captioning can generate accurate text descriptions of what a person is seeing or recalling—without relying on the brain's language system.
The new ImageBind model combines text, audio, visual, movement, thermal, and depth data. It’s only a research project but shows how future AI models could be able to generate multisensory content. The ...