Powered by OpenAIRE graph
Found an issue? Give us feedback

Computer Vision Center

Funder
Top 100 values are shown in the filters
Results number
arrow_drop_down
2 Projects, page 1 of 1
  • Funder: French National Research Agency (ANR) Project Code: ANR-15-CHR2-0006
    Funder Contribution: 247,889 EUR

    Communication is one of the necessary condition to develop intelligence in living beings. Humans use several modalities to exchange information: speech, written text, both in many languages, gestures, images, and many more. There is evidence that human learning is more effective when several modalities are used. There is a large body of research to make computers process these modalities, and ultimately, understand human language. These modalities have been, however, generally addressed independently or at most in pairs. However, merging information from multiple modalities is best done at the highest levels of abstraction, which deep learning models are trained to capture. The M2CR project aims at developing a revolutionary approach to combine all these modalities and their respective tasks in one unified architecture, based on deep neural networks, including both a discriminant and a generative component through multiple levels of representation. Our system will jointly learn from resources in several modalities, including but not limited to text of several languages (European languages, Chinese and Arabic), speech and images. In doing so, the system will learn one common semantic representation of the underlying information, both at a channel-specific level and at a higher channel-independent level. Pushing these ideas to the large scale, e.g. training on very large corpora, the M2CR project has the ambition to advance the state-of-the-art in human language understanding (HLU). M2CR will address all major tasks in HLU by one unified architecture: speech understanding and translation, multilingual image retrieval and description, etc. The M2CR project will collect existing multimodal and multilingual corpora, extend them as needed, and make them freely available to the community. M2CR will also define shared tasks to set up a common evaluation framework and ease research for other institutions, beyond the partners of this consortium. All developed software and tools will be open-source. By these means, we hope to help to advance the field of human language.

    more_vert
  • Funder: CHIST-ERA Project Code: M2CR

    Communication is one of the necessary condition to develop intelligence in living beings. Humans use several modalities to exchange information: speech, written text, both in many languages, gestures, images, and many more. There is evidence that human learning is more effective when several modalities are used. There is a large body of research to make computers process these modalities, and ultimately, understand human language. These modalities have been, however, generally addressed independently or at most in pairs. However, merging information from multiple modalities is best done at the highest levels of abstraction, which deep learning models are trained to capture. The M2CR project aims at developing a revolutionary approach to combine all these modalities and their respective tasks in one unified architecture, based on deep neural networks, including both a discriminant and a generative component through multiple levels of representation. Our system will jointly learn from resources in several modalities, including but not limited to text of several languages (European languages, Chinese and Arabic), speech and images. In doing so, the system will learn one common semantic representation of the underlying information, both at a channel-specific level and at a higher channel-independent level. Pushing these ideas to the large scale, e.g. training on very large corpora, the M2CR project has the ambition to advance the state-of-the-art in human language understanding (HLU). M2CR will address all major tasks in HLU by one unified architecture: speech understanding and translation, multilingual image retrieval and description, etc. The M2CR project will collect existing multimodal and multilingual corpora, extend them as needed, and make them freely available to the community. M2CR will also define shared tasks to set up a common evaluation framework and ease research for other institutions, beyond the partners of this consortium. All developed software and tools will be open-source. By these means, we hope to help to advance the field of human language.

    more_vert
1 Organizations, page 1 of 1

Do the share buttons not appear? Please make sure, any blocking addon is disabled, and then reload the page.

Content report
No reports available
Funder report
No option selected
arrow_drop_down

Do you wish to download a CSV file? Note that this process may take a while.

There was an error in csv downloading. Please try again later.