Multimodal Machine Learning Applications

External reference: https://openalex.org/T11714

  1. Learnable communication graphs improve multi-agent coordination
    Study proposes learnable communication graphs for multi-agent systems, enabling dynamic information sharing that adapts to task demands and reduces computational resource consumption.
  2. SCOPE: Real-Time Natural Language Camera Agent at the Edge
    SCOPE integrates natural language processing with camera control for edge deployment, executing perception and planning locally without cloud dependencies.