<samp id="9ylhn"><rp id="9ylhn"></rp></samp>

<table id="9ylhn"><span id="9ylhn"></span></table>
<big id="9ylhn"><ruby id="9ylhn"></ruby></big>

<big id="9ylhn"><strike id="9ylhn"><ol id="9ylhn"></ol></strike></big>

    1. <table id="9ylhn"></table>
      <p id="9ylhn"></p>

      <p id="9ylhn"></p>

        <table id="9ylhn"></table>

        The Sound of Pixels

        Hang Zhao, Chuang Gan, Andrew Rouditchenko, Carl Vondrick, Josh McDermott, Antonio Torralba

        Computer Science and Artificial Intelligence Laboratory, and Department of Brain and Cognitive Sciences
        Massachusetts Institute of Technology

        We introduce PixelPlayer, a system that, by watching large amounts of unlabeled videos, learns to locate image regions which produce sounds and separate the input sounds into a set of components that represents the sound from each pixel. Our approach capitalizes on the natural synchronization of the visual and audio modalities to learn models that jointly parse sounds and images, without requiring additional manual supervision.

        The system is trained with a large number of videos containing people playing instruments in different combinations, including solos and duets. No supervision is provided on what instruments are present on each video, where they are located, or how they sound. During test time, the input to the system is a video showing people playing different instruments, and the mono auditory input. Our system performs audio-visual source separation and localization, splitting the input sound signal into N sound channels, each one corresponding to a different instrument category. In addition, the system can localize the sounds and assign a different audio wave to each pixel in the input video.

        New! Follow-up Projects

        Check out our recent follow-up projects:

        Interactive Demo

        In this interactive demo, you can click on different video locations on the right, to hear the sound component associated with the selected location. The input video is shown on the left. (The demo is not well supported on the mobile end yet.)

        Input video to PixelPlayer:

        Click on a pixel to hear its sound:

        Input video to PixelPlayer:

        Click on a pixel to hear its sound:

        Input video to PixelPlayer:

        Click on a pixel to hear its sound:

        Video clips credit to original Youtube videos: [1] [2] [3]


              author = {Zhao, Hang and Gan, Chuang and Rouditchenko, Andrew and Vondrick, Carl and McDermott, Josh and Torralba, Antonio},
              title = {The Sound of Pixels},
              booktitle = {The European Conference on Computer Vision (ECCV)},
              month = {September},
              year = {2018}
        网上棋牌娱乐 http://www.cnziben.com/tags/969/42.html http://www.cnziben.com/tags/23/44.html http://www.cnziben.com/tags/7298/839.html www.zgygsy.com/html/823/9805631.html www.zgygsy.com/html/943089/43812.html www.zgygsy.com/html/87/23147130.html 斗牛棋牌可以提现的 斗地主真人版 捕鱼游戏下载
        3人跑得快15张 金蟾捕鱼攻略 能兑现的棋牌 迎丰棋牌 神州炸金花安卓版下载 850棋牌金蟾捕鱼 手机欢乐斗地主 寿光棋牌 欢乐炸金花官网安卓版 在线捕鱼棋牌游戏 信誉上下分捕鱼 十点半棋牌 网上炸金花有什么规律 850棋牌捕鱼技巧 新版真人炸金花 赢三张规则 炸金花赢现金哪个平台 星力摇钱树捕鱼技巧 斗牛游戏大全软件 炸金花手游版