Quan
Quan
  • 28
  • 120 267

Відео

[ICASSP 2022] Turn-to-Diarize: Online Speaker Diarization Constrained by Speaker Turn Detection
Переглядів 2,3 тис.2 роки тому
0:18 - Introduction 3:31 - Speaker turn detection 6:58 - Turn-to-Diarize 12:20 - Experiments 16:28 - Python Library 17:29 - Conclusions and future work Code: github.com/wq2012/SpectralCluster Paper: arxiv.org/abs/2109.11641 Poster: github.com/google/speaker-id/blob/master/publications/Turn-to-Diarize/resources/icassp2022_turn_to_diarize_poster.pdf More resources on speaker diarization: wq2012.g...
【机器之心&博文视点】入门声纹技术|第二讲:声纹分割聚类与其他应用
Переглядів 1,3 тис.3 роки тому
Udemy声纹识别在线课程:www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Udemy声纹分割聚类在线课程:www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F 关于本书:github.com/wq2012/VoiceIdentityBook 京东:item.jd.com/12970526.html 天猫:detail.tmall.com/item.htm?id=628032618898 当当:product.dangdang.com/29130997.html 机器之心:mp.weixin.qq.com/s/5e-Pqu1VUDsU7fTtiD87rw 博文视点:mp.weixin.qq.com/s...
【机器之心&博文视点】入门声纹技术|第一讲:音频基础与声纹识别
Переглядів 2,7 тис.3 роки тому
Udemy声纹识别在线课程:www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Udemy声纹分割聚类在线课程:www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F 关于本书:github.com/wq2012/VoiceIdentityBook 京东:item.jd.com/12970526.html 天猫:detail.tmall.com/item.htm?id=628032618898 当当:product.dangdang.com/29130997.html 机器之心:mp.weixin.qq.com/s/iQtHFi34uKTGfvWVOl8adw 博文视点:mp.weixin.qq.com/s...
[Interspeech 2020] VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recog
Переглядів 2,3 тис.3 роки тому
0:48 - Recap of VoiceFilter 2:07 - VoiceFilter for on-device ASR 4:19 - The journey to Lite 8:20 - The long fight with over-suppression 10:48 - Experiment setup 12:37 - Results and conclusions Home: google.github.io/speaker-id/publications/VoiceFilter-Lite/ arXiv paper: arxiv.org/abs/2009.04323 Demo: ua-cam.com/video/BiWMZdnHuVs/v-deo.html Previous VoiceFilter lecture (Interspeech 2019): ua-cam...
Android demo for VoiceFilter-Lite and on-device ASR
Переглядів 2,4 тис.3 роки тому
Home page: google.github.io/speaker-id/publications/VoiceFilter-Lite/ Paper: arxiv.org/abs/2009.04323 Lecture: ua-cam.com/video/EhCPJgzmdLQ/v-deo.html
[Speaker Odyssey 2020] Personal VAD: Speaker-Conditioned Voice Activity Detection
Переглядів 2,7 тис.3 роки тому
00:21 - Key messages 00:46 - Background 04:04 - Introducing Personal VAD 06:09 - Implementation 09:58 - Experiment Setup 11:55 - Results and Conclusions 13:54 - Future Work Home page: google.github.io/speaker-id/publications/PersonalVAD/ ISCA archive: www.isca-speech.org/archive/Odyssey_2020/abstracts/2.html arXiv paper: arxiv.org/abs/1908.04284 Slides: google.github.io/speaker-id/publications/...
[Interspeech 2019] VoiceFilter live lecture
Переглядів 1,6 тис.4 роки тому
Live recording of the presentation at Interspeech 2019. The presentation was given on Sep. 18, 2019.
[Interspeech 2019] Multi-Microphone Adaptive Noise Cancellation for Robust Hotword Detection
Переглядів 1944 роки тому
This work is done by Yiteng (Arden) Huang. I'm presenting for him because he could not make his trip to Interspeech. The presentation was done on Sep. 17, 2019. Here is the link to the paper: ai.google/research/pubs/pub48420/
[Interspeech 2019] VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Переглядів 5 тис.4 роки тому
0:30 - Introduction 3:23 - VoiceFilter Models 6:40 - Data 8:07 - Experiments 11:10 - Conclusions and Future Work Home page: google.github.io/speaker-id/publications/VoiceFilter/ Paper: arxiv.org/abs/1810.04826 Demo: ua-cam.com/video/2BF_1X7bmds/v-deo.html Lecture on our new VoiceFilter-Lite system: ua-cam.com/video/EhCPJgzmdLQ/v-deo.html Udemy online course on speaker recognition: www.udemy.com...
Speaker Diarization with LSTM: Android Demo
Переглядів 4,9 тис.5 років тому
Home page: google.github.io/speaker-id/publications/LstmDiarization/ Paper: arxiv.org/abs/1710.10468 Poster: 162.242.252.85/documents/speaker-diarization-lstm Tutorial: ua-cam.com/video/pjxGPZQeeO4/v-deo.html The audios were being played from a speaker, so there were some acoustic distortions. I was holding another phone to record the videos with single hand, so the videos are not very stable. ...
[ICASSP 2019] Fully Supervised Speaker Diarization: Say Goodbye to clustering
Переглядів 20 тис.5 років тому
0:17 - Introduction 2:05 - Clustering - Why it's not good enough? 8:43 - UIS-RNN 17:06 - Experimental Results 20:17 - The Python Library 26:38 - Conclusions and Future Work Code: github.com/google/uis-rnn Paper: arxiv.org/abs/1810.04719 More resources on speaker diarization: wq2012.github.io/awesome-diarization Udemy online course on speaker recognition: www.udemy.com/course/speaker-recognition...
Audio samples for Google's VoiceFilter
Переглядів 5 тис.5 років тому
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking Project site: google.github.io/speaker-id/publications/VoiceFilter/ Paper: arxiv.org/abs/1810.04826 Third-party implementation: github.com/mindslab-ai/voicefilter
Speaker Diarization with LSTM: Colaboratory Interactive Demo
Переглядів 10 тис.5 років тому
Home page: google.github.io/speaker-id/publications/LstmDiarization Spectral clustering code: github.com/wq2012/SpectralCluster Paper: arxiv.org/abs/1710.10468 Poster: 162.242.252.85/documents/speaker-diarization-lstm Tutorial: ua-cam.com/video/pjxGPZQeeO4/v-deo.html The "Run diarization" part runs a bit slow because this demo is not built on top of a service, but runs a local executable for ev...
Multispeaker Text-To-Speech audio samples
Переглядів 1,2 тис.5 років тому
Title: Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Link to the page: google.github.io/tacotron/publications/speaker_adaptation/ Paper: arxiv.org/abs/1806.04558
[ICASSP 2018] Google's Diarization System: Speaker Diarization with LSTM
Переглядів 25 тис.5 років тому
[ICASSP 2018] Google's Diarization System: Speaker Diarization with LSTM
Car racing game on Arduino UNO and 2X16 LCD screen
Переглядів 14 тис.5 років тому
Car racing game on Arduino UNO and 2X16 LCD screen
[ICASSP 2018] Google's D-Vector System: Generalized End-to-End Loss for Speaker Verification
Переглядів 10 тис.6 років тому
[ICASSP 2018] Google's D-Vector System: Generalized End-to-End Loss for Speaker Verification
A preview of the "Compound Eye" multidirectional color sensor
Переглядів 489 років тому
A preview of the "Compound Eye" multidirectional color sensor
Occupancy-Driven Lighting with Support Vector Machines and RPi Sensors
Переглядів 889 років тому
Occupancy-Driven Lighting with Support Vector Machines and RPi Sensors
Illumination Feedback Control with PID Controller and RPi Sensors
Переглядів 2039 років тому
Illumination Feedback Control with PID Controller and RPi Sensors
Occupancy Estimation using Light Reflection Model and Ceiling-Mounted RPi Sensors
Переглядів 18410 років тому
Occupancy Estimation using Light Reflection Model and Ceiling-Mounted RPi Sensors
Video spotlight for COSBOS
Переглядів 8210 років тому
Video spotlight for COSBOS
COSBOS: COlor-Sensor-Based Occupancy Sensing
Переглядів 65410 років тому
COSBOS: COlor-Sensor-Based Occupancy Sensing
Label Consistent Fisher Vectors (LCFV) Demo
Переглядів 1,2 тис.10 років тому
Label Consistent Fisher Vectors (LCFV) Demo
ConnectFour Demo
Переглядів 12410 років тому
ConnectFour Demo
LF3DR: Light-Field-Based 3D Object Retrieval
Переглядів 29410 років тому
LF3DR: Light-Field-Based 3D Object Retrieval
Active Geometric Shape Model Demos
Переглядів 4,6 тис.10 років тому
Active Geometric Shape Model Demos

КОМЕНТАРІ

  • @aram69420
    @aram69420 4 місяці тому

    Thank you for this. As an undergrad student trying to get into research. I find it really hard to read and understand research paper, thanks a lot for the video break down of your research!

  • @Maddy_akil
    @Maddy_akil 6 місяців тому

    can i get this android apps github link

  • @joshuarileymagic
    @joshuarileymagic 10 місяців тому

    Is there any code for this?

  • @rockrock7655
    @rockrock7655 Рік тому

    doesnt work tried it

  • @user-qs5uv4ye2m
    @user-qs5uv4ye2m Рік тому

    is there any source code available for this?

  • @guldencetin3939
    @guldencetin3939 Рік тому

    Hello sir, can we make the game on the proteus screen, if it is, how can we do it?

  • @HongjiWang
    @HongjiWang Рік тому

    Thank you for your amazing job! I wonder if you have adopted this system in real-world applications and how it performs.

  • @saamermansoor4399
    @saamermansoor4399 Рік тому

    Have you seen anything like this done on iOS using the same principle?

  • @jamesgenius1673
    @jamesgenius1673 Рік тому

    greaaaaattt.

  • @meghashreebhattacharya7376

    100th like i did!

  • @AdiPassover
    @AdiPassover Рік тому

    Personal timestamp: 3:01

  • @generichuman_
    @generichuman_ Рік тому

    14:47 This must be a podcast with Neil Degrasse Tyson

  • @avahome5285
    @avahome5285 Рік тому

    Hi, I have a sound mix of English and Chinese. English sound is louder while Chinese sound is in the background. How can I get the Chinese sound? Should I find a network trained in a Chinese data set, right?

  • @QuanWang
    @QuanWang 2 роки тому

    After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Also this Udemy online course on Speaker Diarization: www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F Please contact me if you need a coupon. Looking forward to seeing you in the lectures!

  • @lisabecker3246
    @lisabecker3246 2 роки тому

    Great work and great presentation! Is it possible to share the slides as well to visit the websites you linked there?

    • @QuanWang
      @QuanWang 2 роки тому

      Yes. The slides can be downloaded here: github.com/google/speaker-id/blob/master/publications/Turn-to-Diarize/resources/icassp2022_turn_to_diarize_slides.pdf

  • @vectox6480
    @vectox6480 2 роки тому

    share colab please

  • @QuanWang
    @QuanWang 2 роки тому

    After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!

    • @ashwinirameshh9878
      @ashwinirameshh9878 Рік тому

      Hello quan. I want to join the course in Udemy and am in need of a coupon...

  • @QuanWang
    @QuanWang 2 роки тому

    After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!

  • @QuanWang
    @QuanWang 2 роки тому

    After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!

    • @angkymusa972
      @angkymusa972 Рік тому

      hello, i'm interest with your course and i'm really be so thankful if you can share some coupon for me to use it. thank you so much for making this and i will wait for the coupon ^^

    • @QuanWang
      @QuanWang Рік тому

      @@angkymusa972 please send me an email quanw@google.com

    • @Brono25
      @Brono25 Рік тому

      Thanks. The course was helpful in starting my UG thesis in voice diarization.

  • @QuanWang
    @QuanWang 2 роки тому

    After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!

  • @QuanWang
    @QuanWang 2 роки тому

    After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!

  • @QuanWang
    @QuanWang 2 роки тому

    After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!

  • @QuanWang
    @QuanWang 2 роки тому

    After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!

    • @qalabeabbas6114
      @qalabeabbas6114 2 роки тому

      Hi Quan, I am planning build a a reat time prototype for the voice filter lite. Do you think the course would be helpful ? Thanks

    • @QuanWang
      @QuanWang 2 роки тому

      @@qalabeabbas6114 The course won't cover speech enhancement or separation. But if you are looking for a course for fundamental audio/speech processing, this course might be helpful.