
Speaker Recognition online course on Udemy

Enroll in the course today: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A
Contact me if you need a coupon.
Chinese textbook on Voice Identity Techniques: github.com/wq2012/VoiceIdentityBook

Videos

[ICASSP 2022] Turn-to-Diarize: Online Speaker Diarization Constrained by Speaker Turn Detection

18:56

2.3K views · 2 years ago

0:18 - Introduction
3:31 - Speaker turn detection
6:58 - Turn-to-Diarize
12:20 - Experiments
16:28 - Python Library
17:29 - Conclusions and future work
Code: github.com/wq2012/SpectralCluster
Paper: arxiv.org/abs/2109.11641
Poster: github.com/google/speaker-id/blob/master/publications/Turn-to-Diarize/resources/icassp2022_turn_to_diarize_poster.pdf
More resources on speaker diarization: wq2012.g...

[机器之心 & 博文视点] Introduction to Voice Identity Techniques | Lecture 2: Speaker Diarization and Other Applications

59:33

1.3K views · 3 years ago

Udemy online course on speaker recognition: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A
Udemy online course on speaker diarization: www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F
About the book: github.com/wq2012/VoiceIdentityBook
JD.com: item.jd.com/12970526.html
Tmall: detail.tmall.com/item.htm?id=628032618898
Dangdang: product.dangdang.com/29130997.html
机器之心: mp.weixin.qq.com/s/5e-Pqu1VUDsU7fTtiD87rw
博文视点: mp.weixin.qq.com/s...

[机器之心 & 博文视点] Introduction to Voice Identity Techniques | Lecture 1: Audio Basics and Speaker Recognition

59:17

2.7K views · 3 years ago

Udemy online course on speaker recognition: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A
Udemy online course on speaker diarization: www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F
About the book: github.com/wq2012/VoiceIdentityBook
JD.com: item.jd.com/12970526.html
Tmall: detail.tmall.com/item.htm?id=628032618898
Dangdang: product.dangdang.com/29130997.html
机器之心: mp.weixin.qq.com/s/iQtHFi34uKTGfvWVOl8adw
博文视点: mp.weixin.qq.com/s...

[Interspeech 2020] VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recog

15:39

2.3K views · 3 years ago

0:48 - Recap of VoiceFilter
2:07 - VoiceFilter for on-device ASR
4:19 - The journey to Lite
8:20 - The long fight with over-suppression
10:48 - Experiment setup
12:37 - Results and conclusions
Home: google.github.io/speaker-id/publications/VoiceFilter-Lite/
arXiv paper: arxiv.org/abs/2009.04323
Demo: ua-cam.com/video/BiWMZdnHuVs/v-deo.html
Previous VoiceFilter lecture (Interspeech 2019): ua-cam...

Android demo for VoiceFilter-Lite and on-device ASR

1:39

2.4K views · 3 years ago

Home page: google.github.io/speaker-id/publications/VoiceFilter-Lite/
Paper: arxiv.org/abs/2009.04323
Lecture: ua-cam.com/video/EhCPJgzmdLQ/v-deo.html

[Speaker Odyssey 2020] Personal VAD: Speaker-Conditioned Voice Activity Detection

14:53

2.7K views · 3 years ago

00:21 - Key messages
00:46 - Background
04:04 - Introducing Personal VAD
06:09 - Implementation
09:58 - Experiment Setup
11:55 - Results and Conclusions
13:54 - Future Work
Home page: google.github.io/speaker-id/publications/PersonalVAD/
ISCA archive: www.isca-speech.org/archive/Odyssey_2020/abstracts/2.html
arXiv paper: arxiv.org/abs/1908.04284
Slides: google.github.io/speaker-id/publications/...

[Interspeech 2019] VoiceFilter live lecture

20:04

1.6K views · 4 years ago

Live recording of the presentation at Interspeech 2019. The presentation was given on Sep. 18, 2019.

[Interspeech 2019] Multi-Microphone Adaptive Noise Cancellation for Robust Hotword Detection

19:26

194 views · 4 years ago

This work was done by Yiteng (Arden) Huang. I presented on his behalf because he could not travel to Interspeech. The presentation was given on Sep. 17, 2019.
Paper: ai.google/research/pubs/pub48420/

[Interspeech 2019] VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

12:25

5K views · 4 years ago

0:30 - Introduction
3:23 - VoiceFilter Models
6:40 - Data
8:07 - Experiments
11:10 - Conclusions and Future Work
Home page: google.github.io/speaker-id/publications/VoiceFilter/
Paper: arxiv.org/abs/1810.04826
Demo: ua-cam.com/video/2BF_1X7bmds/v-deo.html
Lecture on our new VoiceFilter-Lite system: ua-cam.com/video/EhCPJgzmdLQ/v-deo.html
Udemy online course on speaker recognition: www.udemy.com...

Speaker Diarization with LSTM: Android Demo

5:40

4.9K views · 5 years ago

Home page: google.github.io/speaker-id/publications/LstmDiarization/
Paper: arxiv.org/abs/1710.10468
Poster: 162.242.252.85/documents/speaker-diarization-lstm
Tutorial: ua-cam.com/video/pjxGPZQeeO4/v-deo.html
The audio was played from a loudspeaker, so there is some acoustic distortion. I was holding another phone in one hand to record the videos, so they are not very stable. ...

[ICASSP 2019] Fully Supervised Speaker Diarization: Say Goodbye to clustering

28:10

20K views · 5 years ago

0:17 - Introduction
2:05 - Clustering - Why it's not good enough?
8:43 - UIS-RNN
17:06 - Experimental Results
20:17 - The Python Library
26:38 - Conclusions and Future Work
Code: github.com/google/uis-rnn
Paper: arxiv.org/abs/1810.04719
More resources on speaker diarization: wq2012.github.io/awesome-diarization
Udemy online course on speaker recognition: www.udemy.com/course/speaker-recognition...

Audio samples for Google's VoiceFilter

1:10

5K views · 5 years ago

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Project site: google.github.io/speaker-id/publications/VoiceFilter/
Paper: arxiv.org/abs/1810.04826
Third-party implementation: github.com/mindslab-ai/voicefilter

Speaker Diarization with LSTM: Colaboratory Interactive Demo

4:20

10K views · 5 years ago

Home page: google.github.io/speaker-id/publications/LstmDiarization
Spectral clustering code: github.com/wq2012/SpectralCluster
Paper: arxiv.org/abs/1710.10468
Poster: 162.242.252.85/documents/speaker-diarization-lstm
Tutorial: ua-cam.com/video/pjxGPZQeeO4/v-deo.html
The "Run diarization" part runs a bit slowly because this demo is not built on top of a service, but runs a local executable for ev...

Multispeaker Text-To-Speech audio samples

2:05

1.2K views · 5 years ago

Title: Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Link to the page: google.github.io/tacotron/publications/speaker_adaptation/
Paper: arxiv.org/abs/1806.04558

[ICASSP 2018] Google's Diarization System: Speaker Diarization with LSTM

29:05

25K views · 5 years ago

Car racing game on Arduino UNO and 2X16 LCD screen

1:48

14K views · 5 years ago

[ICASSP 2018] Google's D-Vector System: Generalized End-to-End Loss for Speaker Verification

17:34

10K views · 6 years ago

A preview of the "Compound Eye" multidirectional color sensor

0:18

48 views · 9 years ago

Occupancy-Driven Lighting with Support Vector Machines and RPi Sensors

1:29

88 views · 9 years ago

Illumination Feedback Control with PID Controller and RPi Sensors

2:24

203 views · 9 years ago

Occupancy Estimation using Light Reflection Model and Ceiling-Mounted RPi Sensors

4:57

184 views · 10 years ago

Video spotlight for COSBOS

0:38

82 views · 10 years ago

COSBOS: COlor-Sensor-Based Occupancy Sensing

13:39

654 views · 10 years ago

Label Consistent Fisher Vectors (LCFV) Demo

2:44

1.2K views · 10 years ago

ConnectFour Demo

3:31

124 views · 10 years ago

LF3DR: Light-Field-Based 3D Object Retrieval

4:33

294 views · 10 years ago

Active Geometric Shape Model Demos

4:07

4.6K views · 10 years ago

COMMENTS

@aram69420 4 months ago
Thank you for this. As an undergrad student trying to get into research, I find it really hard to read and understand research papers. Thanks a lot for the video breakdown of your research!
@Maddy_akil 6 months ago
Can I get the GitHub link for this Android app?
@joshuarileymagic 10 months ago
Is there any code for this?
@rockrock7655 1 year ago
Doesn't work, tried it.
@user-qs5uv4ye2m 1 year ago
Is there any source code available for this?
@guldencetin3939 1 year ago
Hello sir, can we make the game on the Proteus screen? If so, how can we do it?
@HongjiWang 1 year ago
Thank you for your amazing job! I wonder if you have adopted this system in real-world applications and how it performs.
@saamermansoor4399 1 year ago
Have you seen anything like this done on iOS using the same principle?
@jamesgenius1673 1 year ago
greaaaaattt.
@meghashreebhattacharya7376 1 year ago
100th like i did!
@AdiPassover 1 year ago
Personal timestamp: 3:01
@generichuman_ 1 year ago
14:47 This must be a podcast with Neil Degrasse Tyson
@avahome5285 1 year ago
Hi, I have a sound mix of English and Chinese. The English speech is louder, while the Chinese speech is in the background. How can I extract the Chinese speech? Should I look for a network trained on a Chinese dataset?
@QuanWang 2 years ago
After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Also this Udemy online course on Speaker Diarization: www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F Please contact me if you need a coupon. Looking forward to seeing you in the lectures!
@lisabecker3246 2 years ago
Great work and great presentation! Is it possible to share the slides as well, so that we can visit the websites you linked there?
@QuanWang 2 years ago
Yes. The slides can be downloaded here: github.com/google/speaker-id/blob/master/publications/Turn-to-Diarize/resources/icassp2022_turn_to_diarize_slides.pdf
@vectox6480 2 years ago
share colab please
@QuanWang 2 years ago
After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!
@ashwinirameshh9878 1 year ago
Hello Quan. I want to join the course on Udemy and am in need of a coupon...
@QuanWang 2 years ago
After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!
@angkymusa972 1 year ago
Hello, I'm interested in your course and would be very thankful if you could share a coupon with me. Thank you so much for making this; I will wait for the coupon ^^
@QuanWang 1 year ago
@angkymusa972 Please send me an email: quanw@google.com
@Brono25 1 year ago
Thanks. The course was helpful in starting my UG thesis in voice diarization.
@QuanWang 2 years ago
After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!
@qalabeabbas6114 2 years ago
Hi Quan, I am planning to build a real-time prototype for VoiceFilter-Lite. Do you think the course would be helpful? Thanks
@QuanWang 2 years ago
@qalabeabbas6114 The course won't cover speech enhancement or separation. But if you are looking for a course on fundamental audio/speech processing, this course might be helpful.
