- 28
- 120 267
Quan
Joined Oct 15, 2011
Enroll in the Speaker Recognition online course on Udemy today: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A
Also the Speaker Diarization online course on Udemy: www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F
Purchase the Voice Identity Techniques textbook: item.jd.com/12970526.html
Speaker Recognition online course on Udemy
Enroll in the course today: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A
Contact me if you need a coupon.
Chinese textbook on Voice Identity Techniques: github.com/wq2012/VoiceIdentityBook
Views: 789
Videos
[ICASSP 2022] Turn-to-Diarize: Online Speaker Diarization Constrained by Speaker Turn Detection
2.3K views · 2 years ago
0:18 - Introduction 3:31 - Speaker turn detection 6:58 - Turn-to-Diarize 12:20 - Experiments 16:28 - Python Library 17:29 - Conclusions and future work Code: github.com/wq2012/SpectralCluster Paper: arxiv.org/abs/2109.11641 Poster: github.com/google/speaker-id/blob/master/publications/Turn-to-Diarize/resources/icassp2022_turn_to_diarize_poster.pdf More resources on speaker diarization: wq2012.g...
[Synced (机器之心) & Broadview (博文视点)] Getting Started with Voice Identity Techniques | Lecture 2: Speaker Diarization and Other Applications
1.3K views · 3 years ago
Udemy online course on speaker recognition: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Udemy online course on speaker diarization: www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F About the book: github.com/wq2012/VoiceIdentityBook JD.com: item.jd.com/12970526.html Tmall: detail.tmall.com/item.htm?id=628032618898 Dangdang: product.dangdang.com/29130997.html Synced (机器之心): mp.weixin.qq.com/s/5e-Pqu1VUDsU7fTtiD87rw Broadview (博文视点): mp.weixin.qq.com/s...
[Synced (机器之心) & Broadview (博文视点)] Getting Started with Voice Identity Techniques | Lecture 1: Audio Fundamentals and Speaker Recognition
2.7K views · 3 years ago
Udemy online course on speaker recognition: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Udemy online course on speaker diarization: www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F About the book: github.com/wq2012/VoiceIdentityBook JD.com: item.jd.com/12970526.html Tmall: detail.tmall.com/item.htm?id=628032618898 Dangdang: product.dangdang.com/29130997.html Synced (机器之心): mp.weixin.qq.com/s/iQtHFi34uKTGfvWVOl8adw Broadview (博文视点): mp.weixin.qq.com/s...
[Interspeech 2020] VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recog
2.3K views · 3 years ago
0:48 - Recap of VoiceFilter 2:07 - VoiceFilter for on-device ASR 4:19 - The journey to Lite 8:20 - The long fight with over-suppression 10:48 - Experiment setup 12:37 - Results and conclusions Home: google.github.io/speaker-id/publications/VoiceFilter-Lite/ arXiv paper: arxiv.org/abs/2009.04323 Demo: ua-cam.com/video/BiWMZdnHuVs/v-deo.html Previous VoiceFilter lecture (Interspeech 2019): ua-cam...
Android demo for VoiceFilter-Lite and on-device ASR
2.4K views · 3 years ago
Home page: google.github.io/speaker-id/publications/VoiceFilter-Lite/ Paper: arxiv.org/abs/2009.04323 Lecture: ua-cam.com/video/EhCPJgzmdLQ/v-deo.html
[Speaker Odyssey 2020] Personal VAD: Speaker-Conditioned Voice Activity Detection
2.7K views · 3 years ago
00:21 - Key messages 00:46 - Background 04:04 - Introducing Personal VAD 06:09 - Implementation 09:58 - Experiment Setup 11:55 - Results and Conclusions 13:54 - Future Work Home page: google.github.io/speaker-id/publications/PersonalVAD/ ISCA archive: www.isca-speech.org/archive/Odyssey_2020/abstracts/2.html arXiv paper: arxiv.org/abs/1908.04284 Slides: google.github.io/speaker-id/publications/...
[Interspeech 2019] VoiceFilter live lecture
1.6K views · 4 years ago
Live recording of the presentation at Interspeech 2019. The presentation was given on Sep. 18, 2019.
[Interspeech 2019] Multi-Microphone Adaptive Noise Cancellation for Robust Hotword Detection
194 views · 4 years ago
This work was done by Yiteng (Arden) Huang. I presented on his behalf because he could not travel to Interspeech. The presentation was given on Sep. 17, 2019. Here is the link to the paper: ai.google/research/pubs/pub48420/
[Interspeech 2019] VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
5K views · 4 years ago
0:30 - Introduction 3:23 - VoiceFilter Models 6:40 - Data 8:07 - Experiments 11:10 - Conclusions and Future Work Home page: google.github.io/speaker-id/publications/VoiceFilter/ Paper: arxiv.org/abs/1810.04826 Demo: ua-cam.com/video/2BF_1X7bmds/v-deo.html Lecture on our new VoiceFilter-Lite system: ua-cam.com/video/EhCPJgzmdLQ/v-deo.html Udemy online course on speaker recognition: www.udemy.com...
Speaker Diarization with LSTM: Android Demo
4.9K views · 5 years ago
Home page: google.github.io/speaker-id/publications/LstmDiarization/ Paper: arxiv.org/abs/1710.10468 Poster: 162.242.252.85/documents/speaker-diarization-lstm Tutorial: ua-cam.com/video/pjxGPZQeeO4/v-deo.html The audio was played from a loudspeaker, so there is some acoustic distortion. I was holding another phone in one hand to record the videos, so they are not very stable. ...
[ICASSP 2019] Fully Supervised Speaker Diarization: Say Goodbye to clustering
20K views · 5 years ago
0:17 - Introduction 2:05 - Clustering - Why it's not good enough? 8:43 - UIS-RNN 17:06 - Experimental Results 20:17 - The Python Library 26:38 - Conclusions and Future Work Code: github.com/google/uis-rnn Paper: arxiv.org/abs/1810.04719 More resources on speaker diarization: wq2012.github.io/awesome-diarization Udemy online course on speaker recognition: www.udemy.com/course/speaker-recognition...
Audio samples for Google's VoiceFilter
5K views · 5 years ago
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking Project site: google.github.io/speaker-id/publications/VoiceFilter/ Paper: arxiv.org/abs/1810.04826 Third-party implementation: github.com/mindslab-ai/voicefilter
Speaker Diarization with LSTM: Colaboratory Interactive Demo
10K views · 5 years ago
Home page: google.github.io/speaker-id/publications/LstmDiarization Spectral clustering code: github.com/wq2012/SpectralCluster Paper: arxiv.org/abs/1710.10468 Poster: 162.242.252.85/documents/speaker-diarization-lstm Tutorial: ua-cam.com/video/pjxGPZQeeO4/v-deo.html The "Run diarization" part runs a bit slow because this demo is not built on top of a service, but runs a local executable for ev...
Multispeaker Text-To-Speech audio samples
1.2K views · 5 years ago
Title: Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Link to the page: google.github.io/tacotron/publications/speaker_adaptation/ Paper: arxiv.org/abs/1806.04558
[ICASSP 2018] Google's Diarization System: Speaker Diarization with LSTM
25K views · 5 years ago
Car racing game on Arduino UNO and 2X16 LCD screen
14K views · 5 years ago
[ICASSP 2018] Google's D-Vector System: Generalized End-to-End Loss for Speaker Verification
10K views · 6 years ago
A preview of the "Compound Eye" multidirectional color sensor
48 views · 9 years ago
Occupancy-Driven Lighting with Support Vector Machines and RPi Sensors
88 views · 9 years ago
Illumination Feedback Control with PID Controller and RPi Sensors
203 views · 9 years ago
Occupancy Estimation using Light Reflection Model and Ceiling-Mounted RPi Sensors
184 views · 10 years ago
COSBOS: COlor-Sensor-Based Occupancy Sensing
654 views · 10 years ago
Label Consistent Fisher Vectors (LCFV) Demo
1.2K views · 10 years ago
LF3DR: Light-Field-Based 3D Object Retrieval
294 views · 10 years ago
Active Geometric Shape Model Demos
4.6K views · 10 years ago
Thank you for this. As an undergrad student trying to get into research, I find it really hard to read and understand research papers. Thanks a lot for the video breakdown of your research!
Can I get the GitHub link for this Android app?
Is there any code for this?
Doesn't work. Tried it.
Is there any source code available for this?
Hello sir, can we make the game on the Proteus screen? If so, how can we do it?
Thank you for your amazing job! I wonder if you have adopted this system in real-world applications and how it performs.
Have you seen anything like this done on iOS using the same principle?
greaaaaattt.
100th like i did!
Personal timestamp: 3:01
14:47 This must be a podcast with Neil Degrasse Tyson
Hi, I have an audio mix of English and Chinese speech. The English is louder, while the Chinese is in the background. How can I extract the Chinese speech? Should I find a network trained on a Chinese dataset?
After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Also this Udemy online course on Speaker Diarization: www.udemy.com/course/diarization/?referralCode=21D7CC0AEABB7FE3680F Please contact me if you need a coupon. Looking forward to seeing you in the lectures!
Great work and great presentation! Is it possible to share the slides as well to visit the websites you linked there?
Yes. The slides can be downloaded here: github.com/google/speaker-id/blob/master/publications/Turn-to-Diarize/resources/icassp2022_turn_to_diarize_slides.pdf
Please share the Colab.
After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!
Hello Quan. I want to join the course on Udemy and am in need of a coupon...
After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!
Hello, I'm interested in your course and would be really thankful if you could share a coupon with me. Thank you so much for making this. I will wait for the coupon ^^
@angkymusa972 Please send me an email: quanw@google.com
Thanks. The course was helpful in starting my UG thesis in voice diarization.
After years of preparation, I'm excited to share that my online course on Speaker Recognition now starts to accept enrollment on Udemy: www.udemy.com/course/speaker-recognition/?referralCode=1914766AF241CE15D19A Please contact me if you need a coupon. Looking forward to seeing you in the lectures!
Hi Quan, I am planning to build a real-time prototype for VoiceFilter-Lite. Do you think the course would be helpful? Thanks
@qalabeabbas6114 The course won't cover speech enhancement or separation. But if you are looking for a course on fundamental audio/speech processing, this course might be helpful.