Gautham J. Mysore
2020 – today
- 2022
  - [c55] Bryan Wang, Zeyu Jin, Gautham J. Mysore: Record Once, Post Everywhere: Automatic Shortening of Audio Stories for Social Media. UIST 2022: 14:1-14:11
- 2021
  - [c54] Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore, Xavier Serra: Emotion Embedding Spaces for Matching Music to Stories. ISMIR 2021: 777-785
  - [i5] Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore, Xavier Serra: Emotion Embedding Spaces for Matching Music to Stories. CoRR abs/2111.13468 (2021)
- 2020
  - [j8] Oriol Nieto, Gautham J. Mysore, Cheng-i Wang, Jordan B. L. Smith, Jan Schlüter, Thomas Grill, Brian McFee: Audio-Based Music Structure Analysis: Current Trends, Open Challenges, and Applications. Trans. Int. Soc. Music. Inf. Retr. 3(1): 246-263 (2020)
  - [c53] Kaizhi Qian, Zeyu Jin, Mark Hasegawa-Johnson, Gautham J. Mysore: F0-Consistent Many-To-Many Non-Parallel Voice Conversion Via Conditional Autoencoder. ICASSP 2020: 6284-6288
  - [c52] Pranay Manocha, Adam Finkelstein, Richard Zhang, Nicholas J. Bryan, Gautham J. Mysore, Zeyu Jin: A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences. INTERSPEECH 2020: 2852-2856
  - [c51] Max Morrison, Zeyu Jin, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore: Controllable Neural Prosody Synthesis. INTERSPEECH 2020: 4437-4441
  - [i4] Pranay Manocha, Adam Finkelstein, Zeyu Jin, Nicholas J. Bryan, Richard Zhang, Gautham J. Mysore: A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences. CoRR abs/2001.04460 (2020)
  - [i3] Kaizhi Qian, Zeyu Jin, Mark Hasegawa-Johnson, Gautham J. Mysore: F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder. CoRR abs/2004.07370 (2020)
  - [i2] Max Morrison, Zeyu Jin, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore: Controllable Neural Prosody Synthesis. CoRR abs/2008.03388 (2020)
2010 – 2019
- 2019
  - [j7] Meinard Müller, Bryan Pardo, Gautham J. Mysore, Vesa Välimäki: Recent Advances in Music Signal Processing [From the Guest Editors]. IEEE Signal Process. Mag. 36(1): 17-19 (2019)
  - [c50] Bernd Huber, Hijung Valentina Shin, Bryan C. Russell, Oliver Wang, Gautham J. Mysore: B-Script: Transcript-based B-roll Video Editing with Recommendations. CHI 2019: 81
  - [c49] Prem Seetharaman, Gautham J. Mysore, Bryan Pardo, Paris Smaragdis, Celso Gomes: VoiceAssist: Guiding Users to High-Quality Voice Recordings. CHI 2019: 309
  - [i1] Bernd Huber, Hijung Valentina Shin, Bryan C. Russell, Oliver Wang, Gautham J. Mysore: B-Script: Transcript-based B-roll Video Editing with Recommendations. CoRR abs/1902.11216 (2019)
- 2018
  - [c48] Zhengshan Shi, Gautham J. Mysore: LoopMaker: Automatic Creation of Music Loops from Pre-recorded Music. CHI 2018: 454
  - [c47] Prem Seetharaman, Gautham J. Mysore, Paris Smaragdis, Bryan Pardo: Blind Estimation of the Speech Transmission Index for Speech Quality Prediction. ICASSP 2018: 591-595
  - [c46] Mark Cartwright, Bryan Pardo, Gautham J. Mysore: Crowdsourced Pairwise-Comparison for Source Separation Evaluation. ICASSP 2018: 606-610
  - [c45] Zeyu Jin, Adam Finkelstein, Gautham J. Mysore, Jingwan Lu: Fftnet: A Real-Time Speaker-Dependent Neural Vocoder. ICASSP 2018: 2251-2255
  - [c44] Zhengshan Shi, Gautham J. Mysore: MedleyAssistant - A System for Personalized Music Medley Creation. IUI Workshops 2018
- 2017
  - [j6] Neal Wadhwa, Hao-Yu Wu, Abe Davis, Michael Rubinstein, Eugene Shih, Gautham J. Mysore, Justin G. Chen, Oral Büyüköztürk, John V. Guttag, William T. Freeman, Frédo Durand: Eulerian video magnification and analysis. Commun. ACM 60(1): 87-95 (2017)
  - [j5] Zeyu Jin, Gautham J. Mysore, Stephen DiVerdi, Jingwan Lu, Adam Finkelstein: VoCo: text-based insertion and replacement in audio narration. ACM Trans. Graph. 36(4): 96:1-96:13 (2017)
  - [c43] Cheng-i Wang, Gautham J. Mysore, Shlomo Dubnov: Re-Visiting the Music Segmentation Problem with Crowdsourcing. ISMIR 2017: 738-744
  - [c42] Shrikant Venkataramani, Paris Smaragdis, Gautham J. Mysore: AutoDub: Automatic Redubbing for Voiceover Editing. UIST 2017: 533-538
- 2016
  - [c41] Cheng-i Wang, Gautham J. Mysore: Structural segmentation with the Variable Markov Oracle and boundary adjustment. ICASSP 2016: 291-295
  - [c40] François G. Germain, Gautham J. Mysore, Takako Fujioka: Equalization matching of speech recordings in real-world environments. ICASSP 2016: 609-613
  - [c39] Mark Cartwright, Bryan Pardo, Gautham J. Mysore, Matthew D. Hoffman: Fast and easy crowdsourced perceptual audio evaluation. ICASSP 2016: 619-623
  - [c38] Zeyu Jin, Adam Finkelstein, Stephen DiVerdi, Jingwan Lu, Gautham J. Mysore: Cute: A concatenative method for voice conversion using exemplar-based unit selection. ICASSP 2016: 5660-5664
- 2015
  - [j4] Gautham J. Mysore: Can we Automatically Transform Speech Recorded on Common Consumer Devices in Real-World Environments into Professional Production Quality Speech? - A Dataset, Insights, and Challenges. IEEE Signal Process. Lett. 22(8): 1006-1010 (2015)
  - [c37] Valkyrie Savage, Andrew Head, Björn Hartmann, Dan B. Goldman, Gautham J. Mysore, Wilmot Li: Lamello: Passive Acoustic Sensing for Tangible Input Components. CHI 2015: 1277-1280
  - [c36] François G. Germain, Gautham J. Mysore: Speaker and noise independent online single-channel speech enhancement. ICASSP 2015: 71-75
  - [c35] Minje Kim, Paris Smaragdis, Gautham J. Mysore: Efficient manifold preserving audio source separation using locality sensitive hashing. ICASSP 2015: 479-483
  - [c34] Dawen Liang, Matthew D. Hoffman, Gautham J. Mysore: Speech dereverberation using a learned speech model. ICASSP 2015: 1871-1875
  - [c33] Steve Rubin, Floraine Berthouzoz, Gautham J. Mysore, Maneesh Agrawala: Capture-Time Feedback for Recording Scripted Narration. UIST 2015: 191-199
- 2014
  - [j3] François G. Germain, Gautham J. Mysore: Stopping Criteria for Non-Negative Matrix Factorization Based Supervised and Semi-Supervised Source Separation. IEEE Signal Process. Lett. 21(10): 1284-1288 (2014)
  - [j2] Paris Smaragdis, Cédric Févotte, Gautham J. Mysore, Nasser Mohammadiha, Matthew D. Hoffman: Static and Dynamic Source Separation Using Nonnegative Factorizations: A unified view. IEEE Signal Process. Mag. 31(3): 66-75 (2014)
  - [j1] Abe Davis, Michael Rubinstein, Neal Wadhwa, Gautham J. Mysore, Frédo Durand, William T. Freeman: The visual microphone: passive recovery of sound from video. ACM Trans. Graph. 33(4): 79:1-79:10 (2014)
  - [c32] Nicholas J. Bryan, Gautham J. Mysore, Ge Wang: ISSE: an interactive source separation editor. CHI 2014: 257-266
  - [c31] Dawen Liang, Daniel P. W. Ellis, Matthew D. Hoffman, Gautham J. Mysore: Speech decoloration based on the product-of-filters model. ICASSP 2014: 2400-2404
  - [c30] Nicolas Boulanger-Lewandowski, Gautham J. Mysore, Matthew D. Hoffman: Exploiting long-term temporal dependencies in NMF using recurrent neural networks with application to source separation. ICASSP 2014: 6969-6973
  - [c29] Dawen Liang, Matthew D. Hoffman, Gautham J. Mysore: A Generative Product-of-Filters Model of Audio. ICLR (Poster) 2014
- 2013
  - [c28] Dennis L. Sun, Gautham J. Mysore: Universal speech models for speaker independent single channel source separation. ICASSP 2013: 141-145
  - [c27] Nicholas J. Bryan, Gautham J. Mysore: Interactive refinement of supervised and semi-supervised sound source separation estimates. ICASSP 2013: 883-887
  - [c26] Nicholas J. Bryan, Gautham J. Mysore: An Efficient Posterior Regularized Latent Variable Model for Interactive Sound Source Separation. ICML (3) 2013: 208-216
  - [c25] François G. Germain, Dennis L. Sun, Gautham J. Mysore: Speaker and noise independent voice activity detection. INTERSPEECH 2013: 732-736
  - [c24] Zafar Rafii, François G. Germain, Dennis L. Sun, Gautham J. Mysore: Combining Modeling Of Singing Voice And Background Music For Automatic Separation Of Musical Mixtures. ISMIR 2013: 41-46
  - [c23] Nicholas J. Bryan, Gautham J. Mysore, Ge Wang: Source Separation of Polyphonic Music with Interactive User-Feedback on a Piano Roll Display. ISMIR 2013: 119-124
  - [c22] Steve Rubin, Floraine Berthouzoz, Gautham J. Mysore, Wilmot Li, Maneesh Agrawala: Content-based tools for editing audio stories. UIST 2013: 113-122
- 2012
  - [c21] Zhiyao Duan, Gautham J. Mysore, Paris Smaragdis: Online PLCA for Real-Time Semi-supervised Source Separation. LVA/ICA 2012: 34-41
  - [c20] Jinyu Han, Gautham J. Mysore, Bryan Pardo: Audio Imputation Using the Non-negative Hidden Markov Model. LVA/ICA 2012: 347-355
  - [c19] Gautham J. Mysore, Paris Smaragdis: A Non-negative Approach to Language Informed Speech Separation. LVA/ICA 2012: 356-363
  - [c18] Juhan Nam, Gautham J. Mysore, Paris Smaragdis: Sound Recognition in Mixtures. LVA/ICA 2012: 405-413
  - [c17] Brian King, Paris Smaragdis, Gautham J. Mysore: Noise-robust dynamic time warping using PLCA features. ICASSP 2012: 1973-1976
  - [c16] Nicholas J. Bryan, Paris Smaragdis, Gautham J. Mysore: Clustering and synchronizing multi-camera video via landmark cross-correlation. ICASSP 2012: 2389-2392
  - [c15] Paris Smaragdis, Gautham J. Mysore: Following musical sources by example. ICASSP 2012: 5373-5376
  - [c14] Gautham J. Mysore, Maneesh Sahani: Variational Inference in Non-negative Factorial Hidden Markov Models for Efficient Audio Source Separation. ICML 2012
  - [c13] Zhiyao Duan, Gautham J. Mysore, Paris Smaragdis: Speech Enhancement by Online Non-negative Spectrogram Decomposition in Non-stationary Noise Environments. INTERSPEECH 2012: 595-598
  - [c12] Jinyu Han, Gautham J. Mysore, Bryan Pardo: Language informed bandwidth expansion. MLSP 2012: 1-6
  - [c11] Steve Rubin, Floraine Berthouzoz, Gautham J. Mysore, Wilmot Li, Maneesh Agrawala: UnderScore: musical underlays for audio stories. UIST 2012: 359-366
- 2011
  - [c10] Gautham J. Mysore, Paris Smaragdis: A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics. ICASSP 2011: 17-20
  - [c9] Gautham J. Mysore, Paris Smaragdis: A convolutive spectral decomposition approach to the separation of feedback from target speech. MLSP 2011: 1-6
- 2010
  - [c8] Gautham J. Mysore, Paris Smaragdis, Bhiksha Raj: Non-negative Hidden Markov Modeling of Audio with Application to Source Separation. LVA/ICA 2010: 140-148
  - [c7] Joachim Ganseman, Paul Scheunders, Gautham J. Mysore, Jonathan S. Abel: Source Separation By Score Synthesis. ICMC 2010
  - [c6] Juhan Nam, Gautham J. Mysore, Joachim Ganseman, Kyogu Lee, Jonathan S. Abel: A super-resolution spectrogram using coupled PLCA. INTERSPEECH 2010: 1696-1699
  - [c5] Joachim Ganseman, Paul Scheunders, Gautham J. Mysore, Jonathan S. Abel: Evaluation of a Score-informed Source Separation System. ISMIR 2010: 219-224
2000 – 2009
- 2009
  - [c4] Gautham J. Mysore, Paris Smaragdis: Relative pitch estimation of multiple instruments. ICASSP 2009: 313-316
  - [c3] Paris Smaragdis, Madhusudana V. S. Shashanka, Bhiksha Raj, Gautham J. Mysore: Probabilistic Factorization of Non-negative Data with Entropic Co-occurrence Constraints. ICA 2009: 330-337
  - [c2] Paris Smaragdis, Gautham J. Mysore: Separation by "humming": User-guided sound extraction from monophonic mixtures. WASPAA 2009: 69-72
- 2005
  - [c1] Juan Pablo Cáceres, Gautham J. Mysore, Jeffrey Treviño: SCUBA: The Self-Contained Unified Bass Augmenter. NIME 2005: 38-41
last updated on 2024-05-08 21:41 CEST by the dblp team
all metadata released as open data under CC0 1.0 license