I am a first year PhD student at the Center for Machine Perception , Czech Technical University in Prague, where I am supervised by Prof. Jiří Matas. Previously, I graduated with a Master's degree in Computer Vision from the Robotics Institute of Carnegie Mellon University, where I worked with Prof. Abhinav Gupta. During the Masters, I did two Research Internships at Amazon (first at A9 and second at AWS-AI), during these internships, my work was supervised by Prof. R. Manmatha and Prof. Alex Smola. Even before that, I obtained a Bachelor in Technology with Honors by Research in Computer Science and Engineering from International Institute of Information Technology, Hyderabad (IIIT-H). During my undergrad, I was working with Prof. C.V. Jawahar at the Center for Visual Information Technology (CVIT). At some point during my undergrad, I did a Research Internship at the Computer Vision Center (CVC), Universitat Autònoma de Barcelona, where I was supervised by Prof. Dimosthenis Karatzas. I did another internship during my undergrad at Center for Machine Perception , Czech Technical University in Prague, where I was supervised by Prof. Jiří Matas. Research Interests: Self-Supervised Representation Learning, Image Compression, Scene Text Detection and Recognition, Tracking and Segmentation in Videos, 3D Reconstruction email | Twitter | GitHub | LinkedIn | Google Scholar | ResearchGate patelyas AT cmp DOT felk DOT cvut DOT cz yashp AT alumni DOT cmu DOT edu |
![]() |
[NEW] Saliency Driven Perceptual Image Compression
Yash Patel, Srikar Appalaraju, R. Manmatha
Winter Applications of Computer Vision (WACV), 2021
pdf   abstract   bibtex   supplementary material   video
[NEW] Learning Surrogates via Deep Embedding
Yash Patel, Tomas Hodan, Jiri Matas
European Conference on Computer Vision (ECCV), 2020
pdf   abstract   bibtex   video   long video
[NEW] Neural Network-based Acoustic Vehicle Counting
Slobodan Djukanović, Yash Patel, Jiři Matas, Tuomas Virtanen
arXiv e-print, 2020 (under review)
pdf   abstract   bibtex
Deep Perceptual Compression
Yash Patel, Srikar Appalaraju, R. Manmatha
arXiv e-print, 2019
pdf   abstract   bibtex
Human Perceptual Evaluations for Image Compression
Yash Patel, Srikar Appalaraju, R. Manmatha
arXiv e-print, 2019
pdf   abstract   bibtex
Self-Supervised Visual Representations for Cross-Modal Retrieval
Yash Patel, Lluis Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C.V. Jawahar
International Conference on Multimedia Retrieval (ICMR), 2019
Spotlight
pdf   abstract   bibtex
ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition--RRC-MLT-2019
Nibal Nayef*, Yash Patel*, Michal Busta, Pinaki Nath Chowdhury, Dimosthenis Karatzas, Wafa Khlif, Jiri Matas, Umapada Pal, Jean-Christophe Burie, Cheng-lin Liu, Jean-Marc Ogier
International Conference on Document Analysis and Recognition (ICDAR), 2019
Oral
pdf   abstract   bibtex   portal
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
Michal Bušta, Yash Patel, Jiri Matas
International Workshop on Robust Reading, Asian Conference on Computer Vision (ACCV), 2018
Best Paper Award
pdf   abstract   bibtex   code
TextTopicNet-Self-Supervised Learning of Visual Features Through Embedding Images on Semantic Text Spaces
Yash Patel, Lluis Gomez, Raul Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C.V. Jawahar
Under Review at Pattern Recognition Journal, arXiv e-print, 2018
pdf   abstract   bibtex   code
Learning Sampling Policies for Domain Adaptation
Yash Patel*, Kashyap Chitta*, Bhavan Jasani*
ArXiv e-prints, 2018
pdf   abstract   bibtex   code
Self-Supervised Learning of Visual Features through Embedding Images into Text Topic Spaces
Lluis Gomez*, Yash Patel*, Marçal Rusiñol, Dimosthenis Karatzas, CV Jawahar
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
pdf   abstract   bibtex   code
Dynamic Lexicon Generation for Natural Scene Images
Yash Patel, Lluis Gomez, Marçal Rusiñol, Dimosthenis Karatzas
International Workshop on Robust Reading, European Conference on Computer Vision (ECCV), 2016
pdf   abstract   bibtex   code
Dynamic Narratives for Heritage Tour
Anurag Ghosh*, Yash Patel*, Mohak Sukhwani, CV Jawahar
VisART, European Conference on Computer Vision (ECCV), 2016
pdf   abstract   bibtex   code