About
I am a Research Scientist at GenAI Research (Meta) where I lead the generative efforts on…
Activity
-
⚒️ Image, video, and audio generation are a fundamental building block for Generative AI research and applications in the real world. In this talk…
⚒️ Image, video, and audio generation are a fundamental building block for Generative AI research and applications in the real world. In this talk…
Liked by Ishan Misra
-
42 years ago today our digital vocabulary expanded tremendously through a simple :-) or :-( On Sept. 19, 1982, Scott Fahlman, a research professor…
42 years ago today our digital vocabulary expanded tremendously through a simple :-) or :-( On Sept. 19, 1982, Scott Fahlman, a research professor…
Liked by Ishan Misra
-
Spreading the :-) Scott Fahlman, a Carnegie Mellon University School of Computer Science professor who invented the smiley emotion 42 years ago…
Spreading the :-) Scott Fahlman, a Carnegie Mellon University School of Computer Science professor who invented the smiley emotion 42 years ago…
Liked by Ishan Misra
Experience
Education
-
Carnegie Mellon University
-
Activities and Societies: SCS Distinguished Thesis Award (Runner up) across all CS, NLP, ML, Vision, Robotics PhD thesis at CMU in 2018.
-
-
Activities and Societies: Siebel Scholarship 2014; Best Paper award at WACV 2014
-
-
Activities and Societies: Vocalist for College Rock Band (Apotheosys) Editorial Team member for College newspaper (Ping)
-
-
Publications
-
Cut Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
ICCV
A simple data augmentation scheme to train detectors for instance detection. Gets state-of-the-art results on benchmark datasets.
Other authorsSee publication -
From Red Wine to Red Tomato: Composition with Context
CVPR
Composing classifiers for zero-shot visual learning
-
Shuffle and Learn: Unsupervised Learning using Temporal Order Verification
ECCV
Self-supervised representation learning method for video data.
-
Cross-stitch Networks for Multi-Task Learning
CVPR
We propose a new unit called "cross-stitch" unit that can combine information across different convolutional neural networks, and thus helps in multi-task learning.
-
Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels
CVPR
We propose a method to incorporate labeling noise while training visual classifiers.
-
Watch and Learn: Semi-Supervised Learning of Object Detectors from Video
CVPR
We use temporal constraints in a video to train object detectors in a semi-supervised fashion.
-
Using Org-mode and Subversion for Managing and Publishing Content in Computer Science Courses
IEEE Conference on Technology for Education (T4E)
Content creation and management is an inevitable part of teaching a course. This paper describes a novel way of handling this problem using Org-mode, a recently created text based information management tool being used within the Emacs user community. We list certain desirable features, specially for the purposes of computer science courses, such as support for collaborative development and literate programming. We show how Org-mode compares favourably over other approaches like wikis and other…
Content creation and management is an inevitable part of teaching a course. This paper describes a novel way of handling this problem using Org-mode, a recently created text based information management tool being used within the Emacs user community. We list certain desirable features, specially for the purposes of computer science courses, such as support for collaborative development and literate programming. We show how Org-mode compares favourably over other approaches like wikis and other content and course management systems. We describe why the combination of Org-mode and version control is suitable for creating and publishing content quickly, with minimum overhead in a collaborative manner.
Other authors -
Hybrid Implementation of Error Diffusion Dithering
IEEE International Conference on High Performance Computing (HiPC)
Many image filtering operations provide ample parallelism, but progressive non-linear processing of images is among the hardest to parallelize due to long, sequential, and non-linear data dependency. A typical example of such an operation is error diffusion dithering, exemplified by the Floyd-Steinberg algorithm. In this paper, we present its parallelization on multicore CPUs using a block-based approach and on the GPU using a pixel based approach. We also present a hybrid approach in which the…
Many image filtering operations provide ample parallelism, but progressive non-linear processing of images is among the hardest to parallelize due to long, sequential, and non-linear data dependency. A typical example of such an operation is error diffusion dithering, exemplified by the Floyd-Steinberg algorithm. In this paper, we present its parallelization on multicore CPUs using a block-based approach and on the GPU using a pixel based approach. We also present a hybrid approach in which the CPU and the GPU operate in parallel during the computation. High Performance Computing has traditionally been associated with high end CPUs and GPUs. Our focus is on everyday computers such as laptops and desktops, where significant compute power is available on the GPU as on the CPU. Our implementation can dither an 8K × 8K image on an off-the-shelf laptop with an Nvidia 8600M GPU in about 400 milliseconds when the sequential implementation on its CPU took about 4 seconds
Other authors
Patents
-
Optimizing multi-class image classification using patch features
Filed US WO 2016118286 A1
Optimizing multi-class image classification by leveraging patch-based features extracted from weakly supervised images to train classifiers is described.
Other inventorsSee patent -
Optimizing multi-class multimedia data classification using negative data
US 9785866
Techniques for optimizing multi-class image classification by leveraging negative multimedia data items to train and update classifiers are described
Other inventorsSee patent
Courses
-
Computer Vision
-
-
Distributed Systems
-
-
Machine Learning
-
-
Pattern Recognition
-
Projects
-
HOG and Spatial Convolution on SIMD Architecture
-
C++ Implementations for
- Exemplar SVM testing pipeline
- HOG feature computation
- Spatial ConvolutionOther creatorsSee project
Honors & Awards
-
MIT Tech Review's 35 innovators under 35
MIT Tech Review
Recognized for my contributions to self-supervised learning research by MIT Tech Review in 2022. I was featured in their 35 under 35 list which is compiled across all technological disciplines worldwide.
Read more - https://www.technologyreview.com/innovator/ishan-misra/ -
Best Student Paper: Data-Driven Exemplar Model Selection
IEEE WACV
-
Gold Medal for highest CGPA
International Institute of Information Technology
Languages
-
English
-
-
Hindi
-
-
Marathi
-
Organizations
-
Carnegie Mellon University
Graduate Student
- Present -
INRIA
Intern/Stagiaire
- -
Yale University
Summer Intern
-
More activity by Ishan
-
Thank you so much for the award, Carnegie Mellon University! Very grateful for the formative years of my research career and education :)
Thank you so much for the award, Carnegie Mellon University! Very grateful for the formative years of my research career and education :)
Shared by Ishan Misra
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreOthers named Ishan Misra
-
Ishan Misra
-
Ishan Misra
Java Microservices, ETLs, Spring MVC & SpringBoot, Docker, Locust, Keycloak (IAM), Kubernetes Basics and Agile
-
Ishan Misra
Territory Sales Incharge - 1 at AMUL | Former R&D Intern at Marico Ltd. | NIFTEM
-
Ishan Misra
The WRITER you have been LOOKING FOR
12 others named Ishan Misra are on LinkedIn
See others named Ishan Misra