Publications

2026
Scale Space Diffusion
CVPR 2026 new
UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders
CVPR 2026 new
Efficient and High-Fidelity Omni Modality Retrieval
Chuong Huynh, Manh Luong, Abhinav Shrivastava
CVPR 2026 new
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Anirud Aggarwal, Abhinav Shrivastava, Matthew Gwilliam
ICLR 2026 new
NeRV-Diffusion: Diffuse Implicit Neural Representation for Video Synthesis
Yixuan Ren, Hanyu Wang, Hao Chen, Bo He, Abhinav Shrivastava
ICLR 2026 new
Towards Understanding Best Practices for Quantization of Vision-Language Models
Gautom Das, Vincent La, Ethan Lau, Abhinav Shrivastava, Matthew Gwilliam
arXiv 2026 new
VeriGraph: Scene Graphs for Execution Verifiable Robot Planning
ICRA 2026 new
How to Design and Train Your Implicit Neural Representation for Video Compression
Matthew Gwilliam, Roy Zhang, Namitha Padmanabhan, Hongyang Du, Abhinav Shrivastava
WACV 2026 new
2025
Characterizing Motion Encoding in Video Diffusion Timesteps
Vatsal Baherwani, Yixuan Ren, Abhinav Shrivastava
arXiv 2025
Growing Visual Generative Capacity for Pre-Trained MLLMs
Hanyu Wang, Jiaming Han, Ziyan Yang, Abhinav Shrivastava
arXiv 2025
Imagine, Verify, Execute: Memory-guided Agentic Exploration with Vision-Language Models
Seungjae Lee, Daniel Ekpo, Haowen Liu, Furong Huang, Abhinav Shrivastava, Jia-Bin Huang
CoRL 2025
Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor
Vatsal Agarwal, Matthew Gwilliam, Gefen Kohavi, Eshan Verma, Daniel Ulbricht, Abhinav Shrivastava
arXiv 2025
Trokens: Semantic-Aware Relational Trajectory Tokens for Few-Shot Action Recognition
ICCV 2025
Multi-entity Video Transformers for Fine-Grained Video Representation Learning
Matthew Walmer, Rose Kanjirathinkal, Kai-Sheng Tai, Keyur Muzumdar, Taipeng Tian, Abhinav Shrivastava
FGVC Workshop, CVPR 2025
CoLLM: A Large Language Model for Composed Image Retrieval
Chuong Huynh, Jinyu Yang, Ashish Tawari, Mubarak Shah, Son Tran, Raffay Hamid, Trishul Chilimbi, Abhinav Shrivastava
CVPR 2025
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
Hanyu Wang, Saksham Suri, Yixuan Ren, Hao Chen, Abhinav Shrivastava
ICLR 2025 oral
P3-PO: Prescriptive Point Priors for Visuo-Spatial Generalization of Robot Policies
Mara Levy, Siddhant Haldar, Lerrel Pinto, Abhinav Shrivastava
ICRA 2025
TREND: Tri-teaching for Robust Preference-based Reinforcement Learning with Demonstrations
Shuaiyi Huang, Mara Levy, Anubhav, Daniel Ekpo, Ruijie Zheng, Abhinav Shrivastava
ICRA 2025
A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval
Matthew Gwilliam, Michael Cogswell, Meng Ye, Karan Sikka, Abhinav Shrivastava, Ajay Divakaran
WACV 2025
Unified Framework for Open-World Compositional Zero-shot Learning
Hirunima Jayasekara, Khoi Pham, Nirat Saini, Abhinav Shrivastava
WACV 2025
2024
Efficient Continuous Video Flow Model for Video Prediction
Gaurav Shrivastava, Abhinav Shrivastava
arXiv 2024
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Xiyang Wu, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Boyd-Graber, Tianyi Zhou, Dinesh Manocha
EMNLP Findings 2024
QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos
Sharath Girish, Tianye Li, Amrita Mazumdar, Abhinav Shrivastava, David Luebke, Shalini De Mello
NeurIPS 2024
Coarse to Fine Human Mesh Recovery with Transformers
Vatsal Agarwal, Mara Levy, Max Ehrlich, Yucheng Tang, Nanxuan Zhang, Abhinav Shrivastava
T-CAP Workshop, ECCV 2024
Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
Yixuan Ren, Yifei Zhou, Jinyu Yang, Jing Shi, Difan Liu, Fuxiao Liu, Mingi Kwon, Abhinav Shrivastava
ECCV 2024
Do text-free diffusion models learn discriminative visual representations?
Soumik Mukhopadhyay, Matthew Gwilliam, Yosuke Yamaguchi, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Tianyi Zhou, Jun Ohya, Abhinav Shrivastava
ECCV 2024
EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS
Sharath Girish, Kamal Gupta, Abhinav Shrivastava
ECCV 2024
Fast Encoding and Decoding for Implicit Video Representation
Hao Chen, Saining Xie, Ser-Nam Lim, Abhinav Shrivastava
ECCV 2024
Investigating Style Similarity in Diffusion Models
Gowthami Somepalli, Anubhav, Kamal Gupta, Shramay Palta, Micah Goldblum, Jonas Geiping, Abhinav Shrivastava, Tom Goldstein
ECCV 2024
Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics
ECCV 2024
LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation
ECCV 2024
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
Saksham Suri, Matthew Walmer, Kamal Gupta, Abhinav Shrivastava
ECCV 2024
Quantifying NBA Shot Quality: A Deep Network Approach
Archit Kambhamettu, Abhinav Shrivastava, Matthew Gwilliam
ACM MMSports 2024
Trajectory-aligned Space-time Tokens for Few-shot Action Recognition
Pulkit Kumar, Namitha Padmanabhan, Luke Luo, Sai Saketh Rambhatla, Abhinav Shrivastava
ECCV 2024
ARDuP: Active Region Video Diffusion for Universal Policies
Shuaiyi Huang, Mara Levy, Zhenyu Jiang, Anima Anandkumar, Yuke Zhu, Linxi Fan, De-An Huang, Abhinav Shrivastava
IROS 2024
Challenges, Evaluation and Opportunities for Open-World Learning
Mayank Kejriwal, Eric Kildebeck, Robert Steininger, Abhinav Shrivastava
Nature Machine Intelligence 2024
Agglomerative Clustering of Atomic Actions for Unsupervised Action Segmentation
Pulkit Kumar, Austin Myers, Anurag Arnab, David A. Ross, Abhinav Shrivastava, Sudheendra Vijayanarasimhan
LPVL Workshop, CVPR 2024
UVIS: Unsupervised Video Instance Segmentation
Shuaiyi Huang, Saksham Suri, Kamal Gupta, Sai Saketh Rambhatla, Ser-Nam Lim, Abhinav Shrivastava
L3D Workshop, CVPR 2024
V-VIPE: Variational View Invariant Pose Embedding
Mara Levy, Abhinav Shrivastava
RHOI Workshop, CVPR 2024
What is Point Supervision Worth in Video Instance Segmentation?
Shuaiyi Huang, De-An Huang, Zhiding Yu, Shiyi Lan, Subhashree Radhakrishnan, Jose M. Alvarez, Abhinav Shrivastava, Anima Anandkumar
L3D Workshop, CVPR 2024
Beyond Seen Primitive Concepts and Attribute-Object Compositional Learning
Nirat Saini, Khoi Pham, Abhinav Shrivastava
CVPR 2024
Composing Object Relations and Attributes for Image-Text Matching
Khoi Pham, Chuong Huynh, Ser-Nam Lim, Abhinav Shrivastava
CVPR 2024
Explaining the Implicit Neural Canvas (XINC): Connecting Pixels to Neurons by Tracing their Contributions
CVPR 2024
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Bo He, Hengduo Li, Young Kyun Jang, Menglin Jia, Xuefei Cao, Anshul Shah, Ser-Nam Lim, Abhinav Shrivastava
CVPR 2024
MaGGIe: Masked Guided Gradual Human Instance Matting
Chuong Huynh, Seoung Wug Oh, Abhinav Shrivastava, Joon-Young Lee
CVPR 2024
Video Prediction by Modeling Videos as Continuous Multi-Dimensional Processes
Gaurav Shrivastava, Abhinav Shrivastava
CVPR 2024
Video Decomposition Prior: Editing Videos Layer by Layer
Gaurav Shrivastava, Ser-Nam Lim, Abhinav Shrivastava
ICLR 2024
WAYEX: Waypoint Exploration using a Single Demonstration
Mara Levy, Nirat Saini, Abhinav Shrivastava
ICRA 2024
Content-Aware Image Color Editing with Auxiliary Color Restoration Tasks
Yixuan Ren, Jing Shi, Zhifei Zhang, Yifei Fan, Zhe Lin, Bo He, Abhinav Shrivastava
WACV 2024
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
Soumik Mukhopadhyay, Saksham Suri, Ravi Teja Gadde, Abhinav Shrivastava
WACV 2024
GRIT: GAN Residuals for Paired Image-to-Image Translation
Saksham Suri, Moustafa Meshry, Larry Davis, Abhinav Shrivastava
WACV 2024
Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement
Max Ehrlich, Jon Barker, Namitha Padmanabhan, Larry Davis, Andrew Tao, Bryan Catanzaro, Abhinav Shrivastava
WACV 2024
Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion
Hanyu Wang, Pengxiang Wu, Kevin Dela Rosa, Chen Wang, Abhinav Shrivastava
WACV 2024
2023
Video Dynamics Prior: An Internal Learning Approach for Robust Video Enhancements
Gaurav Shrivastava, Abhinav Shrivastava
NeurIPS 2023
A Frequency Perspective of Adversarial Robustness
Shishira R Maiya, Max Ehrlich, Vatsal Agarwal, Ser-Nam Lim, Tom Goldstein, Abhinav Shrivastava
BMVC 2023
ASIC: Aligning Sparse in-the-wild Image Collections
Kamal Gupta, Varun Jampani, Carlos Esteves, Abhinav Shrivastava, Ameesh Makadia, Noah Snavely, Abhishek Kar
ICCV 2023 oral
BT2: Backward-compatible Training with Basis Transformation
Yifei Zhou, Zilu Li, Abhinav Shrivastava, Hengshuang Zhao, Antonio Torralba, Taipeng Tian, Ser-Nam Lim
ICCV 2023
Chop & Learn: Recognizing and Generating Object-State Compositions
ICCV 2023
MOST: Multiple Object Localization with Self-Supervised Transformers for Object Discovery
Sai Saketh Rambhatla, Ishan Misra, Rama Chellappa, Abhinav Shrivastava
ICCV 2023 oral
SHACIRA: Scalable HAsh-grid Compression for Implicit Neural Representations
Sharath Girish, Abhinav Shrivastava, Kamal Gupta
ICCV 2023
SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining
Saksham Suri, Sai Saketh Rambhatla, Rama Chellappa, Abhinav Shrivastava
ICCV 2023
Novelty in Image Classification
Mohsen Jafarzadeh, Akshay Raj Dhamija, Steve Cruz, Chunchun Li, Abhinav Shrivastava, Terrance E. Boult
Springer Book Chapter 2023
Align and Attend: Multimodal Summarization with Dual Contrastive Losses
Bo He, Jun Wang, Jielin Qiu, Trung Bui, Abhinav Shrivastava, Zhaowen Wang
CVPR 2023
FlexNeRF: Photorealistic Free-viewpoint Rendering of Moving Humans from Sparse Views
Vinoj Jayasundara, Amit Agrawal, Nicolas Heron, Abhinav Shrivastava, Larry Davis
CVPR 2023
HNeRV: A Hybrid Neural Representation for Videos
Hao Chen, Matthew Gwilliam, Ser-Nam Lim, Abhinav Shrivastava
CVPR 2023
NIRVANA: Neural Implicit Representations of Videos with Adaptive Networks and Autoregressive Patch-wise Modeling
Shishira R Maiya, Sharath Girish, Max Ehrlich, Hanyu Wang, Kwot Sin Lee, Patrick Poirson, Pengxiang Wu, Chen Wang, Abhinav Shrivastava
CVPR 2023
SimpSON: Simplifying Photo Cleanup With Single-Click Distracting Object Segmentation Network
Chuong Huynh, Yuqian Zhou, Zhe Lin, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi, Abhinav Shrivastava
CVPR 2023
Teaching Matters: Investigating the Role of Supervision in Vision Transformers
Matthew Walmer, Saksham Suri, Kamal Gupta, Abhinav Shrivastava
CVPR 2023
Towards Scalable Neural Representation for Diverse Videos
Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
CVPR 2023
COVID-VTS: Fact Extraction and Verification on Short Video Platforms
Fuxiao Liu, Yaser Yacoob, Abhinav Shrivastava
EACL 2023
LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
Sharath Girish, Kamal Gupta, Saurabh Singh, Abhinav Shrivastava
ICLR 2023
2022
Burn After Reading: Online Adaptation for Cross-domain Streaming Data
Luyu Yang, Mingfei Gao, Zeyuan Chen, Ran Xu, Abhinav Shrivastava, Chetan Ramaiah
ECCV 2022
Improving Closed and Open Set Attribute Prediction using Transformers
Khoi Pham, Kushal Kafle, Zhe Lin, Zhihong Ding, Scott Cohen, Quan Hung Tran, Abhinav Shrivastava
ECCV 2022
Learning Semantic Correspondence with Sparse Annotations
Shuaiyi Huang, Luyu Yang, Bo He, Songyang Zhang, Xuming He, Abhinav Shrivastava
ECCV 2022
Neural Space-Filling Curves
Hanyu Wang, Kamal Gupta, Larry Davis, Abhinav Shrivastava
ECCV 2022
ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization
Bo He, Xitong Yang, Le Kang, Zhiyu Cheng, Xin Zhou, Abhinav Shrivastava
CVPR 2022
Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation Learning
Matthew Gwilliam, Abhinav Shrivastava
CVPR 2022
Disentangling Visual Embeddings for Attributes and Objects
Nirat Saini, Khoi Pham, Abhinav Shrivastava
CVPR 2022 oral
Dual-Key Multimodal Backdoors for Visual Question Answering
Matthew Walmer, Karan Sikka, Indranil Sur, Abhinav Shrivastava, Susmit Jha
CVPR 2022
ObjectFormer for Image Manipulation Detection and Localization
Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang
CVPR 2022
Pose And Joint-Aware Action Recognition
Anshul Shah, Shlok Mishra, Ankan Bansal, Jun-Cheng Chen, Rama Chellappa, Abhinav Shrivastava
WACV 2022
Rethinking Pseudo Labels for Semi-Supervised Object Detection
Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry Davis
AAAI 2022
2021
NeRV: Neural Representations for Videos
Hao Chen, Bo He, Hanyu Wang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
NeurIPS 2021
PatchGame: Learning to Signal Mid-level Patches in Referential Games
Kamal Gupta, Gowthami Somepalli, Anubhav, Vinoj Jayasundara, Matthias Zwicker, Abhinav Shrivastava
NeurIPS 2021
Deep Co-Training with Task Decomposition for Semi-Supervised Domain Adaptation
Luyu Yang, Yan Wang, Mingfei Gao, Abhinav Shrivastava, Kilian Weinberger, Wei-Lun Chao, Ser-Nam Lim
ICCV 2021
Deep Video Inpainting Detection
Peng Zhou, Ning Yu, Zuxuan Wu, Larry Davis, Abhinav Shrivastava, Ser-Nam Lim
BMVC 2021
GTA: Global Temporal Attention for Video Action Understanding
Bo He, Xitong Yang, Zuxuan Wu, Hao Chen, Ser-Nam Lim, Abhinav Shrivastava
BMVC 2021
HR-RCNN: Hierarchical Relational Reasoning for Object Detection
Hao Chen, Abhinav Shrivastava
BMVC 2021
Layout Generation and Completion with Self-attention
Kamal Gupta, Alessandro Achille, Justin Lazarow, Larry Davis, Vijay Mahadevan, Abhinav Shrivastava
ICCV 2021
Learned Spatial Representations for Few-shot Talking-Head Synthesis
Moustafa Meshry, Saksham Suri, Larry Davis, Abhinav Shrivastava
ICCV 2021
The Pursuit of Knowledge: Discovering and Localizing Novel Categories using Dual Memory
Sai Saketh Rambhatla, Rama Chellappa, Abhinav Shrivastava
ICCV 2021
Towards Discovery and Attribution of Open-world GAN Generated Images
ICCV 2021
Leveraging Hand-Object Interactions in Assistive Egocentric Vision
Kyungjun Lee, Abhinav Shrivastava, Hernisa Kacorri
TPAMI 2021
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry Davis
CVPR 2021
Hierarchical Video Prediction for Human Object Interaction
Navaneeth Bodla, Gaurav Shrivastava, Rama Chellappa, Abhinav Shrivastava
CVPR 2021
Knowledge Evolution in Neural Networks
Ahmed Taha, Abhinav Shrivastava, Larry Davis
CVPR 2021 oral
Learning Graphs for Knowledge Transfer with Limited Labels
Pallabi Ghosh, Nirat Saini, Larry Davis, Abhinav Shrivastava
CVPR 2021
Learning to Predict Visual Attributes in the Wild
Khoi Pham, Kushal Kafle, Zhe Lin, Zhihong Ding, Scott Cohen, Quan Hung Tran, Abhinav Shrivastava
CVPR 2021
Style-based Encoder Pre-training for Multi-modal Image Synthesis
Moustafa Meshry, Yixuan Ren, Larry Davis, Abhinav Shrivastava
CVPR 2021
The Lottery Ticket Hypothesis for Object Recognition
Sharath Girish, Shishira R Maiya, Kamal Gupta, Hao Chen, Larry Davis, Abhinav Shrivastava
CVPR 2021
Diverse Video Generation using a Gaussian Process Trigger
Gaurav Shrivastava, Abhinav Shrivastava
ICLR 2021
No-frills Dynamic Planning using Static Planners
Mara Levy, Vasista Ayyagari, Abhinav Shrivastava
ICRA 2021
A Unifying Framework for Formal Theories of Novelty
Terrance E. Boult, Przemyslaw A. Grabowicz, Derek S. Prijatelj, Roni Stern, Lawrence Holder, Joshua Alspector, Mohsen Jafarzadeh, Touqeer Ahmad, Akshay Raj Dhamija, Chunchun Li, Steve Cruz, Abhinav Shrivastava, Carl Vondrick, Walter J. Scheirer
AAAI 2021 BlueSky talk
2020
All About Knowledge Graphs for Actions
Pallabi Ghosh, Nirat Saini, Larry Davis, Abhinav Shrivastava
arXiv 2020
A Generic Visualization Approach for Convolutional Neural Networks
Ahmed Taha, Xitong Yang, Abhinav Shrivastava, Larry Davis
ECCV 2020
Curriculum Manager for Source Selection in Multi-Source Domain Adaptation
Luyu Yang, Yogesh Balaji, Ser-Nam Lim, Abhinav Shrivastava
ECCV 2020
Depth Completion using a View-constrained Deep Prior
Pallabi Ghosh, Vibhav Vineet, Larry Davis, Abhinav Shrivastava, Sudipta Sinha, Neel Joshi
3DV 2020
Group Ensemble: Learning an Ensemble of ConvNets in a single ConvNet
Hao Chen, Abhinav Shrivastava
arXiv 2020
Improved Modeling of 3D Shapes with Multi-view Depth Maps
Kamal Gupta, Susmija Jabbireddy, Ketul Shah, Abhinav Shrivastava, Matthias Zwicker
3DV 2020 oral
Quantization Guided JPEG Artifact Correction
Max Ehrlich, Ser-Nam Lim, Larry Davis, Abhinav Shrivastava
ECCV 2020
End-to-end Learning of Compressible Features
Saurabh Singh, Sami Abu-El-Haija, Nick Johnston, Johannes Balle, Abhinav Shrivastava, George Toderici
ICIP 2020
PatchVAE: Learning Local Latent Codes for Recognition
Kamal Gupta, Saurabh Singh, Abhinav Shrivastava
CVPR 2020
Scalable Model Compression by Entropy Penalized Reparameterization
Deniz Oktay, Johannes Balle, Saurabh Singh, Abhinav Shrivastava
ICLR 2020
Boosting Standard Classification Architectures Through a Ranking Regularizer
Ahmed Taha, Yi-Ting Chen, Teruhisa Misu, Abhinav Shrivastava, Larry Davis
WACV 2020
Hand-Priming in Object Localization for Assistive Egocentric Vision
Kyungjun Lee, Abhinav Shrivastava, Hernisa Kacorri
WACV 2020 oral, best paper award
Detecting Human-Object Interactions via Functional Generalization
Ankan Bansal, Sai Saketh Rambhatla, Abhinav Shrivastava, Rama Chellappa
AAAI 2020
Generate, Segment and Refine: Towards Generic Manipulation Segmentation
Peng Zhou, Bor-Chun Chen, Xintong Han, Mahyar Najibi, Abhinav Shrivastava, Ser-Nam Lim, Larry Davis
AAAI 2020
2019
Render4Completion: Synthesizing Multi-view Depth Maps for 3D Shape Completion
Tao Hu, Zhizhong Han, Abhinav Shrivastava, Matthias Zwicker
GeoMDL Workshop, ICCV 2019
EvalNorm: Estimating Batch Normalization Statistics for Evaluation
Saurabh Singh, Abhinav Shrivastava
ICCV 2019
Relational Action Forecasting
CVPR 2019 best paper finalist
2018
Actor-centric Relation Network
ECCV 2018
Tracking Emerges by Colorizing Videos
ECCV 2018
2017
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta
ICCV 2017 spotlight
A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection
Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta
CVPR 2017
2016
Beyond Skip Connections: Top-Down Modulation for Object Detection
arXiv 2016
Contextual Priming and Feedback for Faster R-CNN
Abhinav Shrivastava, Abhinav Gupta
ECCV 2016
Cross-stitch Networks for Multi-task Learning
Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, Martial Hebert
CVPR 2016 spotlight
Training Region-based Object Detectors with Online Hard Example Mining
Abhinav Shrivastava, Abhinav Gupta, Ross Girshick
CVPR 2016 oral
2015
Applying artificial vision models to human scene understanding
Frontiers in Computational Neuroscience 2015
Mid-level Elements for Object Detection
Aayush Bansal, Abhinav Shrivastava, Carl Doersch, Abhinav Gupta
arXiv 2015
Watch and Learn: Semi-supervised Learning of Object Detectors from Videos
Ishan Misra, Abhinav Shrivastava, Martial Hebert
CVPR 2015
2014
Enriching Visual Knowledge Bases via Object Discovery and Segmentation
Xinlei Chen, Abhinav Shrivastava, Abhinav Gupta
CVPR 2014
Data-driven Exemplar Model Selection
Ishan Misra, Abhinav Shrivastava, Martial Hebert
WACV 2014 oral, best student paper award
2013
Building Part-based Object Detectors via 3D Geometry
Abhinav Shrivastava, Abhinav Gupta
ICCV 2013
NEIL: Extracting Visual Knowledge from Web Data
Xinlei Chen, Abhinav Shrivastava, Abhinav Gupta
ICCV 2013 oral
HOG and Spatial Convolution on SIMD Architecture
Ishan Misra, Abhinav Shrivastava, Martial Hebert
CMU Technical Report 2013
Measuring and Increasing the capacity of Natural HOG Statistics
CMU Technical Report 2013
2012
Constrained Semi-Supervised Learning using Attributes and Comparative Attributes
Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta
ECCV 2012 oral
Exemplar-SVMs for Visual Object Detection, Label Transfer and Image Retrieval
ICML 2012
Real-time Household Object Detection from First-person's view using Exemplar-SVMs
Abhinav Shrivastava, Abhinav Gupta, Alexei A. Efros
Ego-Vision Workshop, CVPR 2012
2011
Data-driven Visual Similarity for Cross-domain Image Matching
SIGGRAPH Asia 2011 oral

Patents

Action localization in images and videos using relational features
C. Sun, A. Shrivastava, C. L. Schmid, R. Sukthankar, K. P. Murphy, C. M. Vondrick
US 11163989 · Google Inc.
Visual Tracking by Colorization
A. Shrivastava, A. Fathi, S. G. Cotado, K. P. Murphy, C. M. Vondrick
US20210089777A1 · Google Inc.
Learning Compressible Features
A. Shrivastava, S. Singh, J. Balle, S. A. Haija, N. Johnston, G. Toderici
US20200311548A1 · Google Inc.
Compression of Machine-Learned Models via Entropy Penalized Weight Reparameterization
D. Oktay, S. Singh, J. Balle, A. Shrivastava
US20200364603A1 · Google Inc.
Determining documents that match a query
S. Mehrotra, J. Li, A. Shrivastava
US9442929B2 · Microsoft Technology Licensing LLC