Image enhancement approaches often assume that the noise is signal independent, and approximate the degradation model as zero-mean additive Gaussian noise. However, this assumption does not hold for biomedical imaging systems where sensor-based sources of noise are proportional to signal strengths, and the noise is better represented as a Poisson process. The MICCAI paper explores...
Multi-task learning commonly encounters competition for resources among tasks, specifically when model capacity is limited. This challenge motivates models which allow control over the relative importance of tasks and total compute cost during inference time. In this work, we propose such a controllable multi-task network that dynamically adjusts its architecture and weights to match the...
The majority of the existing attack mechanisms today are targeted toward misclassifying specific objects and activities. However, most scenes contain multiple objects and there is usually some relationship among the objects in the scene, e.g., certain objects co-occur more frequently than others. This is often referred to as context in computer vision. We have shown...
VCG researchers are involved in two projects related to machine learning for robot autonomy. The first, funded under the National Robotics Initiative, will help monitor the health of crops. It is a collaboration with UC Merced. The second, funded by the UC Office of the President, will study the impact of agricultural technology on workforce...
Given a text query, retrieving the relevant video segments is an important and challenging problem. While many methods have looked at this problem for individual videos, our work is the first that considers a video corpus. We propose a hierarchical approach that considers intra-video and inter-video semantic relationships. More details can be found in our...
1. The paper Unsupervised Multi-source Domain Adaptation Without Access to Source Data, accepted as oral in CVPR 2021, proposes the Data frEe multi-sourCe unsupervISed domain adaptatiON (DECISION), which identifies the optimal blend of source models with no source data to generate the target model by optimizing a carefully designed unsupervised loss. Under intuitive assumptions, it...
The paper titled "Spatio-Temporal Representation Factorization for Video-based Person Re-Identification" proposed a flexible new computational unit, Spatio-Temporal Representation Factorization module (STRF), that can be used in conjunction with most existing 3D convolutional neural network architectures for re-ID. The key innovations of STRF over prior works include explicit pathways for learning discriminative temporal and spatial features...
Can you predict an activity that may occur in the near future and provide a natural language description of it? See our recent journal paper on this topic.
Amit Roy-Chowdhury is leading a team of ECE and CSE faculty that has received a grant totaling almost $1 million from the Defense Advanced Research Projects Agency, or DARPA, to understand the vulnerability of computer vision systems to adversarial attacks. The project is part of the Techniques for Machine Vision Disruption program, which is part...
Papers on High-Frame Rate Video Enhancement, Adversarial Knowledge Transfer and Video Fast-Forwarding accepted to ACM-MM 2020. The paper, ALANET: Adaptive Latent Attention Network for Joint Video Deblurring and Interpolation, accepted as oral in ACM-MM 2020, proposes the Adaptive Latent Attention Network (ALANET), to synthesize sharp high frame-rate videos by jointly performing the task of both...
Three papers on different topics were accepted to ECCV 2020 The paper titled "Domain Adaptive Semantic Segmentation Using Weak Labels" proposes a unified framework for both unsupervised domain adaptation when using the estimated pseudo-weak labels and the novel scenario of weakly-supervised adaptation when using weak labels acquired from human annotators. This work bridges the gap...
Papers on Non-Adversarial Video Synthesis with Learned Priors and Camera On-boarding for Person Re-identification using Hypothesis Transfer Learning are accepted to CVPR 2020. 1. The paper, Non-Adversarial Video Synthesis with Learned Priors, in CVPR 2020 proposes an approach to generate videos from latent noise vectors, without any reference input frames by jointly optimizing the input...
Our recent paper, Learning from Trajectories via Subgoal Discovery, in NeurIPS 2019 proposes an approach to discover subgoals to solve complex tasks, from a limited number of expert demonstrations. Using these discovered subgoals leads to significantly faster robotic task learning with none to limited hand-designing of rewards. This work is done in collaboration between UCR...
New 1.2 Million NSF grant “III: Medium: Efficient Collaborative Perception over Controllable Agent Networks” was awarded to a team of 3 CSE Professors: Vagelis Christidis (PI), Vassilis Tsotras, and Vagelis Papalexakis and 2 ECE Professors: Amit Roy-Chowdhury and Konstantinos Karydis.
Amit Roy-Chowdhury, Konstantinos Karydis, and Salman Asif, electrical and computer engineering professors, along with Fabio Pasqualetti, a mechanical engineering professor, receive a $1.2M grant from the Office of Naval Research’s Science of Artificial Intelligence Program. Using recent advances in computer vision, machine learning, signal processing, robotics, and controls, researchers will explore AI’s capability of both...
Recent papers in CVPR, ACM MM, and ACM ICMR have analyzed the issue of jointly analyzing visual and textual data. The CVPR-19 paper focuses on video moment retrieval based on associated captions, the ACM MM-18 paper considers web supervision for retrieval in joint image-text databases with limited labeled data, while the ICMR paper considered cross-modal...
A recent paper in T-IP proposed a novel deep network architecture for identifying image forgeries and presented results on multiple state-of-the-art datasets. A paper in NDSS-19 proposed how an adversary could design perturbations in video classification systems. Hybrid LSTM and Encoder-Decoder Architecture for Detection of Image Forgeries, IEEE Trans. on Image Processing, 2019. Adversarial Perturbations...