Research

Active Infernece in Videos

  • Efficient inference for object-augmented denseCRF model for multi-class semantic video segmentaion task.
  • Active inference method to select informative subset of object proposals to scale up with a large number of proposals.
[ pdf | Abstract | show subgraph details]

Anytime Scene Understanding

  • Propose a dynamic hierarchical model for anytime scene labeling that allows us to achieve flexible trade-offs between efficiency and accuracy in pixel-level prediction.
  • Learn a high-quality policy of feature and model selection based on an approximate policy iteration method.
[ pdf | Abstract | Test-time algorithm | method overview]

Holistic Dynamic Scene Understanding

  • Joint infer mutiple (semantic and geometric) labels consistently in video by introducing an intermediate scene representation.
  • Introduce hierarchical structure to enforce long-range spatio-temporal labeling smoothness.
[ pdf | Abstract | scene model overview]

Semantic Video Parsing with Object Cues

  • Joint infer supervoxel, object activation and their relationships (support and occlusion) in a unified energy function.
  • Exploit temporal cues to learn from less training data : 10 to 20 exemplars for each class.
[ pdf | Abstract | show proposal pipeline]