Segmenting, Modeling, and Matching Video Clips Containing Multiple Moving Objects

Fred Rothganger, Member, IEEE, Svetlana Lazebnik, Member, IEEE, Cordelia Schmid, Senior Member, IEEE, Jean Ponce, IEEE Fellow

Abstract: This article presents a novel representation for dynamic scenes composed of multiple rigid objects that may undergo different motions and are observed by a moving camera. Multi-view constraints associated with groups of affine-covariant scene patches and a normalized description of their appearance are used to segment a scene into its rigid components, construct three-dimensional models of these components, and match instances of models recovered from different image sequences. The proposed approach has been applied to the detection and matching of moving objects in video sequences and to shot matching, i.e., the identification of shots that depict the same scene in a video clip.

Index Terms: Affine-covariant patches, structure from motion, motion segmentation, shot matching, video retrieval.

I. INTRODUCTION

The explosion in both the richness and quantity of digital video content available to the average consumer creates a need for indexing and retrieval tools to effectively manage the large volume of data and efficiently access specific frames, scenes, and/or shots. Most existing video search tools [1], [2], [3], [4] rely on the appearance and two-dimensional (2D) geometric attributes of individual frames in the sequence, and they do not take advantage of the stronger three-dimensional (3D) constraints associated with multiple frames.