[payload] Velocity Vector Fields, Computer Vision and Digital Video

Adam Sobieski <adamsobieski@hotmail.com> Sun, 02 December 2012 05:15 UTC

Return-Path: <adamsobieski@hotmail.com>
X-Original-To: payload@ietfa.amsl.com
Delivered-To: payload@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 6FD2C21E80A3 for <payload@ietfa.amsl.com>; Sat, 1 Dec 2012 21:15:35 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 1.755
X-Spam-Level: *
X-Spam-Status: No, score=1.755 tagged_above=-999 required=5 tests=[BAYES_50=0.001, HTML_MESSAGE=0.001, MIME_BASE64_TEXT=1.753]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id x8M5HtnIFIAU for <payload@ietfa.amsl.com>; Sat, 1 Dec 2012 21:15:34 -0800 (PST)
Received: from snt0-omc2-s27.snt0.hotmail.com (snt0-omc2-s27.snt0.hotmail.com [65.55.90.102]) by ietfa.amsl.com (Postfix) with ESMTP id 116EC21E8091 for <payload@ietf.org>; Sat, 1 Dec 2012 21:15:34 -0800 (PST)
Received: from SNT002-W46 ([65.55.90.73]) by snt0-omc2-s27.snt0.hotmail.com with Microsoft SMTPSVC(6.0.3790.4675); Sat, 1 Dec 2012 21:15:32 -0800
Message-ID: <SNT002-W4652BC8CA7522A6B391A5AC5410@phx.gbl>
Content-Type: multipart/alternative; boundary="_8da7161c-6e3b-4a63-ab02-09efb42bc10e_"
X-Originating-IP: [174.100.43.136]
From: Adam Sobieski <adamsobieski@hotmail.com>
To: "payload@ietf.org" <payload@ietf.org>
Date: Sun, 02 Dec 2012 05:15:32 +0000
Importance: Normal
MIME-Version: 1.0
X-OriginalArrivalTime: 02 Dec 2012 05:15:32.0867 (UTC) FILETIME=[0DA5BD30:01CDD04C]
Subject: [payload] Velocity Vector Fields, Computer Vision and Digital Video
X-BeenThere: payload@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Audio/Video Transport Payloads working group discussion list <payload.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/payload>, <mailto:payload-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/payload>
List-Post: <mailto:payload@ietf.org>
List-Help: <mailto:payload-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/payload>, <mailto:payload-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 02 Dec 2012 05:15:35 -0000

Internet Engineering Task Force,
Payload Working Group,
 
Greetings.  A digital video technology topic is that challenges in motion estimation have included that motion was three-dimensional while video images were projections of 3D motion onto two-dimensional bitmaps.  Velocity and digital video are interrelated topics and exciting research, applicable to both computer vision and digital video topics, is underway with regard to real-time 3D velocity vector fields with Kinect sensors.
 
An ICCV 2011 paper, Kinecting the Dots: Particle Based Scene Flow from Depth Sensors, by Simon Hadfield (http://personal.ee.surrey.ac.uk/Personal/R.Bowden/publications/2011/ICCV/Hadfield_ICCV11.pdf, http://www.youtube.com/watch?v=10CWOZcGMv0) illustrates velocity vector fields from Kinect data and describes a CPU implementation while a parallel GPU implementation with real-time performance is anticipated.
 
With regard to 3D video, current technologies include MPEG multiview coding (http://www.ietf.org/id/draft-ietf-payload-rtp-mvc-02.txt), free viewpoint video, and 2D-plus-depth video, resembling the data from RGBD sensors, and, in addition to previous techniques for super-resolution, both the depth stream data and computed data, such as surface geometry and velocity vector fields, can be of use to heuristics for super-resolving video streams. 2D-Plus-Depth Based Resolution and Frame-rate Up-conversion Technique for Depth Video by Jinwook Choi, Dongbo Min, and Kwanghoon Sohn discusses some related super-resolution topics.
 
Interesting scenarios include: video calls, video conferences, video blogs and video forums. With the super-resolution of streams from RGBD sensors, streaming and storage efficiencies can be enhanced.  In addition to 2D-plus-depth, stereoscopic 3D video, and MPEG MVC, also possible are: 3D video streaming, video calling and conferencing, interoperable with head tracking, 2D+Velocity or RGBV video, and 3D video formats based upon, instead of sequences of bitmaps, sequences of point clouds or 3D surfaces.
 
With regard to the numerical precision of depth measurements and that of velocity and acceleration vector fields, functions to obtain 3D probability distributions around each 3D point are of interest. Accuracy Analysis of Kinect Depth Data by K. Khoshelham discusses the numerical precision of Kinect sensor depth measurements. 3D with Kinect by Jan Smisek, Michal Jancosek and Tomas Pajdla also discusses the calibration of and accuracy analysis of Kinect sensors as well as structure from motion.  Accuracy analysis of depth measurements and of the timestamps of the depth and RGB data streams can be of use to numerical algorithms for precisely obtaining and utilizing depth measurements and velocity and acceleration vector fields.
 
Additionally, towards combined uses of depth and RGB data, where depth data can super-resolve RGB data and RGB data can super-resolve depth data, some approaches include edge detection, silhouettes, visual hull construction, and topics discussed in Silhouette-based 3D Model Reconstruction from Multiple Images by Adem Yasar Mulayim, Ulas Yılmaz, and Volkan Atalay, and Shape-From-Silhouette of Articulated Objects and its Use for Human Body Kinematics Estimation and Motion Capture by German K. M. Cheung, Simon Baker, and Takeo Kanade.  Shape from shading is discussed in Shape from Shading: A Survey by Ruo Zhang, Ping-Sing Tsai, James Edwin Cryer, and Mubarak Shah and Recovering Surface Reflectance and Multiple Light Locations and Intensities from Image Data by S. Xu and A. M. Wallace.



Kind regards,
 
Adam Sobieski