
Zhengyou Zhang, Microsoft Research

Digital Object Identifier: 10.1109/MMUL.2012.24, Publication Year: 2012, Page(s): 4-10

Professor: Yih-Ran Sheu
Student: Chien-Lin Wu (MA220301)

Outline
Abstract
Introduction
1. Kinect Sensor
2. Kinect Skeletal Tracking
3. Head-Pose and Facial-Expression Tracking
Conclusion
References

Kinect’s impact has extended far beyond the gaming industry. With its wide availability and low cost, many researchers and practitioners in computer science, electronic engineering, and robotics are leveraging the sensing technology to develop creative new ways to interact with machines and to perform other tasks, from helping children with autism to assisting doctors in operating rooms.

The Kinect sensor incorporates several pieces of advanced sensing hardware. Most notably, it contains a depth sensor, a color camera, and a four-microphone array, which together provide full-body 3D motion capture, facial recognition, and voice recognition capabilities.
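To make the hardware description concrete, here is a minimal sketch of reading one depth frame and one color frame from the sensor. It assumes the open-source libfreenect Python bindings (the `freenect` module), which the article does not prescribe; the official Kinect SDK exposes similar functionality through a different API, so treat this as one possible setup rather than the canonical one.

```python
# Minimal sketch: grab one synchronized depth + color frame pair from device 0,
# assuming the open-source libfreenect Python bindings ("freenect") are installed.
import freenect

def grab_frames():
    """Return (depth, rgb) as NumPy arrays captured from the first Kinect."""
    depth, _timestamp = freenect.sync_get_depth()  # 640x480 raw depth map
    rgb, _timestamp = freenect.sync_get_video()    # 640x480x3 RGB image
    return depth, rgb

if __name__ == "__main__":
    depth, rgb = grab_frames()
    print("depth shape:", depth.shape, "rgb shape:", rgb.shape)
```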

Kinect sensor components: infrared projector, RGB camera, and infrared camera.

The infrared (IR) dots seen by the IR camera. The image on the left shows a close-up of the red boxed area.

Kinect sensor depth image. The sensor produced this depth image from the infrared (IR) dot image.
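Such a depth image can be back-projected into a 3D point cloud with the standard pinhole camera model. The sketch below assumes depth values already converted to meters and uses placeholder intrinsics (fx, fy, cx, cy) that are illustrative, not values taken from the article.

```python
# Back-project a depth image into a 3D point cloud using the pinhole model.
# Intrinsics are illustrative placeholders, not calibrated Kinect values.
import numpy as np

def depth_to_point_cloud(depth_m, fx=525.0, fy=525.0, cx=319.5, cy=239.5):
    """depth_m: HxW array of depths in meters (0 where invalid).
    Returns an Nx3 array of (X, Y, Z) points in the depth camera frame."""
    h, w = depth_m.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel coordinates
    z = depth_m
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    points = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    return points[points[:, 2] > 0]  # drop invalid (zero-depth) pixels
```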

Kinect calibration card. To recalibrate the Kinect sensor, the 3D coordinates of the feature points on the calibration card are determined in the RGB camera's coordinate system and treated as the true values.
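One way to realize such a recalibration step is to measure the card's feature points with the depth camera, take their RGB-frame coordinates as ground truth, and solve for the rigid transform that best aligns the two point sets. The least-squares SVD (Kabsch/Procrustes) solver sketched below is an illustrative choice, not necessarily the procedure the sensor itself uses.

```python
# Sketch: estimate the rigid transform (R, t) aligning depth-camera measurements
# of the calibration-card feature points to their "true" RGB-frame coordinates,
# via the least-squares SVD (Kabsch/Procrustes) solution.
import numpy as np

def rigid_align(depth_pts, rgb_pts):
    """depth_pts, rgb_pts: Nx3 corresponding points. Returns (R, t) such that
    R @ depth_pts[i] + t approximates rgb_pts[i]."""
    cd, cr = depth_pts.mean(axis=0), rgb_pts.mean(axis=0)
    H = (depth_pts - cd).T @ (rgb_pts - cr)  # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    # Correct for a possible reflection so R is a proper rotation.
    D = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])
    R = Vt.T @ D @ U.T
    t = cr - R @ cd
    return R, t
```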

In skeletal tracking, a human body is represented by a number of joints corresponding to body parts (such as the head, neck, shoulders, and arms). Each joint is represented by its 3D coordinates. The goal is to determine all the 3D parameters of these joints in real time to allow fluent interactivity, using only the limited computation resources allocated on the Xbox 360 so as not to impact gaming performance. Rather than trying to determine the body pose directly in this high-dimensional space, the pipeline first solves an easier per-pixel classification problem, as described below.

(a) Using a skeletal representation of various body parts, (b) Kinect uses per-pixel body-part recognition as an intermediate step to avoid a combinatorial search over the different body joints.

The Kinect skeletal tracking pipeline.
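The pipeline's final stage converts the per-pixel body-part labels plus the depth image into 3D joint proposals. The published pipeline uses a local mode-finding (mean-shift) step for this; the sketch below substitutes a simple per-part 3D centroid, so it is a simplified approximation rather than the shipped algorithm, and its intrinsics are placeholders.

```python
# Simplified sketch: turn a per-pixel body-part label map plus the depth image
# into one 3D joint proposal per part (centroid stand-in for mean-shift).
import numpy as np

def joint_proposals(labels, depth_m, num_parts,
                    fx=525.0, fy=525.0, cx=319.5, cy=239.5):
    """labels: HxW int array of body-part ids (-1 = background).
    depth_m: HxW depth in meters. Returns {part_id: (X, Y, Z)}."""
    h, w = labels.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    proposals = {}
    for part in range(num_parts):
        mask = (labels == part) & (depth_m > 0)
        if not mask.any():
            continue
        z = depth_m[mask]
        x = (u[mask] - cx) * z / fx
        y = (v[mask] - cy) * z / fy
        proposals[part] = (x.mean(), y.mean(), z.mean())  # centroid as proposal
    return proposals
```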

Head-pose and facial-expression tracking has been an active research area in computer vision for several decades. It has many applications, including human-computer interaction, performance-driven facial animation, and face recognition. Most previous approaches focus on 2D images, so they must exploit some appearance and shape models because there are few distinct facial features. They might still suffer from lighting and texture variations, occlusion at profile poses, and so forth.

An example of a human face captured by the Kinect sensor. (a) Video frame (texture). (b) Depth image. (c) Close-up of the facial surface.

Facial expression tracking. These sample images show the results of Kinect tracking 2D feature points in video frames using a projected face mesh overlay.
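The overlay in these images amounts to projecting the tracked 3D face-mesh vertices into the color image with a pinhole camera model. The sketch below assumes a given head pose (R, t) and placeholder intrinsics; it illustrates only the projection step, not the face tracker itself.

```python
# Sketch: project 3D face-mesh vertices into the color image so they can be
# drawn over the video frame. Pose and intrinsics are illustrative inputs.
import numpy as np

def project_mesh(vertices, R, t, fx=525.0, fy=525.0, cx=319.5, cy=239.5):
    """vertices: Nx3 mesh points in the face model frame.
    R: 3x3 rotation, t: length-3 translation (head pose in the camera frame).
    Returns Nx2 pixel coordinates."""
    cam = vertices @ R.T + t               # transform into the camera frame
    u = fx * cam[:, 0] / cam[:, 2] + cx    # perspective division + intrinsics
    v = fy * cam[:, 1] / cam[:, 2] + cy
    return np.stack([u, v], axis=-1)
```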

The Kinect sensor offers an unlimited number of opportunities for old and new applications. This article only gives a taste of what is possible. Thus far, additional research areas include hand-gesture recognition, human-activity recognition, body biometrics estimation (such as weight, gender, or height), 3D surface reconstruction, and healthcare applications.

1. Z. Ren, J. Yuan, and Z. Zhang, "Robust Hand Gesture Recognition Based on Finger-Earth Mover's Distance with a Commodity Depth Camera," Proc. 19th ACM Int'l Conf. Multimedia (ACM MM), ACM Press, 2011, pp. 1093-1096.

2. W. Li, Z. Zhang, and Z. Liu, "Action Recognition Based on a Bag of 3D Points," Proc. IEEE Int'l Workshop on CVPR for Human Communicative Behavior Analysis (CVPR4HB), IEEE CS Press, 2010, pp. 9-14.

3. C. Velardo and J.-L. Dugelay, "Real Time Extraction of Body Soft Biometric from 3D Videos," Proc. ACM Int'l Conf. Multimedia (ACM MM), ACM Press, 2011, pp. 781-782.

4. S. Izadi et al., "KinectFusion: Real-Time Dynamic 3D Surface Reconstruction and Interaction," Proc. ACM SIGGRAPH, 2011.