Mobile Image Matching - Towards Mobile Augmented Reality
Bernd Girod
Stanford University
MPI Distinguished Lecture Series
Bernd Girod is Professor of Electrical Engineering and (by courtesy)
Computer Science in the Information Systems Laboratory of Stanford
University, California. He was Chaired Professor of
Telecommunications in the Electrical Engineering Department of the
University of Erlangen-Nuremberg until 1999. His research interests
are in the areas of video compression, networked media systems, and
image databases. He has published over 400 conference and journal
papers, as well as 5 books. Professor Girod has been involved in
several startup ventures, among them Polycom (Nasdaq:PLCM), Vivo
Software, 8x8 (Nasdaq: EGHT), and RealNetworks (Nasdaq: RNWK). He
received the Engineering Doctorate from University of Hannover,
Germany, and an M.S. Degree from Georgia Institute of Technology.
Prof. Girod is a Fellow of the IEEE and of EURASIP and a member of the
German National Academy of Sciences. He received the 2002 EURASIP Best
Paper Award, the 2004 EURASIP Technical Achievement Award, and the
2007 IEEE Multimedia Communication Best Paper Award.
Handheld mobile devices, such as camera phones or PDAs, are expected
to become ubiquitous platforms for visual search and mobile augmented
reality applications. For mobile image matching, a visual data base is
typically stored at a server in the network. Hence, for a visual
comparison, information must be either uploaded from the mobile to the
server, or downloaded from the server to the mobile. With relatively
slow wireless links, the response time of the system critically
depends on how much information must be transferred in both directions.
We review recent advances in mobile matching, using a "bag-of-visual-
words" approach with robust feature descriptors, and show that
dramatic speed-ups are possible by considering recognition and
compression jointly.
We will use real-time implementations for different example
applications, such as recognition of landmarks or CD cover, to show
the benefit from image processing on the phone, the server, and/or both.