MPI-INF Logo
Campus Event Calendar

Event Entry

New for: D2, D3

What and Who

Large-Scale Matrix Factorization

Rainer Gemulla
Max-Planck-Institut für Informatik - D5
Joint MPI-INF/MPI-SWS Lecture Series
AG 1, AG 2, AG 3, AG 4, AG 5, SWS, RG1, MMCI  
MPI Audience
English

Date, Time and Location

Wednesday, 6 April 2011
12:15
60 Minutes
E1 4
024
Saarbrücken

Abstract

Low-rank matrix factorization is an effective tool for analysis of
``dyadic data,'' which aims at discovering and capturing the
interactions between two entities. Successful applications include topic
detection and keyword search (where the corresponding entities are
documents and terms), news personalization (users and stories), and
recommendation systems (users and items). I will talk about a novel
algorithm to approximately factor large matrices with millions of rows,
millions of columns, and billions of nonzero elements. Our approach
rests on stochastic gradient descent (SGD), an iterative stochastic
optimization algorithm; the idea is to exploit the special structure of
the matrix factorization problem to develop a new ``stratified'' SGD
variant that can be fully distributed and run on web-scale datasets
using MapReduce. The resulting distributed SGD factorization algorithm,
called DSGD, handles a wide variety of matrix factorizations, converges
significantly faster than alternative algorithms, and has better
scalability properties. My talk covers applications, algorithmic
aspects, and some experimental results.

Contact

Jennifer Mueller
900
--email hidden
passcode not visible
logged in users only

Jennifer Müller, 03/30/2011 15:07
Jennifer Müller, 03/29/2011 15:09
Anna-Lisa Overhoff, 03/16/2011 10:17 -- Created document.