Campus Event Calendar: Sanjar Karaev (02/10/2014 in E1 4/024)

Campus Event Calendar

Campus Event Calendar:
- All Upcoming:
  - only for D1
  - only for D2
  - only for INET
  - only for D4
  - only for D5
  - only for D6
  - only for RG1
  - Mailing Lists
  - by Speaker
  - by Type
  - by Category
  - by Title
  - Calendar
  - RSS Feed
- History of Events:

Event Entry

What and Who

Matrix Factorization over Max-Times Algebra for Data Mining

Sanjar Karaev

Fachrichtung Informatik - Saarbrücken

PhD Application Talk

Masters Student

AG 1, AG 2, AG 3, AG 4, AG 5, SWS, RG1, MMCI

Public Audience

English

Note: We use this to send email in the morning.

Date, Time and Location

Monday, 10 February 2014

10:50

90 Minutes

E1 4

024

Saarbrücken

Abstract

Decomposing a given matrix into two factor matrices is a frequently used technique in data mining for uncovering underlying latent patterns in the data. Unlike in pure mathematics, the emphasis is put on obtaining results that are interpretable, rather than necessarily having a small reconstruction error. One approach to increase interpretability is to pose constraints on the factors. For example, they might be restricted to the same type as the original matrix.

Among many different ways one can define matrix multiplication, the standard and Boolean cases are the most thoroughly studied. In this work we introduce matrix multiplication over max-times algebra, which is the set of nonnegative real numbers endowed with standard multiplication, but with addition being replaced by the maximization operation. The main objective of the thesis is to develop efficient factorization algorithms for this newly defined matrix multiplication.

We propose several methods for solving this problem. The choice of a particular algorithm depends on the nature of the data, in particular its density. The sparser the matrices, the closer their max-multiplication is to the standard matrix multiplication, and for extremely sparse data even an algorithm for nonnegative matrix factorization can be quite good. However, for other cases algorithms that are specifically tailored for max-times data are required.
The max matrix factorization problem is hard, and solving the problem precisely for large input sizes is infeasible, which means that approximate methods should be sought. It turned out in the experiments that the most promising approach is to relax the objective in such way that it becomes differentiable, after which convex optimization algorithms (e.g. gradient descent) can be used. This produces relatively good results when the max-times structure is present in the data.

Contact

--email hidden

System used:

Meeting URL:

Meeting ID:

Passcode:

passcode not visible

Code Visible for:

logged in users only

Aaron Alsancak, 02/06/2014 10:49
Aaron Alsancak, 02/06/2014 10:36 -- Created document.

Imprint / Impressum | Data Protection / Datenschutzhinweis