Campus Event Calendar: Dr. Pei Li (04/09/2015 in E1 4/433)

Event Entry

What and Who

Series Discovery with Missing and Erroneous Values

Dr. Pei Li

University of Zurich

AG5 Talk

http://www.ifi.uzh.ch/dbtg/Staff/peili.html

AG 5

AG Audience

English

Date, Time and Location

Thursday, 9 April 2015

10:00

60 Minutes

E1 4

433

Saarbrücken

Abstract

A series of real-world data, such as a series of music records, is often generated with order
dependency semantics; for example, music records in a series with larger catalog numbers are
usually released in later years. In this talk, I will discuss how order dependencies can be
exploited to discover series as well as to repair missing and erroneous values of ordered
attributes in a dataset. The problem is challenging in two respects. First, order
dependency mechanisms are unknown a priori and can vary among series. For example, a series can
assign catalog numbers to records in either increasing or decreasing order over time. Second,
order dependencies are often not satisfied by every record pair in a real-world series. There can
be a substantial number of records that slightly violate an order dependency. Existing ordering
integrity constraints would treat such records as exceptions and blindly label them as
outliers. These two factors make our goal of "one shot, two kills" (series discovery together
with error detection) extremely challenging.
To make order dependencies applicable to real-world series, we propose the notion of longest
monotonic bands, which characterize series while distinguishing slight violations of
order dependencies from local outliers in a series. We also provide an efficient framework for
discovering series that are approximated by longest monotonic bands. In this talk, I will present
analyses of our proposed algorithms and show the effectiveness of our framework with preliminary
results on real-world datasets.
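
As a rough illustration of the order-dependency idea, here is a minimal sketch. It is not the speaker's algorithm: the abstract does not define longest monotonic bands, so the sketch uses a plain longest monotonic subsequence as a simplified stand-in. The function names and the sample records are hypothetical.

```python
from bisect import bisect_right


def longest_nondecreasing(values):
    """Return the indices of one longest non-decreasing subsequence."""
    tails = []      # tails[k]: index ending the best subsequence of length k + 1
    tail_vals = []  # values at those indices, kept sorted for bisect
    prev = [-1] * len(values)
    for i, v in enumerate(values):
        k = bisect_right(tail_vals, v)
        if k == len(tails):
            tails.append(i)
            tail_vals.append(v)
        else:
            tails[k] = i
            tail_vals[k] = v
        prev[i] = tails[k - 1] if k > 0 else -1
    # Walk the predecessor links back from the end of the longest subsequence.
    out = []
    i = tails[-1] if tails else -1
    while i != -1:
        out.append(i)
        i = prev[i]
    return out[::-1]


def split_series(records):
    """Split (catalog_number, release_year) pairs into consistent and flagged.

    Assumption: the order dependency is "years are monotonic in catalog
    number", with the direction (increasing or decreasing) unknown up front.
    """
    ordered = sorted(records)
    years = [year for _, year in ordered]
    inc = longest_nondecreasing(years)
    dec = longest_nondecreasing([-year for year in years])
    if len(inc) >= len(dec):
        direction, keep = "increasing", inc
    else:
        direction, keep = "decreasing", dec
    kept = set(keep)
    consistent = [ordered[i] for i in keep]
    flagged = [ordered[i] for i in range(len(ordered)) if i not in kept]
    return direction, consistent, flagged


# Hypothetical records: catalog 103 carries an implausible release year.
records = [(101, 1990), (102, 1991), (103, 1975),
           (104, 1992), (105, 1992), (106, 1994)]
direction, consistent, flagged = split_series(records)
print(direction, flagged)
```

Note that this stand-in flags every record that breaks monotonicity, which is the blunt behavior the abstract attributes to existing ordering constraints; the bands proposed in the talk are meant to tolerate slight violations and flag only local outliers.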

Contact

Petra Schaaf

5000

Petra Schaaf, 04/08/2015 10:13 -- Created document.