Campus Event Calendar: Razvan Belet (11/12/2012 in E1 4/024)

Campus Event Calendar

Campus Event Calendar:
- All Upcoming:
  - only for D1
  - only for D2
  - only for INET
  - only for D4
  - only for D5
  - only for D6
  - only for RG1
  - Mailing Lists
  - by Speaker
  - by Type
  - by Category
  - by Title
  - Calendar
  - RSS Feed
- History of Events:

Event Entry

What and Who

Leveraging Independence and Locality for Random Forests in a Distributed Environment

Razvan Belet

International Max Planck Research School for Computer Science - IMPRS

IMPRS Research Seminar

AG 1, AG 2, AG 3, AG 4, AG 5, SWS, RG1, MMCI

Public Audience

English

Note: We use this to send email in the morning.

Date, Time and Location

Monday, 12 November 2012

12:05

60 Minutes

E1 4

024

Saarbrücken

Abstract

With the emergence of big data, inducting regression trees on very large data sets became a common data mining task. Even though centralized algorithms for computing ensembles of Classification/Regression trees are a well studied machine learning/data
mining problem, their distributed versions still raise scalability, efficiency and accuracy issues.

Most state of the art tree learning algorithms require data to reside in memory on a single machine. Adopting this approach for trees on big data is not feasible as the limited resources provided by only one machine lead to scalability problems.
While more scalable implementations of tree learning algorithms have been proposed, they typically require specialized parallel computing architectures rendering those algorithms complex and error-prone.
In this presentation I will introduce two approaches to computing ensembles of regression trees on very large training data sets using the MapReduce framework as an underlying tool. The first approach, inspired by a tree inducting system proposed by
GOOGLE, employs the entire MapReduce cluster to parallely and fully distributedly learn tree ensembles. The second approach exploits locality and independence in the tree learning process.

Contact

--email hidden

System used:

Meeting URL:

Meeting ID:

Passcode:

passcode not visible

Code Visible for:

logged in users only

Marc Schmitt, 11/07/2012 14:02 -- Created document.

Imprint / Impressum | Data Protection / Datenschutzhinweis