Campus Event Calendar: R. Ravi (06/21/2011 in E1 4/024)

Campus Event Calendar

Campus Event Calendar:
- All Upcoming:
  - only for D1
  - only for D2
  - only for INET
  - only for D4
  - only for D5
  - only for D6
  - only for RG1
  - Mailing Lists
  - by Speaker
  - by Type
  - by Category
  - by Title
  - Calendar
  - RSS Feed
- History of Events:

Event Entry

New for: D3

What and Who

Approximation Algorithms for Correlated Knaspacks and Non-Martingale Bandits

R. Ravi

CMU

Talk

AG 1, AG 3, AG 5, SWS, AG 4, RG1, MMCI

AG Audience

English

Note: We use this to send email in the morning.

Date, Time and Location

Tuesday, 21 June 2011

13:00

45 Minutes

E1 4

024

Saarbrücken

Abstract

In the stochastic knapsack problem, we are given a knapsack with

size B, and a set of jobs whose sizes and rewards are drawn from a
known probability distribution. However, the only way to know the
actual size and reward is to schedule the job---when it completes,
we get to know these values. How should we schedule jobs to
maximize the expected total reward? We know constant-factor
approximations for this problem when we assume that rewards and
sizes are independent random variables, and that we cannot
prematurely cancel jobs after we schedule them. What can we say
when either or both of these assumptions are dropped?

Not only is the stochastic knapsack problem of interest in its own
right, but techniques developed for it are applicable to other
stochastic packing problems. Indeed, ideas for this problem have
been useful for budgeted learning problems, where one is given
several arms which evolve in a specified stochastic fashion with
each pull, and the goal is to pull the arms a total of B times to
maximize the reward obtained. Much recent work on this problem
focus on the case when the evolution of the arms follows a
martingale, i.e., when the expected reward from the future is the
same as the reward at the current state. However, what can we say
when the rewards do not form a martingale?

We give constant-factor approximation algorithms for the stochastic
knapsack problem with correlations and cancellations, and also for
some budgeted learning problems where the martingale condition is
not satisfied, using similar ideas. Indeed, we can show that
previously proposed linear programming relaxations for these
problems have large integrality gaps. We propose new time-indexed
LP relaxations; using a decomposition and ``shifting'' approach, we
convert these fractional solutions to distributions over
strategies, and then use the LP values and the time ordering
information from these strategies to devise a randomized scheduling
algorithm. We hope our LP formulation and decomposition methods may
provide a new way to address other correlated bandit problems with
more general contexts.

This is joint work with Anupam Gupta, Ravishankar Krishnaswamy and
Marco Molinaro, all from CMU. The paper is available at
http://arxiv.org/abs/1102.3749

Contact

Kurt Mehlhorn

--email hidden

System used:

Meeting URL:

Meeting ID:

Passcode:

passcode not visible

Code Visible for:

logged in users only

Kurt Mehlhorn, 04/19/2011 09:17
Kurt Mehlhorn, 02/01/2011 10:40 -- Created document.

Imprint / Impressum | Data Protection / Datenschutzhinweis