Extracting information from the Web remains a critical component in knowledge harvesting systems for building curated knowledge structures,
such as Knowledge Bases (KBs), and satisfying evolving user needs, which require operations such as aggregation and reasoning. Estimating the
cardinality of a set of entities on the Web to fulfill the information need of questions of the form “how many ..?” is a challenging task.
While, intuitively, cardinality can be estimated by explicitly enumerating the constituent entities, this is usually not possible due
to the low recall of entities on the Web. In this dissertation we present our contributions towards retrieving and estimating
cardinalities of entity sets on the Web.