Sampling bias

Panzeri, S; Magri, C; Carraro, L

doi:10.4249/scholarpedia.4258

Panzeri, S., Magri, C., & Carraro, L. (2008). Sampling bias. Scholarpedia, 3(9), 4258. doi:10.4249/scholarpedia.4258.

Item is Released

Basic data

Record permalink: https://hdl.handle.net/11858/00-001M-0000-0013-C6AF-2
Version permalink: https://hdl.handle.net/21.11116/0000-0003-2CCC-7

Genre: Journal article

Files

External references

External reference:
http://www.scholarpedia.org/article/Sampling_bias (publisher version), open access status unknown

Description:
-

OA status:

Creators

Creators:
Panzeri, S¹, Author
Magri, C¹, Author
Carraro, L, Author

Affiliations:
¹ External Organizations, ou_persistent22

Content

Keywords: -

Abstract: Sampling bias means that the samples of a stochastic variable collected to determine its distribution are selected incorrectly and, for non-random reasons, do not represent the true distribution. Consider a specific example: we might want to predict the outcome of a presidential election by means of an opinion poll. Asking 1000 voters about their voting intentions can give a fairly accurate prediction of the likely winner, but only if our sample of 1000 voters is 'representative' of the electorate as a whole (i.e. unbiased). If we poll only 1000 white middle-class college students, then the views of many important parts of the electorate (ethnic minorities, elderly people, blue-collar workers) are likely to be underrepresented in the sample, and our ability to predict the outcome of the election from that sample is reduced.
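The opinion-poll example can be made concrete with a small simulation. The sketch below is illustrative only and is not taken from the article: the group names, population shares, and support probabilities in `ELECTORATE`, as well as the helper names `true_support` and `run_poll`, are invented assumptions. It compares a poll drawn from the whole electorate with one drawn from a single group.

```python
import random

# Hypothetical electorate: group -> (share of voters, probability of backing candidate A).
# These numbers are invented for illustration only.
ELECTORATE = {
    "college_students": (0.10, 0.70),
    "blue_collar_workers": (0.35, 0.40),
    "elderly_people": (0.25, 0.35),
    "everyone_else": (0.30, 0.50),
}


def true_support(groups):
    """Population-wide support for candidate A."""
    return sum(share * p_a for share, p_a in groups.values())


def run_poll(rng, groups, sample_size, reachable=None):
    """Poll `sample_size` voters and return the fraction backing candidate A.

    `reachable` lists the groups the pollster actually samples from;
    None means every group is sampled in proportion to its share.
    """
    names = list(groups) if reachable is None else list(reachable)
    weights = [groups[name][0] for name in names]
    backing_a = 0
    for _ in range(sample_size):
        # Pick the voter's group, then draw that voter's preference.
        group = rng.choices(names, weights=weights)[0]
        if rng.random() < groups[group][1]:
            backing_a += 1
    return backing_a / sample_size


rng = random.Random(0)  # fixed seed so the run is repeatable
representative = run_poll(rng, ELECTORATE, 1000)
students_only = run_poll(rng, ELECTORATE, 1000, reachable=["college_students"])

print(f"true support:        {true_support(ELECTORATE):.3f}")
print(f"representative poll: {representative:.3f}")
print(f"students-only poll:  {students_only:.3f}")
```

Under these assumed numbers the representative poll should land close to the population-wide support, while the students-only poll tracks the students' own preference instead, whatever the sample size.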

In an unbiased sample, differences between the samples taken from a random variable and its true distribution, or differences between the samples of units from a population and the entire population they represent, should result only from chance. If the differences are not due to chance alone, then there is a sampling bias. Sampling bias often arises because certain values of the variable are systematically under-represented or over-represented with respect to the true distribution of the variable (as in our opinion poll example above). Because of its consistent nature, sampling bias leads to a systematic distortion of the estimate of the sampled probability distribution. This distortion cannot be eliminated by increasing the number of data samples and must be corrected for by means of appropriate techniques, some of which are discussed below. In other words, polling an additional 1000 white college students will not improve the predictive power of our opinion poll, but polling 1000 individuals chosen at random from the electoral roll would. A biased sample can therefore cause problems in the measurement of probability functionals (e.g., the variance or the entropy of the distribution), since any statistic computed from that sample has the potential to be consistently erroneous.
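To illustrate why collecting more data does not remove the distortion, the following sketch estimates the entropy of a four-valued variable from samples of increasing size. It is an assumption-laden toy and not the authors' method: `TRUE_PROBS`, `BIASED_PROBS`, and the helpers `plugin_entropy` and `draw` are hypothetical, and the plug-in (frequency-based) estimator is simply one convenient choice of functional.

```python
import math
import random
from collections import Counter

# Assumed true distribution: four equally likely values (entropy = 2 bits).
TRUE_PROBS = [0.25, 0.25, 0.25, 0.25]
# Assumed biased sampling scheme: values 2 and 3 are systematically under-represented.
BIASED_PROBS = [0.40, 0.40, 0.15, 0.05]


def plugin_entropy(samples):
    """Entropy in bits of the empirical frequencies of `samples`."""
    n = len(samples)
    counts = Counter(samples)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def draw(rng, probs, n):
    """Draw n values from {0, ..., len(probs) - 1} with the given probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=n)


rng = random.Random(0)  # fixed seed so the run is repeatable
results = {}
for n in (100, 10_000, 100_000):
    h_random = plugin_entropy(draw(rng, TRUE_PROBS, n))
    h_biased = plugin_entropy(draw(rng, BIASED_PROBS, n))
    results[n] = (h_random, h_biased)
    print(f"n={n:>7}  random sample: {h_random:.3f} bits  biased sample: {h_biased:.3f} bits")
```

With these assumed probabilities the estimate from the randomly drawn sample approaches the true 2 bits as n grows, whereas the estimate from the biased sample settles near 1.68 bits, the entropy of the biased scheme itself. A larger sample only makes the wrong answer more precise.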

Details

Language(s):

Date: Published: 2008-10

Publication status: Published

Pages: -

Place, publisher, edition: -

Table of contents: -

Review type: -

Identifiers: DOI: 10.4249/scholarpedia.4258
BibTex Citekey: PanzeriMC2008

Degree type: -

Source 1

Title: Scholarpedia

Source genre: Journal

Creators:

Affiliations:

Place, publisher, edition: -

Pages: - Volume / Issue: 3 (9) Article number: - Start / End page: 4258 Identifier: -
