Skip to main content
eScholarship
Open Access Publications from the University of California

UC Berkeley

UC Berkeley Previously Published Works bannerUC Berkeley

On the Number of Siblings and p-th Cousins in a Large Population Sample

Published Web Location

https://doi.org/10.1101/145599Creative Commons 'BY-NC-ND' version 4.0 license
Abstract

1. The number of individuals in a random sample with close relatives in the sample is a quantity of interest when designing Genome Wide Association Studies (GWAS) and other cohort based genetic, and non-genetic, studies. In this paper, we develop expressions for the distribution and expectation of the number of p -th cousins in a sample from a population of size N under two diploid Wright-Fisher models. We also develop simple asymptotic expressions for large values of N . For example, the expected proportion of individuals with at least one p -th cousin in a sample of K individuals, for a diploid dioecious Wright-Fisher model, is approximately 1 − e −(2 2 p −1 ) K/N . Our results show that a substantial fraction of individuals in the sample will have at least a second cousin if the sampling fraction ( K/N ) is on the order of 10 −2 . This confirms that, for large cohort samples, relatedness among individuals cannot easily be ignored.

Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View