bpb's tags:


EXPORT LIST RSS ?
bpb's bookmarks matching tag snp
 
Number of articles per page:
10 | 25 | 50 | 100
 
Correcting for ascertainment biases when analyzing SNP data: Applications to the estimation of linkage disequilibrium
Correcting for ascertainment biases when analyzing SNP data applications to the estimation of linkage disequilibrium
Theoretical Population Biology 63 (3), 245 (2003)
As large-scale sequencing efforts turn from single genome sequencing to polymorphism discovery, single nucleotide polymorphisms (SNPs) are becoming an increasingly important class of population genetic data. But because of the ascertainment biases introduced by many methods of SNP discovery, most SNP data cannot be analyzed using classical population genetic methods. Statistical methods must instead be developed that can explicitly take into account each method of SNP discovery. Here we review some of the current methods for analyzing SNPs and derive sampling distributions for single SNPs and pairs of SNPs for some common SNP discovery schemes. We also show that the ascertainment scheme has a large effect on the estimation of linkage disequilibrium and recombination, and describe some methods of correcting for ascertainment biases when estimating recombination rates from SNP data. \copyright 2003 Elsevier Science (USA). All rights reserved.
 
The Diploid Genome Sequence of an Individual Human
Samuel Levy et al.
PLoS Biology 5 (10), e254 (01 Oct 2007)
Presented here is a genome sequence of an individual human. It was produced from ∼32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2–206 bp), 292,102 heterozygous insertion/deletion events (indels)(1–571 bp), 559,473 homozygous indels (1–82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.
Posted by bpb and 21 others to snp indel human genome diploid on Wed Apr 30 2008 at 11:29 UTC | info | related
 
Biological implications of SNPs in signal peptide domains of human proteins.
Proteins Structure Function and Bioinformatics 70 (2), 394 (2008)
Proteins destined for secretion or membrane compartments possess signal peptides for insertion into the membrane. The signal peptide is therefore critical for localization and function of cell surface receptors and ligands that mediate cell-cell communication. About 4% of all human proteins listed in UniProt database have signal peptide domains in their N terminals. A comprehensive literature survey was performed to retrieve functional and disease associated genetic variants in the signal peptide domains of human proteins. In 21 human proteins we have identified 26 disease associated mutations within their signal peptide domains, 14 mutations of which have been experimentally shown to impair the signal peptide function and thus influence protein transportation. We took advantage of SignalP 3.0 predictions to characterize the signal peptide prediction score differences between the mutant and the wild-type alleles of each mutation, as well as 189 previously uncharacterized single nucleotide polymorphisms (SNPs) found to be located in the signal peptide domains of 165 human proteins. Comparisons of signal peptide prediction outcomes of mutations and SNPs, have implicated SNPs potentially impacting the signal peptide function, and thus the cellular localization of the human proteins. The majority of the top candidate proteins represented membrane and secreted proteins that are associated with molecular transport, cell signaling and cell to cell interaction processes of the cell. This is the first study that systematically characterizes genetic variation occurring in the signal peptides of all human proteins. This study represents a useful strategy for prioritization of SNPs occurring within the signal peptide domains of human proteins. Functional evaluation of candidates identified herein may reveal effects on major cellular processes including immune cell function, cell recognition and adhesion, and signal transduction. Proteins 2007. (c) 2007 Wiley-Liss, Inc.
Posted by bpb to snp signal-peptide Protein human on Wed Apr 30 2008 at 11:29 UTC | info | related
 
A Map of Recent Positive Selection in the Human Genome.
A map of recent positive selection in the human genome
PLoS Biology 4 (3), e72 (01 Mar 2006)
The identification of signals of very recent positive selection provides information about the adaptation of modern humans to local conditions. We report here on a genome-wide scan for signals of very recent positive selection in favor of variants that have not yet reached fixation. We describe a new analytical method for scanning single nucleotide polymorphism (SNP) data for signals of recent selection, and apply this to data from the International HapMap Project. In all three continental groups we find widespread signals of recent positive selection. Most signals are region-specific, though a significant excess are shared across groups. Contrary to some earlier low resolution studies that suggested a paucity of recent selection in sub-Saharan Africans, we find that by some measures our strongest signals of selection are from the Yoruba population. Finally, since these signals indicate the existence of genetic variants that have substantially different fitnesses, they must indicate loci that are the source of significant phenotypic variation. Though the relevant phenotypes are generally not known, such loci should be of particular interest in mapping studies of complex traits. For this purpose we have developed a set of SNPs that can be used to tag the strongest approximately 250 signals of recent selection in each population.
 
Haplotype homozygosity and derived alleles in the human genome.
Andrew E Fry et al.
American journal of human genetics 78 (6), 1053-9 (Jun 2006)
Haplotype-based techniques are being used to estimate the relative age of alleles?particularly in screening loci for signals of recent positive selection?but does this approach capture even coarse age differences? Using simulations and empirical data from the International HapMap Project, we show that a simple pairwise metric of haplotype homozygosity gives significantly higher mean values for human single-nucleotide-polymorphism alleles that appear to be derived than for those that appear to be ancestral, as determined by comparison with the chimpanzee genome. Our results support the use of haplotype-based techniques, such as extended haplotypic homozygosity, to assess the age of alleles.
 
Bayesian estimates of linkage disequilibrium
BMC Genetics 8 (1), 36 (2007)

<< Prev 0      Showing entries 1 to 6 of 6 total      Next 0 >>