<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-19517056</id><updated>2011-04-21T23:28:28.661Z</updated><category term='lundy'/><title type='text'>slack---line</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>24</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-19517056.post-67339480203349247</id><published>2007-09-18T11:56:00.000Z</published><updated>2007-09-18T12:00:46.391Z</updated><category scheme='http://www.blogger.com/atom/ns#' term='lundy'/><title type='text'>Better late than never...</title><content type='html'>&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-67339480203349247?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/67339480203349247/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=67339480203349247' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/67339480203349247'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/67339480203349247'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2007/09/better-late-than-never.html' title='Better late than never...'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-3384501673809955965</id><published>2007-04-18T11:39:00.000Z</published><updated>2007-05-23T09:09:31.535Z</updated><title type='text'>Perl scripts &amp; LaTeX</title><content type='html'>Not really related to any news, but I've written a couple of perl scripts for formatting the output from two tagging programs.  In both cases the scripts take the standard output and generate &lt;a href="http://www.latex-project.org/" target="_blank"&gt;L&lt;sub&gt;a&lt;/sub&gt;T&lt;sub&gt;e&lt;/sub&gt;X&lt;/a&gt; formatted tables for subsequent use.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.broad.mit.edu/mpg/tagger/" target="_blank"&gt;Tagger&lt;/a&gt; (de Bakker &lt;i&gt;et al&lt;/i&gt; 2005) is implemented in the program &lt;a href="http://www.broad.mit.edu/mpg/haploview/" target="_blank"&gt;Haploview&lt;/a&gt;, and uses pair-wise LD to tag regions.  The script for re-formatting the output is written in perl and is available &lt;a href="http://slack.ser.man.ac.uk/files/tagger_tidy.pl"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www2.blogger.com/post-edit.g?blogID=19517056&amp;amp;postID=3384501673809955965" target="_blank"&gt;ldSelect&lt;/a&gt; (Carlson &lt;i&gt;et al&lt;/i&gt; 2004) is an alternative method of tagging SNPs, but also based on LD.  The output format from this is (in my opinion) a little awkward as the output uses the base-pair positions (mainly because the input file format is Prettybase).  Thus in addition to the output file this script requires the user to provide a tab-delimited text file of the markers and their base-pair positions and will convert the base-pair positions of markers to their rs numbers in the resulting latex table.  The script is again written in perl and is available &lt;a href="http://slack.ser.man.ac.uk/files/ldselect_tidy.pl"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;h3&gt;References&lt;/h3&gt;&lt;br /&gt;&lt;br /&gt;Carlson, C. S., Eberle, M. A., Rieder, M. J., Yi, Q., Kruglyak, L., Nickerson, D. A. (2004)  Selecting a maximally informative set of single- nucleotide polymorphisms for association analyses using linkage disequilibrium &lt;a href="http://www.journals.uchicago.edu/AJHG/index.html" target="_blank"&gt;American Journal of Human Genetics&lt;/a&gt; &lt;b&gt;74&lt;/b&gt;:106-120 &lt;a href="http://www.broad.mit.edu/mpg/haploview/" target="_blank"&gt;PDF on PubMed&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;de Bakker, P. I. Yelensky, R. Pe'er, I. Gabriel, S. B. Daly, M. J., Altshuler, D. (2005) Efficiency and power in genetic association studies. &lt;a href="http://www.nature.com/ng/" target="_blank"&gt;&lt;i&gt;Nature Genetics&lt;/i&gt;&lt;/a&gt; &lt;b&gt;37&lt;/b&gt;:1217-1223&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-3384501673809955965?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/3384501673809955965/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=3384501673809955965' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/3384501673809955965'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/3384501673809955965'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2007/04/not-really-related-to-any-news-but-ive.html' title='Perl scripts &amp; LaTeX'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-116830698472738949</id><published>2007-01-09T01:42:00.000Z</published><updated>2007-01-09T01:43:04.740Z</updated><title type='text'>To test or not to test that is the question</title><content type='html'>&lt;p&gt;A recent paper by Zou &amp; Donner (2006) questions whether testing for Hardy-Weinberg equilibrium (HW-eqm)case-control association studies is a viable strategy.&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;The main point they make is that genotyping error is unlikely to be detected by testing for departure from HW-eqm.  This is for a number of reasons, the &lt;a href="http://en.wikipedia.org/wiki/Hardy-Weinberg_principle#Assumptions" target="_blank"&gt;assumptions underlying HW-eqm&lt;/a&gt;, and secondly (perhaps more importantly) that by performing a two stage analysis of screening for deviations for HW-eqm and then only taking forward markers which do not deviate inflates the Type 1 Error rate, because the p-value can not be interpreted as evidence that alleles are independent (i.e. HW-eqm holds).&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;This is of course appealing as it reduces the computational burden (particuarly when performing whole genome screens by association where multiple testing becomes a big problem), but also because a large proportion of associations are seen where loci do deviate from HW-eqm.&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;The authors propose a new test for adjusted &amp;chi;&lt;sup&gt;2&lt;/sup&gt; test based on the difference in variance of the estimated allele frequencies in cases and controls which essentially is essentially the same as a Cochran-Armitage trend test (Sassieni, 1977).  The power and performance of this test is discussed in a upcoming paper in &lt;i&gt;Annals of Human Genetics&lt;/i&gt; (Ahn &lt;i&gt;et al&lt;/i&gt; 2007).&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;So the upshot of all of this is that as a first pass screen your probably better of using a robust test rather than worrying about deviations from Hardy-Weinberg equilibrium.&lt;/p&gt;&lt;br /&gt;&lt;h2&gt;References and Links&lt;/h2&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;br /&gt;  &lt;li&gt;&lt;p&gt;Zou GY, Donnet A (2006) The Merits of Testing Hardy-Weinberg Equilibrium in the Analysis of Unmatched Case-Control Data: A Cautionary Note &lt;a href="http://www.blackwell-synergy.com/toc/ahg/70/6" target="_blank"&gt;&lt;i&gt;Annals of Human Genetics&lt;/i&gt; &lt;b&gt;70&lt;/b&gt;:923-933&lt;/a&gt;&lt;/p&gt;&lt;br /&gt;  &lt;li&gt;&lt;p&gt;Ahn K, Haynes C, Kim W, Fleur RS, Gordon D, Finch SJ (2007) The Effects of SNP Genotyping Errors on the Power of the Cochran-Armitage Linear Trend Test for Case/Control Association Studies. &lt;a href="" target="_blank"&gt;&lt;i&gt;Annals of Human Genetics&lt;/i&gt; &lt;b&gt;AOP&lt;/b&gt;:&lt;/a&gt;&lt;/p&gt;&lt;br /&gt;  &lt;li&gt;&lt;p&gt;Sassieni PD (1997) From genotype to genes: doubling the sample size &lt;a href="http://www.jstor.org/cgi-bin/jstor/printpage/0006341x/di009406/00p0055g/0.pdf?backcontext=table-of-contents&amp;dowhat=Acrobat&amp;config=jstor&amp;userID=825fc2c5@uwa.edu.au/01cce4405d00501b61540&amp;0.pdf" target="_blank"&gt;&lt;b&gt;53&lt;/b&gt;:1253-1261&lt;/a&gt;&lt;/p&gt;&lt;br /&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-116830698472738949?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/116830698472738949/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=116830698472738949' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/116830698472738949'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/116830698472738949'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2007/01/to-test-or-not-to-test-that-is.html' title='To test or not to test that is the question'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-116764823391071518</id><published>2007-01-01T11:29:00.000Z</published><updated>2007-01-01T11:26:57.900Z</updated><title type='text'>Fast Genome Wide Association Analysis</title><content type='html'>&lt;p&gt;Recently a new R package that provides an interface to running &lt;a href="http://www.biostat.harvard.edu/~clange/default.htm" target="_blank"&gt;PBAT&lt;/a&gt; in parallel was published in &lt;a href="http://bioinformatics.oxfordjournals.org" target="_blank"&gt;&lt;i&gt;Bioinformatics&lt;/i&gt;&lt;a&gt;.The package is described in the Open Access article &lt;a href="http://bioinformatics.oxfordjournals.org/cgi/content/full/22/24/3103" target="_blank"&gt;here&lt;/a&gt;.&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;This is the first package that I've come across that provides an easy interface to analyse the massive numbers of data that are being generated by the new genotyping arrays such as Illumina's bead station and Affymetrix' SNP chip. Basically the data is split up and small chunks analysed on each processor, with a seperate instance of PBAT running on eahc processor. The graphical R front end allows users and easy way to select their analysis option and then handles splitting the data, sending off jobs and amalgamating the results when they are completed.&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;h2&gt;Refernces and Links&lt;/h2&gt;&lt;br /&gt;&lt;ul&gt;&lt;br /&gt; &lt;li&gt;&lt;a href="" target="_blank"&gt;Hoffmann T, Lange C (2006) P&lt;sup&gt;2&lt;/sup&gt;BAT: a massive parallel implementation of PBAT for genome-wide association studies in R. &lt;i&gt;Bioinformatics&lt;/i&gt;:&lt;b&gt;22&lt;/b&gt;:3103-3105&lt;/a&gt;&lt;br /&gt; &lt;li&gt;&lt;a href="http://www.biostat.harvard.edu/~clange/default.htm" target="_blank"&gt;PBAT&lt;/a&gt; &lt;li&gt;&lt;a href="http://cran.r-project.org/" target="_blank"&gt;CRAN&lt;/a&gt; - Download P&lt;sup&gt;2&lt;/sup&gt;BAT R-package (and many other R packages)&lt;br /&gt; &lt;li&gt;&lt;a href="http://www.r-project.org" target="_blank"&gt;R&lt;/a&gt;&lt;br /&gt; &lt;li&gt;&lt;a href="http://people.fas.harvard.edu/~tjhoffm/pbatR.html" target="_blank"&gt;http://people.fas.harvard.edu/~tjhoffm/pbatR.html&lt;/a&gt; - Detailed instructions on installing and using P&lt;sup&gt;2&lt;/sup&gt;BAT&lt;br /&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-116764823391071518?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/116764823391071518/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=116764823391071518' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/116764823391071518'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/116764823391071518'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2007/01/fast-genome-wide-association-analysis.html' title='Fast Genome Wide Association Analysis'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-116156862053680242</id><published>2006-10-23T01:57:00.000Z</published><updated>2006-10-23T01:57:00.546Z</updated><title type='text'></title><content type='html'>A few special issues of journals have focused on Statistical Genetics recently.&lt;br /&gt;&lt;br /&gt;The journal &lt;a href="www3.interscience.wiley.com/cgi-bin/jhome/2988/" target="_blank"&gt;Statistics in Medicine&lt;/a&gt; is celebrating its 25th Anniversary and recently published a special issue focusing on statistical genetic analyses.  A number of articles are published on topics from genome-wide analyses, estimation of genetic and environmental components in specific diseases, multiple testing and more.  The full contents can be read &lt;a href="http://www3.interscience.wiley.com/cgi-bin/jissue/112771048" target="_blank"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The October issue of &lt;a href="http://www.nature.com/nrg/" target="_blank"&gt;Nature Reviews Genetics&lt;/a&gt; has a series of interesting articles on statistical genetics.  Papers include a reviews of population genetics software, methods of analysing molecular genetic variation, and the analysis of relatedness.  There is also a thoughtful perspective paper on the need for longtitudinal analyses for investigating the aetiology of complex diseases.  The full contents can be read &lt;a href="http://www.nature.com/nrg/journal/v7/n10/index.html" target="_blank"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-116156862053680242?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/116156862053680242/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=116156862053680242' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/116156862053680242'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/116156862053680242'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/10/few-special-issues-of-journals-have.html' title=''/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-115742759125494281</id><published>2006-09-05T03:30:00.000Z</published><updated>2006-09-05T03:39:51.283Z</updated><title type='text'>Genetics Blogs....</title><content type='html'>A &lt;a href="http://www.nature.com/nrg/journal/v7/n9/full/nrg1957.html" target="_blank"&gt;Research Highlight&lt;/a&gt; in &lt;a href="http://www.nature.com/nrg/" target="_blank"&gt;Nature Reviews Genetics&lt;/a&gt; higlights the use of the much hyped Web2.0 (wiki's, blogs, social networking sites, interactive localised information etc.) by geneticist's and highlight's a few interesting sites (although manages to miss this one, prehaps that's a bit of motivation to actually get on and write more regularly about interesting developments!!!).&lt;br /&gt;&lt;br /&gt;The blogs listed are...&lt;br /&gt;&lt;br /&gt;&lt;a href="http://mendels-garden.blogspot.com/" target="_blank"&gt;Mendel's Garden&lt;/a&gt;&lt;br /&gt;&lt;a href="http://logic.itsc.cuhk.edu.hk/~s046149/cgi-bin/nph-proxy.cgi/000000A/http/blogs.nature.com/ng/freeassociation/" target="_blank"&gt;Free Association&lt;/a&gt;&lt;br /&gt;&lt;a href="http://scienceblogs.com/transcript/" target="_blank"&gt;Daily Transcript&lt;/a&gt;&lt;br /&gt;&lt;a href="http://genomics.weblog.glam.ac.uk/" target="_blank"&gt;Genomics Policy&lt;/a&gt;&lt;br /&gt;&lt;a href="http://scienceblogs.com/transcript/" target="_blank"&gt;Pahryngula&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;All a lot more topical than this little blog, but then the authors are probably more motivated to write than I am.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-115742759125494281?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/115742759125494281/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=115742759125494281' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/115742759125494281'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/115742759125494281'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/09/genetics-blogs.html' title='Genetics Blogs....'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-114605817477011932</id><published>2006-04-26T13:27:00.000Z</published><updated>2006-04-26T13:29:36.380Z</updated><title type='text'>Rub the lamp and make a wish....</title><content type='html'>A recent article by Allen-Brady , Wong &amp; Camp (2006) in &lt;a href="http://www.biomedcentral.com/bmcbioinformatics/" target="_blank"&gt;BMC Bioinformatics&lt;/a&gt; details a new program called &lt;a href="http://bioinformatics.med.utah.edu/PedGenie/index.html" target="_blank"&gt;PedGenie&lt;/a&gt; which provides a java (GUI) interface to performing association and transmission disequilibrium tests (TDT).&lt;br /&gt;&lt;br /&gt;Personally I have mixed feelings about GUI interfaces for analysis, firstly they mean that it is hard to make work reproducible, which is an essential part of any investigation (Gentleman 2005).  More importantly because people are often very familiar with point and click interfaces they can go ahead and perform their analysis witout necessarily understanding whether it is appropriate to perform their analysis, or even worse they'll perform every possible test they can and think that because they've found something "significant" they have found an important front. This may come acorss as elitist, but its really not meant to be, since when performing statistical analysis one needs to be very careful and precise about what you are doing.&lt;br /&gt;&lt;br /&gt;On the flip side GUI's provide a relatively easy interface to what can be a sometimes impenetrable field, since there is often a steep learning curve to becoming proficent in a given statistical package, OS or scripting language to the extent that one can perform their analysis in a reproducable manner.&lt;br /&gt;&lt;br /&gt;At the end of the day its about finding a balance between needing to getting things done, and hey, if it means theres one less person banging on my door needing their hand held to perform their analysis then its not all bad!&lt;br /&gt;&lt;br /&gt;&lt;h2&gt;References&lt;/h2&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;br /&gt;  &lt;li&gt;Allen-Brady K, Wong J, Camp NJ (2006) PedGenie: an analysis approach for genetic association testing in extended pedigrees and genealogies of arbitrary size &lt;a href="http://www.biomedcentral.com/1471-2105/7/209/abstract" target="_blank"&gt;BMC Bioinformatics&lt;/a&gt; &lt;b&gt;7&lt;/b&gt;:209&lt;br /&gt;  &lt;li&gt;Gentlemean R (2005) Reproducible Research: A Bioinformatics Case Study&lt;a href="http://www.bepress.com/sagmb/vol4/iss1/art2/" target="_blank"&gt;Statistical Applications in Genetics and Molecular Biology&lt;/a&gt; &lt;b&gt;4&lt;/b&gt;:2&lt;br /&gt;  &lt;li&gt;&lt;a href="" target="_blank"&gt;&lt;/a&gt; &lt;b&gt;&lt;/b&gt;:&lt;br /&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-114605817477011932?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/114605817477011932/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=114605817477011932' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114605817477011932'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114605817477011932'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/04/rub-lamp-and-make-wish.html' title='Rub the lamp and make a wish....'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-114484981015972185</id><published>2006-04-12T13:48:00.000Z</published><updated>2006-04-18T04:57:28.233Z</updated><title type='text'>(m)utterings....</title><content type='html'>A rather provocative article appeared in the &lt;a href="" target="_blank"&gt;European Journal of Human Genetics&lt;/a&gt; recently from Terwilliger &amp; Tero Hiekkalinna (2006) "An utter refutation of the 'Fundamental Theorem of the HapMap'".  The article provides a mathematical proof that the theorem behind the &lt;a href="http://www.hapmap.org/" target="_blank"&gt;HapMap&lt;/a&gt; is flawed.&lt;br /&gt;&lt;br /&gt;It is often assumed that the power to detect association is directly proportional to the correlation coefficent (&lt;i&gt;r&lt;sup&gt;2&lt;/sup&gt;&lt;/i&gt;) between the disease locus B and the marker locus A, but this is &lt;u&gt;not&lt;/u&gt; the case.  This makes intuitive sense, for the power of testing a proxy marker to be equal to that of testing the disease polymorphism itself, the allele frequencies must be identical.  In such cases &lt;i&gt;p&lt;sub&gt;a&lt;/sub&gt;&lt;/i&gt; =  &lt;i&gt;p&lt;sub&gt;b&lt;/sub&gt;&lt;/i&gt; (i.e. &lt;i&gt;r&lt;sup&gt;2&lt;/sup&gt;&lt;/i&gt; = 1), if the allele frequencies are not the same the there will be instances when the marker allele (&lt;i&gt;a&lt;/i&gt;) occurs with the 'normal' allele &lt;i&gt;B&lt;/i&gt;.  The paper claims that because this assumption is flawed then it will not be possible to utilise LD mapping to sucessfully map disease polymorphisms.  This may be true, but there is evidence that the structure of LD in the human genome is not uniform, there are some regions of strong LD where &lt;i&gt;D'&lt;/i&gt; and &lt;i&gt;r&lt;sup&gt;2&lt;/sup&gt;&lt;/i&gt; are high, and others where it is very weak.  As a consequence there will be some regions where LD mapping will work, and others where it will not.&lt;br /&gt;&lt;br /&gt;The wording is very strong in the paper, and is rather acerbic in its tone as exemplified in the title "Utter" refutation and and the inverted commas around the Fundamental Theorm (which to the best of my memory I have never heard of it being referred to as such, although it is implicit in the strategy of the &lt;a href="http://www.hapmap.org/" target="_blank"&gt;HapMap&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;To quote someone else "The devil is in the detail" and what is written above is far from all that is covered in the paper, but as it is freely available at the EJHG web-site, head on over, read it and make up your own mind.&lt;br /&gt;&lt;br /&gt;Terwilliger J, Hiekkalinna (2006) An utter refutation of the 'Fundamental Theorem of the HapMap' &lt;a href="http://www.nature.com/ejhg/journal/v14/n4/full/5201583a.html" target="_blank"&gt;&lt;i&gt;Eurpoean Journal of Human Genetics&lt;/i&gt; &lt;b&gt;14&lt;/b&gt;:426-437&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-114484981015972185?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/114484981015972185/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=114484981015972185' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114484981015972185'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114484981015972185'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/04/mutterings.html' title='(m)utterings....'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-114222140424708237</id><published>2006-03-13T03:42:00.000Z</published><updated>2006-04-18T04:58:08.666Z</updated><title type='text'>Usual Suspects?</title><content type='html'>A recent article in &lt;a href="http://bioinformatics.oxfordjournals.org/" target="_blank"&gt;Bioinformatics&lt;/a&gt; describes a web-based server for prioritising candidate genes within  linkage regions.  It combines annotations and sequence baed approaches, and would be most useful for people who have large regions of chromosomes to follow up on the basis of linkage studies.&lt;br /&gt;&lt;br /&gt;&lt;h2&gt;References&lt;/h2&gt;&lt;br /&gt;&lt;br /&gt;Adie EA, Adams RR, Evans KL, Porteus DJ, Pickard BS (2006) SUSPECTS: enabling fast and effective prioritization of positional candidates.  &lt;a href="http://bioinformatics.oxfordjournals.org/cgi/content/abstract/22/6/773" target="_blank"&gt;&lt;i&gt;Bioinformatics&lt;/i&gt;&lt;/a&gt; &lt;b&gt;22&lt;/b&gt;:773-774&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-114222140424708237?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/114222140424708237/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=114222140424708237' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114222140424708237'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114222140424708237'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/03/usual-suspects.html' title='Usual Suspects?'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-114168639993090978</id><published>2006-03-06T23:05:00.000Z</published><updated>2006-03-07T00:19:21.636Z</updated><title type='text'>Don't over estimate the power....</title><content type='html'>A lucid example of why one should not treat estimated haplotypes as observed haplotypes has been provided by David Curtis and Pak Sham in a letter to the &lt;a href="http://www.journals.uchicago.edu/AJHG/journal/" target="_blank"&gt;&lt;i&gt;American Journal of Human Genetics&lt;/i&gt;&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;They demonstrate through a simple example, and simulations that the method of Zhao &lt;i&gt;et al's&lt;/i&gt; (2005) entropy based approach, and that of Sham and Curtis (1995) both of which essentially treat estimated haplotypes as observed, are anticonservative (i.e. over estimate any association/effect).&lt;br /&gt;&lt;br /&gt;However in the same issue Zhao &lt;i&gt;et al&lt;/i&gt; (2006) provide a counter example  , and claim that the range of haplotype frequencies that have been investigated to date is too small, and suggest that further theoretical work is required on this area.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;h2&gt;References&lt;/h2&gt;&lt;br /&gt;&lt;br /&gt;Curtis D, Sham P (2006) Estimated Haplotype Counts from Case-Control Samples Cannot Be Treated as Observed Counts. &lt;a href="http://www.journals.uchicago.edu/AJHG/journal/contents/v78n4.html" taget="_blank"&gt;&lt;i&gt;American Journal of Human Genetics&lt;/i&gt;&lt;/a&gt; &lt;b&gt;78&lt;/b&gt;:729-730&lt;br /&gt;&lt;br /&gt;Zhao J, Boerwinkle E, Xiong M (2006) Impact of Haplotype-Frequency Estimation Error on Test Statistics in Association Studies. &lt;a href="http://www.journals.uchicago.edu/AJHG/journal/contents/v78n4.html" taget="_blank"&gt;&lt;i&gt;American Journal of Human Genetics&lt;/i&gt;&lt;/a&gt; &lt;b&gt;78&lt;/b&gt;:728-729&lt;br /&gt;&lt;br /&gt;Zhao J, Boerwinkle E, Xiong M (2005) An entropy-based statistic for genomewide association studies &lt;a href="http://www.journals.uchicago.edu/AJHG/journal/contents/v77n1.html" taget="_blank"&gt;&lt;i&gt;American Journal of Human Genetics&lt;/i&gt;&lt;/a&gt; &lt;b&gt;77&lt;/b&gt;:27-40&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-114168639993090978?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/114168639993090978/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=114168639993090978' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114168639993090978'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114168639993090978'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/03/dont-over-estimate-power.html' title='Don&apos;t over estimate the power....'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-114048558584258474</id><published>2006-02-21T01:32:00.000Z</published><updated>2006-02-21T01:33:05.926Z</updated><title type='text'>What have we 'Ear'?</title><content type='html'>An AOP publication in &lt;a href="http://www.nature.com/ng/"&gt;&lt;i&gt;Nature Genetics&lt;/i&gt;&lt;/a&gt; claims to have identified a gene involved in the secretion of wet earwax (&lt;a href="http://www.nature.com/ng/journal/vaop/ncurrent/abs/ng1733.html" target="_blank"&gt;Yoshiura &lt;i&gt;et al&lt;/i&gt;&lt;/a&gt;).  Quite why they chose to investigate this trait is beyond my understanding (after all I feel that auto-immune and cardio-vascular diseases and cancers have a far larger impact on human health world-wide than the need for the occasional ear syringe), particularly given that it is mendelian and not polygenic in nature, despite the contentious nature of the association with breast-cancer (which is likely to only be pure co-incedence).&lt;br /&gt;&lt;br /&gt;The claim that this is "...the first example of DNA polymorphism determining a visible genetic trait" is I feel somewhat grandiose and misled as there are innumerable polmorphims which have a direct effect on the phenotype of individuals who carry them, for example the &amp;Delta;F508 mutation in Cystic Fibrosis, or the Huntington's Disease gene.&lt;br /&gt;&lt;br /&gt;More interstingly the current issue of &lt;a href="http://www.nature.com/ng/" target="_blank"&gt;Nature Genetics&lt;/a&gt; has an article on a sex-xpecific QTL (Quantitative-Trait Locus).  In this article they review 17 quantitative traits (including systolic and diastolic blood pressure, triglycerides, serontin levels, LDL and HDL levels) in the &lt;a href="http://en.wikipedia.org/wiki/Hutterites" target="_blank"&gt;Hutterite's&lt;/a&gt; to see if there are any sex-specific effects.  They found that 11 of these showed differences between the sexes at a significance level of p &lt; 0.05 (although really they should have applied a Bonferroni correction and set the significance level to 0.0029 at a bare minimum assuming all the traits were independent).  Heritability differed between sexes in some cases as well.&lt;br /&gt;&lt;br /&gt;This obviously has important implications when analysing genetic data for QTL's and sex should therefore always be included in the early stages of the analysis to determine whether it has a significant effect.&lt;br /&gt;&lt;br /&gt;&lt;h2&gt;Links&lt;/h2&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.genes.uchicago.edu/hutterite/home.html" target="_blank"&gt;Complex Trait Mapping in the Hutterites&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-114048558584258474?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/114048558584258474/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=114048558584258474' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114048558584258474'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/114048558584258474'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/02/what-have-we-ear.html' title='What have we &apos;Ear&apos;?'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113929728747353447</id><published>2006-02-07T07:28:00.000Z</published><updated>2006-02-07T07:28:07.496Z</updated><title type='text'>DNA databases...</title><content type='html'>Back in April 2005 (21st to be precise), I attended a public debate entitled &lt;a href="http://nowgen.org.uk/event/index.php?eid=21"&gt;DNA Profiling and You: Do genetic databases threaten or benefit society?&lt;/a&gt; at the &lt;a href="http://www.nowgen.org.uk/"&gt;Nowgen Centre&lt;/a&gt; in Manchester.&lt;br /&gt;&lt;br /&gt;I heard that when the databases were initiated in 1999 it was only admisable to take DNA samples from individuals who had been convicted of a crime, but was astounded that since then the laws have been relaxed to such an extent that samples are routinely taken by people who are arrested even if they are acquited of the crime for which they were arrested.  This was verified by a friend of mine who was arrested but never charged, yet was obliged (under the threat of force) to provide a DNA sample.&lt;br /&gt;&lt;br /&gt;Recent articles in the news are highlighted in this months &lt;a href="http://www.nature.com/nrg/"&gt;Nature Reviews Genetics&lt;/a&gt;, and highlights not only the racial bias that exists within the UK database, but the rapid expansion that it has undergone (see the article &lt;a href="http://www.nature.com/nrg/journal/v7/n2/full/nrg1796.html"&gt;here&lt;/a&gt; if you can access it).&lt;br /&gt;&lt;br /&gt;I was astounded to read that only ten microsatellites are used for typing individuals as this seems exceptionally small (in the future I may do some simulations to investigate the statistical probabilites of finding matches by chance alone with such a small sample size if time permits).&lt;br /&gt;&lt;br /&gt;I do not wish to comment on the moral or ethical implications of collecting such databases, but find it amazing that such databases exist and continue to expand relatively unchecked.  It is heinous that samples can be taken and retained freely, particularly in light of the extensive legal and ethical standards that organised medical research projects have to adhere to when applying for and executing medical research.&lt;br /&gt;&lt;br /&gt;&lt;h2&gt;Links&lt;/h2&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://news.bbc.co.uk/2/hi/uk_news/4579366.stm"&gt;DNA database continues to swelll&lt;/a&gt; (BBC News, 04/01/2006)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113929728747353447?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113929728747353447/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113929728747353447' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113929728747353447'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113929728747353447'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/02/dna-databases.html' title='DNA databases...'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113919048191010705</id><published>2006-02-06T01:48:00.000Z</published><updated>2006-02-06T01:48:01.920Z</updated><title type='text'>Statalist Blog...</title><content type='html'>I use the statistical package &lt;a href="http://www.stata.com"&gt;Stata&lt;/a&gt; and today I came across the blogger feed of the mailing list and thought I'd menion it here.&lt;br /&gt;&lt;br /&gt;To view posts to this mailing list simply go &lt;a href="http://statalist.blogspot.com/"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113919048191010705?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113919048191010705/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113919048191010705' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113919048191010705'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113919048191010705'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/02/statalist-blog.html' title='Statalist Blog...'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113886034462303246</id><published>2006-02-02T06:06:00.000Z</published><updated>2006-02-02T06:05:44.640Z</updated><title type='text'>Festa(r) &amp; Festa(r) tagging SNPs</title><content type='html'>Yet another SNP tagging algorithm and program has been released, this time from the University of Michigan Center of Statistical Genetics.&lt;br /&gt;&lt;br /&gt;Qin &lt;i&gt;et al&lt;/i&gt; (2006) &lt;a href="http://bioinformatics.oxfordjournals.org/cgi/content/abstract/22/2/220"&gt;&lt;i&gt;Bioinformatics&lt;/i&gt;&lt;/a&gt; &lt;b&gt;22&lt;/b&gt;:220-225 propose tagging on the basis of the pairwise LD measure r&lt;sup&gt;2&lt;/sup&gt; and have developed a fast and efficent algortihm which breaks genomic regions down into sections, allowing for faster computation of tagging SNPs has been developed to carry this out.&lt;br /&gt;&lt;br /&gt;It avoids the potential drawback of multimarker approaches which may not perform well when genotyping fails.&lt;br /&gt;&lt;br /&gt;This is a nice simple approach to increasing efficency, since there is little point in attempting to tag SNPs over long distances.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113886034462303246?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113886034462303246/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113886034462303246' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113886034462303246'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113886034462303246'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/02/festar-festar-tagging-snps.html' title='Festa(r) &amp; Festa(r) tagging SNPs'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113748552809165416</id><published>2006-01-17T08:12:00.000Z</published><updated>2006-01-30T10:55:24.540Z</updated><title type='text'>Jump in the pool...</title><content type='html'>An interesting article in  &lt;a href="http://www.nature.com/ng"&gt;Nature Genetics&lt;/a&gt;  by Skol &lt;i&gt;et al&lt;/i&gt; (&lt;a href="http://www.nature.com/ng/journal/v38/n2/abs/ng1706.html"&gt;v38:209-213&lt;/a&gt;)investigates different strategies when performing Whole Genome Screens by Association (WGSA) using a two stage design whereby a subset of the total cohort of cases and controls are typed for all markers across the genome (perhaps with &lt;a href="http://www.affymetrix.com/"&gt;Affymetrix's&lt;/a&gt; 100k or 500k SNP chip or &lt;a href="http://www.perlegen.com"&gt;Perlegen's&lt;/a&gt; bead arrays), and a subset of associated markers are selected for follow up in the remaining samples.&lt;br /&gt;&lt;br /&gt;The paper shows that irrespective of the proprtion split made in the cohort, or the number of markers that are to be followed up, the most powerful strategy for detecting effects is to analyse the first (screening) and second (replication) cohorts jointly instead of trying to replicate associations in the second cohort alone.&lt;br /&gt;&lt;br /&gt;Intuitivevly this makes sense since you are including individuals who have already demonstrated association from the first stage, and seeing if the inclusion of the second cohort reinforces the effect or dissimulates it.&lt;br /&gt;&lt;br /&gt;Personally quite interesting since I'm currently involved in such a study.&lt;br /&gt;&lt;br /&gt;Reference&lt;br /&gt;&lt;br /&gt;Skol AD, Scott LJ, Abecasis GR, Boehnke M (2005) Joint analysis is more efficent than replication-based analysis for two-stage genome-wide association studies.  &lt;a href="http://www.nature.com/ng/journal/v38/n2/abs/ng1706.html"&gt;&lt;i&gt;Nature Genetics&lt;/i&gt;&lt;/a&gt; &lt;b&gt;AOP&lt;/b&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113748552809165416?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113748552809165416/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113748552809165416' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113748552809165416'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113748552809165416'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2006/01/jump-in-pool.html' title='Jump in the pool...'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113594838797258963</id><published>2005-12-30T13:12:00.000Z</published><updated>2006-01-03T14:14:54.386Z</updated><title type='text'>Donate your idle CPU time to medicine...</title><content type='html'>Two worthwhile &lt;a href="http://www.google.com/search?q=define%3A%20distributed%20computing"&gt;distributed computing&lt;/a&gt; projects that I came across today are &lt;a href="http://folding.stanford.edu/"&gt;Folding@home&lt;/a&gt; and &lt;a href="http://genomeathome.stanford.edu/"&gt;Genome@home&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The former seeks to understand the way in which proteins secondary and tertiary structures are formed, whilst the later seeks to use this information to derive novel genes (and hence proteins) for application in theraputic medicines.&lt;br /&gt;&lt;br /&gt;If you have a computer on your desk, even if you use it throughout the day there is likely to be time when its full CPU capacity is not being used, and by downloading on of the clients you can help solve some of the computational problems that exist within medical research.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113594838797258963?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113594838797258963/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113594838797258963' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113594838797258963'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113594838797258963'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2005/12/donate-your-idle-cpu-time-to-medicine.html' title='Donate your idle CPU time to medicine...'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113589978441297279</id><published>2005-12-29T23:43:00.000Z</published><updated>2005-12-29T23:43:04.440Z</updated><title type='text'>Lost hunting for genes?</title><content type='html'>The &lt;a href="http://www.nature.com/ng/journal/v38/n1/index.html"&gt;current&lt;/a&gt; issue of &lt;a href="http://www.nature.com/ng/"&gt;Nature Genetics&lt;/a&gt; has an interesting article entitled &lt;a href="http://www.nature.com/ng/journal/v38/n1/abs/ng0106-3.html"&gt;"A road map for efficient and reliable human genome epidemiology"&lt;/a&gt;, which champions the notion that researchers need to co-ordinate their work.&lt;br /&gt;&lt;br /&gt;One key point the authors make is that there is still a publication bias which favours positive associations.  This clear bias detracts from the scientific quality of the area as a whole, and runs the risk of engendering data dredging.&lt;br /&gt;&lt;br /&gt;A number of online databases are also listed (many of these and more can be found &lt;a href="http://slack.ser.man.ac.uk/"&gt;here&lt;/a&gt; in the Links section of my web-site)...&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.cdc.gov/genomics/hugenet/default.htm"&gt;Human Genetic Epidemiology Network (HuGENet)&lt;/a&gt;&lt;br /&gt;(&lt;a href="http://www.hugenet.org.uk/"&gt;UK HuGENet Coordinating Centre&lt;/a&gt;)&lt;br /&gt;&lt;a href="http://geneticassociationdb.nih.gov/"&gt;Genetic Association Database&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.pharmgkb.org/"&gt;PharmGKB: The Pharmacogenetics and Pharmacogenomics Knowledge Base&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.strobe-statement.org/"&gt;STROBE statement&lt;/a&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113589978441297279?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113589978441297279/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113589978441297279' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113589978441297279'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113589978441297279'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2005/12/lost-hunting-for-genes.html' title='Lost hunting for genes?'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113508029440363683</id><published>2005-12-20T11:55:00.000Z</published><updated>2005-12-20T12:04:54.413Z</updated><title type='text'>Common disease variants are invariant</title><content type='html'>A recent article in &lt;a href="http://www.journals.uchicago.edu/AJHG/home.html"&gt;The American Journal of Human Genetics&lt;/a&gt; by &lt;a href="http://www.journals.uchicago.edu/AJHG/journal/issues/v78n1/42639/brief/42639.abstract.html"&gt;Lohmueller &lt;span style="font-style:italic;"&gt;et al&lt;/span&gt;&lt;/a&gt; shows that &lt;a href="http://www.phgu.org.uk/info_database/glossary.html"&gt;SNP&lt;/a&gt; variants involved in 'common' diseases tend not to vary considerably between the two populations they compared.&lt;br /&gt;&lt;br /&gt;This is good news in the world of gene hunting since it suggests that these variants arose prior to the divergence of human populations, and that they variants are therefore common to most human populations.  This is not to say that &lt;a href="http://www.phgu.org.uk/info_database/glossary.html"&gt;genetic heterogeneity&lt;/a&gt; does not exist, since there are likely to be other mutations that influence disease that are population specific, but it is encouraging since it means that it should be reasonably easy to replicate associations with some of the common variants (indeed that was one of the criteria for selection used in the paper).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113508029440363683?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113508029440363683/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113508029440363683' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113508029440363683'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113508029440363683'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2005/12/common-disease-variants-are-invariant.html' title='Common disease variants are invariant'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113455777084430232</id><published>2005-12-14T10:45:00.000Z</published><updated>2005-12-14T10:56:10.850Z</updated><title type='text'>Proceedings of Genetic Analysis Workshop 14</title><content type='html'>The summary papers for the 14th &lt;a href="http://www.gaworkshop.org/"&gt;Genetic Analysis Workshop&lt;/a&gt; have been published as Supplementary issue to the journal &lt;a href="http://www3.interscience.wiley.com/cgi-bin/jhome/35841"&gt;Genetic Epidemiology&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;These papers are summaries for each of the working groups that were formed as part of the workshop.  Individual work from each members of the group are due to be published as a supplement in &lt;a href="http://www.biomedcentral.com/bmcgenet/"&gt;BMC Genetics&lt;/a&gt; in the not too distant future.&lt;br /&gt;&lt;br /&gt;(I attended this meeting and was part of group 7).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113455777084430232?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113455777084430232/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113455777084430232' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113455777084430232'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113455777084430232'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2005/12/proceedings-of-genetic-analysis.html' title='Proceedings of Genetic Analysis Workshop 14'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113412055118412735</id><published>2005-12-09T09:29:00.000Z</published><updated>2005-12-09T09:29:12.040Z</updated><title type='text'>Recommended Reading</title><content type='html'>I've just finished reading James Watson's "DNA: The Secret of Life" and can highly recommend it to anyone interested in the area, even if they are already familiar with the field.&lt;br /&gt;&lt;br /&gt;Personally I found it very interesting from a historical point of view since it discussed a lot of the origins of the field, and it should be on the recommended reading list of any undergraduate human genetics course.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113412055118412735?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113412055118412735/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113412055118412735' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113412055118412735'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113412055118412735'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2005/12/recommended-reading.html' title='Recommended Reading'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113380937070531640</id><published>2005-12-05T18:57:00.000Z</published><updated>2005-12-05T19:07:59.006Z</updated><title type='text'>Interesting Blogs</title><content type='html'>Nothing much to add today, but here are a few links to other blogs readers may find interesting.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://blogs.nature.com/ng/freeassociation/" target="_blank"&gt;Free Association&lt;/a&gt; is the &lt;a href="http://www.nature.com/ng/" target="_blank"&gt;Nature Genetics&lt;/a&gt; blog.&lt;br /&gt;&lt;br /&gt;And after reading the above blog Paul Myer's blog &lt;a href="http://pharyngula.org/" target="_blank"&gt;Pharyngula&lt;/a&gt; would be of general interest as well.&lt;br /&gt;&lt;br /&gt;When I've got more time I shall add a blog links section to the right.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113380937070531640?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113380937070531640/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113380937070531640' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113380937070531640'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113380937070531640'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2005/12/interesting-blogs.html' title='Interesting Blogs'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113368663268125815</id><published>2005-12-04T08:57:00.000Z</published><updated>2005-12-04T09:10:31.150Z</updated><title type='text'>R for Statistical Genetics</title><content type='html'>&lt;a href="http://www.r-project.org/"&gt;R&lt;/a&gt; is becoming increasingly popular as a tool for statistical genetics. There are already a number of packages available, and these are detailed at the &lt;a href="http://www.stats.bris.ac.uk/R/src/contrib/Views/Genetics.html"&gt;CRAN Task View: Statistical Genetics&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;R has also been extended considerably as part of the &lt;a href="http://www.bioconductor.org/"&gt;Bioconductor&lt;/a&gt; project which develops tools for bioinformatics and micro-array analyses.&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113368663268125815?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113368663268125815/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113368663268125815' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113368663268125815'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113368663268125815'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2005/12/r-for-statistical-genetics.html' title='R for Statistical Genetics'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113359836999015970</id><published>2005-12-03T08:25:00.000Z</published><updated>2005-12-03T08:26:55.000Z</updated><title type='text'>Pan autoimmune genes...</title><content type='html'>Okay so I guess I should start adding stuff to this blog.&lt;br /&gt;&lt;br /&gt;Theres an interesting article on the role of PTPN22, which shows numerous &lt;a href="http://geneticassociationdb.nih.gov/cgi-bin/tableview.cgi?table=allview&amp;cond=GENE=%27PTPN22%27"&gt;associations with autoimmune diseases&lt;/a&gt;.  The article is in Nature Genetics and can be read&lt;br /&gt;&lt;a href="http://www.nature.com/ng/journal/v37/n12/abs/ng1205-1300.html;jsessionid=D7EE31FFC69E442C0824874DE9535676"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113359836999015970?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113359836999015970/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113359836999015970' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113359836999015970'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113359836999015970'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2005/12/pan-autoimmune-genes.html' title='Pan autoimmune genes...'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-19517056.post-113353604522847519</id><published>2005-12-02T15:03:00.000Z</published><updated>2005-12-04T12:53:42.860Z</updated><title type='text'>Welcome</title><content type='html'>Welcome to my blog!&lt;br /&gt;&lt;br /&gt;I work in the fast-moving (exciting?) world of human genetics and am a genetic statistician.&lt;br /&gt;&lt;br /&gt;I intend to use this page to detail useful information about statistical genetics, so if you work in the field as well you may find it interesting, if you don't then I hope you enjoy reading it anyway.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/19517056-113353604522847519?l=slack---line.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://slack---line.blogspot.com/feeds/113353604522847519/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=19517056&amp;postID=113353604522847519' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113353604522847519'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/19517056/posts/default/113353604522847519'/><link rel='alternate' type='text/html' href='http://slack---line.blogspot.com/2005/12/welcome.html' title='Welcome'/><author><name>slack---line</name><uri>http://www.blogger.com/profile/12064710888589423976</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
