Pragya Srivastava, Shivali Malhotra, Souvik Maiti, Mudit Vaid, Devendra Parmar, Moumita Chaki, Manoj Jain, Rupali Chopra, Mitali Mukerji, Tsering Stobdan, Suchita Singh, Meenakshi Chakravorty, Alok Dhawan, Partha P. Majumder, Chitra Chauhan, Jinny A. Paul, Srikanta Kumar Rath, Debasis Dash, Ankur Saxena, Rajshekhar Chatterjee, R. S. Bharti, S.K. Das, Moulinath Acharya, S. Siva, Arindam Biswas, B. R. K. Shukla, Dwaipayan Bharadwaj, Shilpy Sharma, Swapan K Das, Ravi Shankar, Qadar Pasha, Saman Habib, Mridula Singh, Ishani Deb, Abhay Sharma, Komal Virdi, Ajay Vidhani, Ishita Chattopadhyay, Shantanu Sengupta, Kumarasamy Thangaraj, Jitender Kumar, J. P. Srivatava, Gautam Ghosh, Mamta Sharma, Aarif Ahsan, Rubina Tabassum, Mohini Anand, A Saha, Keya Chaudhuri, Giriraj R. Chandak, J. R. Gupta, Ravishankar Roy, Charu Rajput, Samira Bahl, Ashok K. Singh, Saibal Mukherjee, Anwar J. Khan, Ranjana Verma, Kunal Ray, Arunava Banerjee, Arijit Mukhopadhyay, Prashant Singh, Samir K. Brahmachari, Ashok Kumar, Arvind P. Singh, Rukhsana Chowdhury, Arnab Gupta, Taruna Madan, Shiladitya Sengupta, Ashima Bhattacharyya, Taraswi Banerjee, Chaitali Misra, Kamlesh Bisht, Ganga Nath Jha, Jagmohan Singh, Anubha Mahajan, Jyotsna Batra, Susanta Roychoudhury, Amit Kumar Mitra, Madhu Singh, Rana Nagarkatti, Suddhasil Mookherjee, Arun Bandyopadhyay, V. R. Rao, Shrawan K. Mishra, Balaram Ghosh, Tufan Naiya, Vinay Khanna, Swapnil Sinha, Somnath Dutta, Aradhita Baral, Amitabh Sharma, Vijaya Banerjee, Nitin Maurya, Sreenivas Chavali, Rajesh Pandey, Gourish Monadal, Uma Mittal, and Lalji Singh
Indian population, comprising of more than a billion people, consists of 4693 communities with several thousands of endogamous groups, 325 functioning languages and 25 scripts. To address the questions related to ethnic diversity, migrations, founder populations, predisposition to complex disorders or pharmacogenomics, one needs to understand the diversity and relatedness at the genetic level in such a diverse population. In this backdrop, six constituent laboratories of the Council of Scientific and Industrial Research (CSIR), with funding from the Government of India, initiated a network program on predictive medicine using repeats and single nucleotide polymorphisms. The Indian Genome Variation (IGV) consortium aims to provide data on validated SNPs and repeats, both novel and reported, along with gene duplications, in over a thousand genes, in 15,000 individuals drawn from Indian subpopulations. These genes have been selected on the basis of their relevance as functional and positional candidates in many common diseases including genes relevant to pharmacogenomics. This is the first large-scale comprehensive study of the structure of the Indian population with wide-reaching implications. A comprehensive platform for Indian Genome Variation (IGV) data management, analysis and creation of IGVdb portal has also been developed. The samples are being collected following ethical guidelines of Indian Council of Medical Research (ICMR) and Department of Biotechnology (DBT), India. This paper reveals the structure of the IGV project highlighting its various aspects like genesis, objectives, strategies for selection of genes, identification of the Indian subpopulations, collection of samples and discovery and validation of genetic markers, data analysis and monitoring as well as the project's data release policy.