Anup Thesis
Anup Thesis
Anup Thesis
Jadavpur University
Kolkata, India
2008
License: CC BY-NC-ND
License: CC BY-NC-ND
To
My Beloved Parents
(Late Sri Ramesh Chandra Das and Late Smt. Bani Das)
License: CC BY-NC-ND
Acknowledgement
This is high time for me to take this opportunity to express gratitude and
thanks from the core of my heart to those without whose cooperation it would
not at all possible to complete this task. At first, I want to express profuse
thanks to my research supervisors Dr. Chaitali Dutta and Prof. Bimal Kanti
Sen. They rendered expert guidance and generous help in the successful
turnover of this work and assisted me in different ways by allowing exploiting
different facilities on their own behalf.
I am highly obliged to my academic mentors Prof. S.B. Ghosh, Smt T. A.
Pandalai and Late Smt Aruna Karanjai for their valuable discussions and
suggestions about my topic. At many critical junctures in my life, the motherly
affection of Smt Karanjai helped me in attainment of my professional goals.
I am extremely thankful to the respondents of my survey questionnaire, and
key persons of the digitization initiatives across India who have been
interviewed by me during my field study. They have also extended their
friendly supports while demonstrating their digitization processes. In this
connection, I would like to mention pioneering names in this nascent area
who gave me a deep insight on the subject during my field visit at their
institutions: Dr. Usha Mujoo Munshi of INSA, New Delhi; Dr. M.G. Sreekumar
of IIM Kozhikode; Shri A. Hariharan of SERC, Chennai; Dr. Jagdish Arora of
IIT Delhi; Prof. A.R.D. Prasad of DRTC, Bangalore; Dr. Vamshi Ambati of IIIT
Hyderabad; Dr. V.N. Shukla of CDAC Noida; Prof. N. Balakrishnan of IISc
Bangalore; Dr. S. Arunachalam of MSSRF, Chennai; and Dr. Ramesh C.
Gaur of IGNCA, New Delhi.
My
extreme
gratitude
finds
no
horizon
to
express
her
thankful
License: CC BY-NC-ND
Mahesh of NISCAIR; Prof. Uma Kanjilal, Prof. Neena Talwar Kanungo and
Dr. Jaideep Sharma of IGNOU; and Shri Ashok Sharma and Dr. Shankar
Chowdhury of UNESCO.
I am expressing hearty thanks to my friends Bidyarthi, Bhaswati, Susmita-di,
Indrani-di, Subir, Swapan, Sujit-da, Gopal-da, Aulie, Sanjay, Sabuj, Sevukan,
Manidipa (Chotan), Dr. Premesh Sarkar, Dr. Jyotirmoy Samajder, Dr. Kaushik
Sur, and other anonymous friends who stretched their friendly hands at some
critical junctures to accelerate progress of this project.
This is the right time for me to remember my respected parents, my beloved
sister (Late Smt Ranjana Das), my beloved Banima (Late Smt Bani Rani
Ghosh, 1924-2002), who left me and was surely happiest persons at such
buoyant moment of my life. I am also grateful to my uncle (Prof. Ranajit Kr.
Dutta), parents-in-law (Shri S.N. Mukherjee and Smt Dipali Mukherjee) and
brothers (Ashim and Ujjwal) for their incessant inspiration and cooperation in
executing this job.
Last but not the least, my wife Smt Anasua sacrificed a lot for successful
turnover of my applied effort and lent her hands at every moment.
License: CC BY-NC-ND
Preface
Digitization of library materials provides an excellent opportunity to widely
disseminate our documentary heritages and greatly increases access to library
collections of rare documents as well as current research literature. Indian
digitization initiatives aim at producing a vast amount of digitized documents
pertaining to different forms of recorded human knowledge, ranging from the rare
manuscripts to current research literature.
This work contains six chapters in all. The first chapter is Introduction, and it is
sub-divided in different sections. The first section describes the background of
the work. The next few sections have introduced with some fundamental
concepts, for instance, open access, digital library, information retrieval, etc.
These concepts form the basis of this study. In this chapter, some national and
international policy instruments and guidelines along with the published literature
by Indian practitioners are critically reviewed for better theoretical understandings
of the subject.
Digital Library of India (DLI) is the largest digitization initiative in India spreading
across states of India and involving over ninety organizations to ensure several
thousands of rare books written in Indian languages as well as non-Indian
languages are accessible through Internet channel. Chapter 2 critically appraises
different aspects of the DLI project such as collaboration pattern, digitization
processes, metadata practices, information retrieval, etc.
The higher education system in a country produces advanced research literature
in the form of theses and dissertations in both thrust areas of the country as well
as in the emerging academic fields. The researchers in the country need to
consult results of past researches in order to expand frontiers of knowledge.
Theses and dissertations are constant in demand for our academic research,
although availability of the same is the matter of concern. Chapter 3 critically
appraises different aspects of digitization work for theses and dissertations in
higher learning institutions in India, more particularly initiatives such as
Vidyanidhi Digital Library project, ETD@IISc and OpenMed@NIC.
vi
License: CC BY-NC-ND
vii
License: CC BY-NC-ND
Table of Contents
Page
Chapter 1: Introduction
139
1.1 Background
1.5 Scope
1.6 Methodology
10
31
1.9 References
33
Chapter 2:
40140
42
49
53
66
69
viii
License: CC BY-NC-ND
74
76
78
88
90
93
101
109
114
131
134
136
138
2.19 References
139
141199
141
152
158
160
172
3.6 OpenMed@NIC
177
183
184
188
ix
License: CC BY-NC-ND
200287
4.1 Introduction
200
203
221
229
237
243
License: CC BY-NC-ND
264
4.8 References
266
Chapter 5:
268320
5.1 Introduction
268
274
278
284
289
305
315
319
5.9 References
319
Chapter 6:
321334
6.1 Prologue
321
322
xi
License: CC BY-NC-ND
332
6.4 Recommendations
334
Annexure
335349
335
343
344
345
346
348
xii
License: CC BY-NC-ND
List of Tables
Page
Chapter 2
2.1: Comprehensive List of DLI Project Partners in India
59
71
82
97
116
120
125
Hyderabad
2.9: Subject-wise Distribution of Digitized Books
127
129
Hyderabad
Chapter 3
3.1: Research Fellowship Schemes available during 11th Five Year Plan
143
155
156
166
169
169
170
Database
3.8: Advanced Search Option in Indian Theses Database
170
171
175
177
182
xiii
License: CC BY-NC-ND
186
191
193
194
Chapter 4
4.1: Coverage of Digitization and Digital Library Projects in IGNCA
213
216
220
of Materials
4.4: Different Kinds of Multimedia and Textual Collections in DL-RICH
225
226
236
241
256
261
Chapter 5
5.1: Scientific Periodicals Published by the Science Academies in India
277
282
283
283
284
287
288
288
295
297
300
305
xiv
License: CC BY-NC-ND
310
312
Chapter 6
Model 1: Public Access to Digitized Collections
325
327
328
331
xv
License: CC BY-NC-ND
List of Figures
Page
Chapter 1
1.1: Research Design
32
Chapter 2
2.1: Vision and Mission Statements of Digital Library of India
41
44
45
45
46
47
47
48
48
49
51
52
52
53
67
68
68
70
73
75
77
83
83
2.24: Book Movement within the Source Library and Scanning Centre
84
85
xvi
License: CC BY-NC-ND
86
87
87
89
89
90
92
93
96
99
99
2.37: Tagged Metadata Displayed in XML Format for the Same Book
100
101
102
103
Hyderabad
2.41: Two Search Parameters Entered in Advanced Search Interface
106
106
107
107
108
108
110
111
Hyderabad
2.49: BookReader-1: Book Displayed in TIFF Format Single Page View
112
113
View
2.51: Security Check for Reading Online in DLI Portal of RMSC
113
Hyderabad
2.52: Language-wise Distribution of Books in main DLI Portal
116
xvii
License: CC BY-NC-ND
117
117
Hyderabad
2.55: Language-wise Distribution of Scanned Pages in DLI Portal of
118
RMSC Hyderabad
2.56: Scanning Centre-wise Distribution of Digitized Books
122
123
123
124
125
Hyderabad
2.61: Subject-wise Distribution of Digitized Books
128
130
133
a DLI Portal
2.64: Data Flow in a DLI Portal integrated with Text to Speech (TTS)
133
System
2.65: Benefits of Digital Library of India
135
137
Chapter 3
3.1: Membership pattern of INDEST-AICTE Consortium
147
154
157
159
159
161
Library Project
3.7: Aims of Vidyanidhi
162
xviii
License: CC BY-NC-ND
164
165
167
174
175
176
176
180
181
181
182
187
192
193
196
Doctoral Theses
3.23: Advanced Search Interface of National Online Union Catalogue of
196
Doctoral Theses
Chapter 4
4.1: Organizational Structure of IGNCA
204
209
210
211
212
212
215
218
220
221
222
xix
License: CC BY-NC-ND
224
Access Resources
4.13: Intranet-based Retrieval Application for DL-RICH Resources
224
227
231
231
232
233
234
236
237
238
239
241
242
243
247
247
248
250
250
251
Manuscripts Library
4.33: Brochure of Manus Granthavali the Software Used for Creation of
254
256
258
258
260
264
xx
License: CC BY-NC-ND
Chapter 5
5.1: Modes of Access to Digitized Journal Collections
270
273
Journals
5.3: Open Access Channels for Digitized Journal Articles
273
274
276
277
280
281
281
282
287
292
Division at NIC
5.13: Schematic Structure of medIND Project
293
293
293
294
299
299
303
303
303
304
304
308
xxi
License: CC BY-NC-ND
309
of MedKnow Publications
5.27: Digitization Work in Progress for Udbodhan Collection
316
318
ROM Collection
5.29: Search Engine for Udbodhan Centenary CD-ROM Collection
319
Chapter 6
6.1: Transition from No Online Public Access to Full-text Open Access to
329
ETD Collection
6.2: National Digital Information System integrating Indigenous Digital
333
Libraries
xxii
License: CC BY-NC-ND
CHAPTER 1
Introduction
Digital Library is where the past meets the present and creates the future...
Dr. A.P.J. Abdul Kalam
Former President of India
1.1 Background
India is one of the emerging economies having consistent economic growth rate
in last two decades due to increasing contributions from the knowledge-based
service sector. Leveraging prospect of liberalization and globalization of Indian
economy that was introduced during early 1990s, India has become one of the
top software exporting countries. While India exports cutting edge IT products
and services to the developed nations, capabilities of other knowledge-based
industries are also getting strengthened in order to meet global demand of
knowledge-based services as well as stay ahead in the global competition (NKC,
2006).
Knowledge-based
industries
are
human
capital
intensive
and
License: CC BY-NC-ND
License: CC BY-NC-ND
License: CC BY-NC-ND
libraries both kinds of documents are available. The digitized documents are
stored either in image formats or in text formats. If the original documents are
available in European languages such as English, French, German and Spanish,
the optical character recognition (OCR) software can automatically convert them
into searchable digital text format, where qualitative OCR conversion rate is
much higher. On the other hand, if the original documents are available in Indian
languages such as Sanskrit, Hindi, Bengali, Oriya, Telugu and Tamil, the
contents are made available either in image formats or re-keying the texts for the
inclusion in the digital libraries. OCR software for Indian languages is still in the
developmental or testing stage, where OCR conversion rate is much lower than
acceptable rate. The full-text searching is possible in textual documents but this
facility is absent in image documents.
1.2.1 Documents and Collections in Digital Library Systems
Digital library is the concept of information stored digitally and made accessible
to users through digital systems and networks, but having no single location. It is,
therefore, analogous to a library as a storehouse of information, but has a virtual
existence in the digital spaces. Digital library is essentially a fully automated
information system with all resources in digial form. Many views of digital libraries
stem from what libraries currently do. Traditional libraries collect, organise,
provide access to, and preserve objects in their collections. A library collection
may include books, magazines, journals, theses, dissertations, manuscripts,
audio-visuals, maps, etc. The flexibility of digital technology allows it to handle
new kinds of object efficiently. Digital library collections can include things without
direct physical analogs, such as algorithms or real time data feeds. They also
may include digitized representations of what have traditionally appeared largely
in museums and archives. With the rise of cost of paper publications and library
storage, increasing use of computers, decreasing budgets, many libraries have
to reduce their acquisition of books as well as their journal subscriptions.
Documents in electronic form can become more easily available and widely used
because the cost of digital storage and processing is going down.
4
License: CC BY-NC-ND
Documents are the heart of digital libraries. Without documents there would be
no digital libraries. In digital libraries, documents are not only what are stored in
traditional libraries (e.g., books, journals, pictures and videos), but also include
many works uncommon to those libraries, e.g., multilingual, multimedia, and
structured documents (e.g., books broken into chapters, sections, subsections,
figures with attached captions, colour graphics or images, attached or linked
sound or video files, appendices, indexes, and 'front matter'); programs,
algorithms, bulletin board archives, besides others. A document can have various
representations depending on its intended use; for example, some applications
require high-resolution images of documents with invisible watermarks for
security purposes as well as low-resolution images for children to download from
the Internet. Collections of digital library ranges from small, self-contained, and
narrowly defined collections to ones spread across physical and logical spaces.
One of the common requirements for a digital library is the ability to deal with
distributed collections of information.
License: CC BY-NC-ND
Digitization workflow;
License: CC BY-NC-ND
License: CC BY-NC-ND
1.5 Scope
This research work aims at producing State of the Art Report in the emerging
area based on exploratory research method.
research is to gather preliminary information that will help define problems and
suggest hypotheses. Given its fundamental nature, exploratory research often
does not require any hypothesis (Kotler & Armstrong, 2006). Thus, this research
work was carried out without any hypothesis.
In this thesis, some well-known projects have been evaluated based on some
parameters as indicated in Section 1.4 of this Chapter. Apart from some national
level initiatives, some institutional cases are also covered in this thesis while
these institutional initiatives are strongly representing national ethos on this
emerging area.
1.6 Methodology
This research work has been carried out in different phases.
Phase I: In this phase the author was involved in review of literature with special
focus on national and international policy instruments, digitization standards,
digitization guidelines, best practices, country reports, institutional case studies,
License: CC BY-NC-ND
and current research findings related to digital library development in the country
and abroad.
Phase II: In this Phase the author prepared a structured questionnaire and tested
this questionnaire in consultation with digital library practitioners in India. After
successful testing, questionnaires were circulated to project coordinators of
selected digitization initiatives across India. This questionnaire is provided the
Annex I of this thesis. Later, the author undertook field visits to the important
digitization project sites across India for collecting data, applying observation and
interview methods. The basis for selecting the institutes for study and field work
was (i) representative project sites of national level initiatives such as Digital
Library of Indias regional mega scanning centres, scanning centres and source
libraries; Indian Institute of Science (IISc), Indira Gandhi National Centre for the
Arts (IGNCA), National Mission for Manuscripts (NMM), National Informatics
Centre (NIC) and many other important institutions across the country; (ii) prior
appointments with the key persons in the national level initiatives whoever
responded in time.
In this field work, the author interviewed project coordinators, project supervisors
and project staff whoever were available for recording strategic, operational
and practical information related to digitization work. In this phase critical and
exhaustive studies were carried out mostly following case study method. Case
studies revealed detailed descriptions of digital library projects. Each component
in a digitization project set up such as ICT infrastructure, digitization equipment,
software used for content digitization, quality control and aggregation; information
storage and retrieval system are critically observed and recorded. Observations
in the field visits were properly documented as per scientific research method.
Phase III: This phase revolves around critical analysis of gathered data. In this
phase, final report is prepared in the form of thesis incorporating analytical
description of selected digitization projects, recommendations and suggesting
future research directions.
License: CC BY-NC-ND
In this thesis APA Style is followed for citing references and sources of
information, which is based on fifth edition of the Publication Manual of the
American Psychological Association (APA, 2001).
10
License: CC BY-NC-ND
11
License: CC BY-NC-ND
12
License: CC BY-NC-ND
knowledge
societies.
number of
open
access
declarations/ statements were made during the past decade, where the worlds
leading research institutions agreed on the open access mandates. An indicative
list of major open access statements or declarations is given here:
License: CC BY-NC-ND
14
License: CC BY-NC-ND
and Working Group on Open Access and Open Educational Resources have
strongly recommended digitization of documentary heritage collections, open
access to public-funded research literature and supported establishment of open
courseware repositories for countrywide dissemination of quality courseware to
many cross sections of people. If implemented, these recommendations will have
far-reaching implications in the knowledge creation and dissemination cycle. The
documentary heritage collections, scholarly literature and lifelong learning
materials digitized or produced by state-sponsored institutions would then be
made accessible through open access channels such as national and
institutional repositories. This way the NKCs recommendation on peer-reviewed
research papers resulting from public-funded research would be validated by
subject experts when making these resources available through open access
channels. NKC also recommended creation of national knowledge portals for
basic needs/ key sectors such as water, energy, environment, education, food,
health, agriculture, employment and citizen rights. Already national portals on
water (India Water Portal, www.indiawaterportal.org), energy (India Energy
Portal, www.indiaenergyportal.org), environment (India Environment Portal
www.indiaenvironmentportal.org.in) and education (Sakshat, www.sakshat.ac.in)
have been launched that provide open access to information, knowledge and
learning
resources
on
the
relevant
areas.
An
excerpt
of
relevant
15
License: CC BY-NC-ND
Report of the Working Group on Open Access and Open Educational Resources, NKC
http://knowledgecommission.gov.in/downloads/documents/wg_open_course.pdf
2
Report to the Nation 2007. New Delhi: National Knowledge Commission.
16
License: CC BY-NC-ND
17
License: CC BY-NC-ND
digitization
projects.
Some
guidelines
deal
with
technical
specifications and other operational issues in great details, whereas some other
18
License: CC BY-NC-ND
guidelines help the project managers in handling other aspects of the digitization
projects such as human resources, finance, legal matters including intellectual
property rights, advocacy, and how to involve stakeholders.
1.7.2.1 International Guidelines
Several guidelines are in existence at the national and international levels. Some
international guidelines are followed in Indian digitization projects, whereas some
national guidelines have adopted international guidelines with some degree of
localization. Popular international guidelines are described in this Section.
a) Guidelines for Digitization Projects for Collections and Holdings in the
Public Domain, particularly those held by Libraries and Archives
This document is jointly prepared and published by UNESCO, IFLA
(International Federation of Library Associations and Institutions) and ICA
(International Council on Archives) in March 2002. These Guidelines are the
result of a joint venture of a group of experts on behalf of IFLA and ICA, who had
been invited to draft these for UNESCO (2002). This document contains eight
comprehensive chapters such as:
o Selection of documents for digitization
o Technical requirements & implementation
o Legal aspects
o Budgeting
o Human resource planning
o Development & maintenance of Web interfaces
o Preservation of digital content
o Project management.
b) The Guide for Electronic Theses and Dissertations
This guidebook is the result of a joint venture of a group of experts on behalf of
UNESCO and NDLTD (Networked Digital Library of Theses and Dissertations).
This document is prepared for ETD stakeholders such as students, researchers
research supervisors, policymakers, ETD practitioners and NDLTD members
(UENSCO, 2002). This guidebook provides a roadmap for implementing ETD at
19
License: CC BY-NC-ND
20
License: CC BY-NC-ND
This Manual covers operational aspects of mass digitization work for rare books
and printed documents. This document contains five chapters, namely:
o Data Production
o Getting MARC Records from OCLC
o Creating Metadata Using Dublin Core
o Minolta PS 7000 Quickscan Software Instructions
o ABBYY FineReader 6.0 Instructions
b) Guidelines for Digitization of Manuscripts
This document was prepared by the digitization team of National Mission for
Manuscripts (2004). This manual is used by manuscript digitization centres of the
Mission across India. This is a very comprehensive manual covering topics
namely:
o Criteria for Selection
o Technical Requirements and Implementation
o Output Specification
o Metadata Creation
o Collection Management
o Image Quality Check List
c) UGC (Submission of Metadata and Full-text of Doctoral Theses in
Electronic Format) Regulations, 2005
This document was prepared in 2005 for national level consultation of
stakeholders in Indian higher education system. This document provides a
detailed roadmap for achieving ETD repositories at the national and university
levels. This document is divided into two parts. The first part provides
background information on Indian and international scenarios. In this part, some
specific standards for theses data are also elaborated. The second part deals
with implementation process. An outline of this manual is given below:
o Part I: Background Information, Current Scenario, Major Issues &
Data Standards
21
License: CC BY-NC-ND
International Scenario
Technical Requirements
22
License: CC BY-NC-ND
National Digital Libraries Cell (2008). List of Projects and Current Status including Achievements
<www.mit.gov.in/default.aspx?id=325>
2
Technology Information Facilitation Programme (2008). List of Projects Supported.
<www.dsir.gov.in/tpdup/tifp/tifplop.htm>
23
License: CC BY-NC-ND
Some of the successful digitization projects funded by DSIR and analyzed in this
thesis are:
24
License: CC BY-NC-ND
The
First
IEEE
International
Conference
on
Digital
Information
25
License: CC BY-NC-ND
themes of the Conference was digital libraries and digital archives. In this
conference, a number of Indian professionals presented institutional cases
on digital libraries and open access archives (Chandra, 2006).
The International Conference on Building Knowledge Repositories: CrossSectoral Collaborations, organized by National Institute of Fashion
Technology (NIFT), was held in New Delhi during 7-9 February 2008. In
this conference, a number of Indian professionals presented institutional
cases on knowledge repositories and open access archives as well as
shared their experiences on the cross-sectoral partnership in knowledge
dissemination.
The
Conference
recommended
(i)
facilitating
and
26
License: CC BY-NC-ND
contextualize the situations with their organizations they are working in their
papers.
First beads of papers on the digitization issues were written by veteran library
professionals citing their concerns on the implementation processes of this new
domain. Dasgupta (2001, 2004) in her papers raises some concerns on the
sustainability issue. To her viewpoint, sustainability is always in doubt as most of
the digitization projects are started by getting a one time grant without a follow up
plan. She points out that sustainability is a broad term which refers to many
factors starting from technological issues of preserving digital data to social and
economic questions for long-term accessibility of information to the people. For
tackling sustainability factors, she suggests a focused attention on the broad
spectrum comprising:
a) Policy and strategies,
b) Technical issues,
c) Economic aspects,
d) Social and educational issues,
e) Political and administrative will,
f) Capacity building,
g) Cooperative ventures, and
h) Changing role of librarians and information professionals.
Sood and Chandrasekharan (2004) published a baseline study of digital libraries
in India, indicating different initiatives at the national and institutional level. In this
paper technical and content creation issues are greatly dealt with. Kalra (2001),
Singh (2002), Bhattacharya (2004), Gulati (2004), Jeevan (2004), Jain and
Babbar (2006), Singh (2006), Mittal and Mahesh (2008) besides many others
published baseline surveys in different years contextualizing technological
development at that time. However, Das et al. (2007) analyzed search and
retrieval features of eight Indian digital libraries available in public domain,
namely, Digital Library of India, Kalasampada: Digital Library-Resource for Indian
Cultural Heritage, Cultural Heritage Digital Library in Hindi, Traditional
Knowledge Digital Library, Archives of Indian Labour: Integrated Labour History
27
License: CC BY-NC-ND
Das (2008)
28
License: CC BY-NC-ND
License: CC BY-NC-ND
have become standard models for many institutional repositories in India and
even aboard. Jayakanth (2008), Anuradha (2005), and Rajasekhar (2003) have
published papers describing open access publishing model adopted in IISc for
worldwide dissemination of research papers published by researchers and
faculty members of IISc. They also described operational aspects of different
open access initiatives in IISc such as EPrints@IISc, ETD@IISc and Pravabhi:
IISc Publications database.
In two separate papers, Deb (2006) and Deb and Kar (2005) describe digitization
activities and integration of digitized contents into existing hybrid digital library at
The Energy and Resources Institute (TERI) in New Delhi.
In his paper, Sutradhar (2006) shares his experience on implementation of a
intranet-base institutional repository in IIT Kharagpur using DSpace open source
software.
In her paper, Patra (2006) shares her views on need for a digital library on
ceramics. She also proposes a roadmap for implementing a digital library in
ceramics in her institute.
National Library of India (NLI) is an institution of national importance under
Ministry of Culture having rarest of collections of manuscripts and books. In 1999
NLI undertook a pilot project entitled "Down Memory Lane" to digitize its rare and
brittle books.
containing more than 2.5 million pages were scanned and archived in 548 CDROMs (in duplicate). This digitized collection can be consulted in NLI premises
only. In a paper (National Library of India, 2003), NLI describes its digitization
activities for manuscripts and rare books along with some technical details about
the project.
There are many other institutional case studies mostly communicated in
conferences and national journals (Doctor & Ramachandran, 2008; Bansode,
2008; Doctor, 2008; Doctor, 2007; etc.). In this section, only some significant
cases are reviewed that have linkages with this research work.
30
License: CC BY-NC-ND
1.8 Chapterization
This thesis addresses essentially two issues the digitization of library materials
available with memory institutions and digitization of scholarly materials produced
by Indian researchers and making these scholarly materials available through
open access channels. This section presents an overview as to how the various
parts and sections of this research are related to the research problems and
observations of the evidences leading to the recommendation of functional as
well as business models for self-sustainability of the digitization projects. As
indicated in Figure 1.1, each chapter, within the chapters 2 and 5, presents the
evidence from perspectives of implemented projects. Chapter 2 presents critical
appraisal of Indias largest digitization initiative Digital Library of India. Chapter
3 presents critical appraisal of digitization work pertaining to theses and
dissertations that makes the way to the creation of electronic theses and
dissertations (ETD). In this chapter Indian initiatives such as Vidyanidhi Digital
Library, ETD@IISc, OpenMed@NIC, and CSIR e-Thesis projects are described.
Chapter 4 presents critical appraisal of digitization work of documentary heritage
collections. This chapter deals with digitization work for most venerable pieces of
artifacts such as rare manuscripts, rare books, old photographs, photographic
slides, audio tapes and microfilms. In this chapter, digitization initiatives of two
prominent Indian institutions, namely, Indira Gandhi National Centre for the Arts
(IGNCA) and National Mission for Manuscripts (NMM) have been evaluated.
Chapter 5 presents critical appraisal of digitization work of Indian periodicals,
more specifically the projects carried out by the Indian National Science
Academy (INSA), Indian Academy of Sciences (IAS), Indian Medlars Centre at
the National Informatics Centre (medIND@NIC), Medknow Publications and
Ramakrishna Math. Digitized journal collections make the way to the creation of
open access journals in India. Thus, the genesis of open access journal
development in India is also indicated in this chapter.
Chapter 6 concludes with the summary of findings and recommendation of
futuristic functional models suitable for changing contexts in the country.
31
License: CC BY-NC-ND
Chapter 1
Introducing the context of the study
Disclosing our preconceptions
Chapter 2
Problem Area: Rare Books
Case Study: Digital Library of India
Chapter 3
Problem Area: Theses and dissertations
Case Studies: ETD Initiatives
Chapter 4
Problem Area: Documentary Heritages
Case Studies: IGNCA and NMM Initiatives
Chapter 5
Problem Area: Indian Journals
Case Study: Open Access to Journals
Chapter 6
Summary of Findings, Conclusion and
Recommendations
32
License: CC BY-NC-ND
1.9 References
Ahmed, F. & Rather, R. (2007). Open access digital repositories: an Indian
scenario. KnowGenesis International Journal for Technical Communication,
2(1), 16-20.
Ambati, V. & Reddy, R. (2005). Towards a self healing information system for
digital libraries. Journal of Zhejiang University Science, 6A(11),1221-1228.
American Psychological Association (2001). Publication manual of the American
Psychological
Association,
5th
edition.
Washington,
DC:
American
Psychological Association.
Anuradha, K.T. (2005). Design and development of institutional repositories: A
case study. The International Information & Library Review, 37(3), 169-178
Arora, J. (2003). Indian National Digital Library in Engineering Science and
Technology (INDEST): a proposal for strategic cooperation for consortiabased access to electronic resources. The International Information & Library
Review, 35(1), 1-17.
Arunachalam, S. (2008). Open access to scientific knowledge. DESIDOC Journal
of Library and Information Technology, 28(1), 7-14.
Balakrishnan, N. (2005). Universal Digital Library future research directions.
Journal of Zhejiang University Science, 6A(11),1204-1205.
Balakrishnan, N., Reddy, R., Ganapathiraju, M. & Ambati, V. (2006). Digital
Library of India: a testbed for Indian language research. TCDL Bulletin, 3(1).
Bansode, S. (2008). Creation of digital library of manuscripts at Shivaji
University, India. Library Hi Tech News, 25(1), 13-15.
Benner, M. (2003). The digital archive of the Swedish East India Company, 17311813: a joint project of a university library and a history department. Online
Information Review, 27(5), 328-332.
Bhattacharya, P. (2004). Advances in digital library initiatives: a developing
country perspective. The International Information & Library Review, 36(3),
165-175.
33
License: CC BY-NC-ND
15
July
2008
from
www.arl.org/sparc/bm~doc/IR_Guide_&_Checklist_v1.pdf
Das, A.K. & Mukherjee Das, A. (2008). Role of lifelong learning in emerging
knowledge economy in India. In: Prof Alaka Buragohain Festschrift Volume.
Guwahai: Assam College Librarians Association.
Das, A.K. (2008). Open access to knowledge and information: scholarly literature
and digital library initiatives the South Asian scenario. New Delhi: UNESCO.
Das, A.K., Dutta, C. & Sen, B.K. (2007). Information retrieval features in Indian
digital libraries: a critical appraisal. OCLC Systems & Services, 23(1), 92-104.
Dasgupta, K. (2001). Asian content development: unity in diversity. Library
Review, 50(7/8), 403-406.
Dasgupta, K. (2005). Digitization, sustainability and access in the Indian context.
Presented in the World Library and Information Congress: 71st IFLA General
Conference and Council, "Libraries - A voyage of discovery", 14-18 August
2005,
Oslo,
Norway.
Retrieved
15
July
2008
from
www.ifla.org/IV/ifla71/papers/132e-Dasgupta.pdf
Deb, S. & Kar, D.C. (2005). Setting up an electronic library: the case of TERI.
The Electronic Library, 23(2), 189-199.
Deb, S. (2006). TERI Integrated Digital Library Initiative. The Electronic Library,
24(3), 366-379.
Doctor, G. & Ramachandran, S. (2008). DSpace@IBSA: knowledge sharing in a
management institute. VINE, 38(1), 42-52
Doctor, G. (2007). Knowledge sharing: developing the digital repository of SIPS.
VINE, 37(1), 64-73.
34
License: CC BY-NC-ND
License: CC BY-NC-ND
Retrieved
15
May
2008
from
www.library.cmu.edu/Libraries/MillionBookManual.pdf
Ministry of Communications and Information Technology (2004). Digitization of
culture - background paper for Asia IT Ministers' 2nd Summit. Retrieved 15
July 2008 from http://asiaitsummit.nic.in/Digitisation.pdf
Ministry of Communications and Information Technology (2004). Hyderabad
Declaration
2004.
Retrieved
15
July
2008
from
http://asiaitsummit.nic.in/finalplan.htm
Mittal, R. & Mahesh, G. (2008). Digital libraries and repositories in India: an
evaluative study. Program: electronic library and information systems, 42(3),
286-302.
Moghaddam, G.G. & Talawar, V.G. (2008). The use of scholarly electronic
journals at the Indian Institute of Science: a case study in India. Interlending &
Document Supply, 36(1), 15-29.
36
License: CC BY-NC-ND
Moxley, J.M., Masiello, D. & Fox, E.A. (2002). The guide for electronic theses
and
dissertations.
Paris:
UNESCO.
Retrieved
15
July
2008
from
http://etdguide.org/etdguide.pdf
Mujoo-Munshi, U. (2003). Building digital resources: creating facilities at INSA.
The International Information & Library Review, 35(2-4), 281-309.
National Digital Libraries Cell (2008). List of projects and current status including
achievements.
Retrieved
15
July
2008
from
www.mit.gov.in/default.aspx?id=325.
National Knowledge Commission (2006). Report of the Working Group on
Libraries. New Delhi: National Knowledge Commission.
National Knowledge Commission (2006). Report to the Nation 2006. New Delhi:
National Knowledge Commission.
National Knowledge Commission (2007). Report to the Nation 2007. New Delhi:
National Knowledge Commission.
National Library of India (2003). Digitization of Manuscripts of the National
Library of India. CDNLAO Newsletter, 47. Retrieved 15 July 2008 from
www.ndl.go.jp/en/publication/cdnlao/047/473.html.
National Mission for Manuscripts (2004). Guidelines for digitization of
manuscripts.
Retrieved
15
July
2008
from
http://namami.gov.in/pdfs/final%20digitization.pdf
Pandian, M.P., Jambhekar, A. & Karisiddappa, C.R. (2002). IIM digital library
system: consortia-based approach. The Electronic Library, 20(3), 211-214.
Patra, C. (2006). Developing a digital library on ceramics. The Electronic Library,
24(4), 453-469.
Prahallad, K. & Black, A. (2005). A text to speech interface for Universal Digital
Library. Journal of Zhejiang University Science, 6A(11), 1229-1234.
Prahallad, L., Prahallad, K. & Ganapathiraju, M. (2005). A simple approach for
building transliteration editors for Indian languages. Journal of Zhejiang
University Science, 6A(11),1354-1361.
37
License: CC BY-NC-ND
Nepal.
Retrieved
15
July
2008
from
www.healthnet.org.np/ocs/index.php/icikm-2008/icikm/paper/view/15/12/.
Sood, A.D. & Chandrasekharan, U. (2004). Digital libraries in India: a baseline
study. Bangalore: Educational Development Centre.
Suber, P. & Arunachalam, S. (2005). Open access to science in the developing
world. In: Open access: unrestricted access to publisher research.
Hyderabad: 93rd Indian Science Congress.
Sutradhar, B. (2006). Design and development of an institutional repository at
the Indian Institute of Technology Kharagpur. Program: electronic library and
information systems, 40(3), 244-255.
Swan, Alma (2008). Open access for Indian scholarship. DESIDOC Journal of
Library and Information Technology, 28(1), 15-24.
Technology Information Facilitation Programme (2008). List of projects
supported. Retrieved 15 July 2008 from www.dsir.gov.in/tpdup/tifp/tifplop.htm.
38
License: CC BY-NC-ND
2008
from
http://unesdoc.unesco.org/images/0013/001310/131025eb.pdf.
United Nations (2006). WSIS outcome documents booklet. Geneva: International
Telecommunication Union.
Urs, S.R., Rajashekar, T.B. & Raghavan, K.S., eds. (2001). Digital libraries:
dynamic landscapes for knowledge creation, access and management Proceedings of the 4th International Conference of Asian Digital Libraries
(ICADL 2001). Mysore: University of Mysore.
Vikas, Om, ed. (2005). Special issues on fonts technology and Digital Library of
India. Vishwabharat@TDIL, 17, 59-106.
39
License: CC BY-NC-ND
CHAPTER 2
Indias Largest Digitization Initiative Digital Library of India
The Digital Library of India (DLI) is the large scale digital library initiative in South
Asia, spreading all over the country and establishing a network of four mega
scanning centres and 21 scanning centres, which feed the digital contents into
the digital library systems. Dr. A.P.J. Abdul Kalam, the former President of India,
launched the portal of Digital Library of India on 8 September, 2003. Digital
Library of India project is a part of the Universal Digital Library (UDL) or Million
Book Project, coordinated by the Carnegie Mellon University in United States of
America. Although the original UDL project has already been concluded and
successfully implemented in the United States, the DLI project still ripples around
with much acclamation. The former President of India, Dr. Abdul Kalam has
shown keen personal interest for overall development and implementation of this
project. In different forums, he promotes Digital Library of India for the benefits of
younger generations and nation building. He also provided digital contents of his
own writings for hosting in the DLI portals.
The Vision and Mission statements of DLI are similar to the same of parent
Universal Digital Library project, except some local variations. Figure 2.1 depicts
the Vision and Mission statements of Digital Library of India project. This is the
simplified version of Vision and Mission statements as elaborated in the DLI
portals. Figure 2.1 also indicates two major benefits of Digital Library of India
project, i.e. (i) To supplement the formal education system by making knowledge
available to anyone who can read and has access; and (ii) to make locating the
relevant information inside of books far more reliable and much easier.
40
License: CC BY-NC-ND
Mission
Benefits
As a first step in
will be to
significant literary,
it is proposed to
supplement the
formal education
system by making
be digitally preserved
read, searchable
knowledge available
collection of one
available, in every
million books,
predominantly in
access.
Indian languages,
A secondary benefit
study, and
available to everyone
of online search is to
relevant information
generations.
Source: http://www.new.dli.ernet.in/pturn/book_dli.html
Fig. 2.1: Vision and Mission Statements of Digital Library of India
41
License: CC BY-NC-ND
42
License: CC BY-NC-ND
43
License: CC BY-NC-ND
India
China
Egypt
United
States
Digital
Library of
India
China-US
Million
Book
Digital
Library
Project
Digital
Assets
Repository
The
Universal
Library
Portal
Hosted by:
Bibliotheca
Alexandrina
Portal
Hosted by:
Carnegie
Mellon
University
Portals
Hosted by:
IISc
Bangalore;
IIIT
Hyderabad;
CDAC Noida
Portal
Hosted by:
Zhejiang
University
UDL Portals
44
License: CC BY-NC-ND
ht t p: //w w w .ulib.org/
Egypt
2.31%
India
22.21%
China
75.48%
45
License: CC BY-NC-ND
46
License: CC BY-NC-ND
ht t p: //dar.bibalex.org/
47
License: CC BY-NC-ND
ht t p: //dli.iiit .ac.in/
48
License: CC BY-NC-ND
ht t p: //w w w .dli.cdacnoida.in/
49
License: CC BY-NC-ND
50
License: CC BY-NC-ND
RMSC
IIIT
Hyderabad
RMSC
IIIT
Allahabad
RMSC
CDAC
Kolkata
RMSC
CDAC
Noida
SC
SC
SC
SC
SC
SC
SC
SC
SL
SL
SL
SL
SL
SL
SL
SL
51
License: CC BY-NC-ND
RMSC
IIIT
Hyderabad
SC
SC
RMSC
IIIT
Allahabad
SC
SC
RMSC
CDAC
Kolkata
SC
RMSC
CDAC
Noida
SC
SC
SC
ERNET India
Fig. 2.12: ICT Infrastructure Provided by ERNET India
DLI project
site at IISc
Regional
Mega
Scanning
(International
Scanning
Centre
Collaboration,
Centre
National
(Quality
(Creation of
Coordination
Assurance
Digital
and
and Hosting
Objects
Research)
of Digitized
through
Documents)
Digitization)
Source
Library
(Supplying
Physical
Documents)
52
License: CC BY-NC-ND
53
License: CC BY-NC-ND
License: CC BY-NC-ND
License: CC BY-NC-ND
region. DLI portal at IIIT Hyderabad hosts digitized books from the following
scanning centres and source libraries, operated under regional mega scanning
centre in IIIT Hyderabad:
Food and Agriculture Organization of the United Nations (FAO), Italy (119
books digitized and hosted)
License: CC BY-NC-ND
57
License: CC BY-NC-ND
IIIT Allahabad has tied up with few other zonal public institutions for the second
phase of DLI project. These institutions are:
58
License: CC BY-NC-ND
License: CC BY-NC-ND
Table 2.1 presents a comprehensive list of DLI partners across India, including
funding agencies, regional mega scanning centres, scanning centres, source
libraries and infrastructure partner. This Table also indicates a huge involvement
of ninety plus organizations to make this an initiative of national importance.
Table 2.1: Comprehensive List of DLI Project Partners in India
Name of DLI Partner
Indian Institute of Science, Bangalore
State
Role
Line
Authority
Karnataka
National
UDL Carnegie
Coordinator
Melon
[www.iisc.ernet.in]
University
National Digital Libraries Cell,
Delhi
Funding,
Government
Monitoring &
of India
Evaluation
[www.mit.gov.in/]
Office of the Principal Advisor to the
Delhi
Funding
Government of India
Government
of India
[www.psa.gov.in/]
Technology Development for Indian
Delhi
Research
Partner
Government
ICT
DLI, IISc
of India
Delhi
Infrastructure
Provider
International Institute of Information
Andhra
Pradesh
RMSC
DLI, IISc
RMSC
DLI, IISc
Uttar Pradesh
60
License: CC BY-NC-ND
State
Role
Line
Authority
West Bengal
RMSC
DLI, IISc
Uttar Pradesh
RMSC
DLI, IISc
Karnataka
Scanning
DLI, IISc
Centre
Tamil Nadu
[www.annauniv.edu]
Scanning
DLI, IISc
Centre
Tamil Nadu
Scanning
DLI, IISc
Centre
[www.kalasalingam.ac.in/]
Goa University [www.unigoa.ac.in/]
Goa
Scanning
DLI, IISc
Centre
Indian Institute of Astrophysics,
Karnataka
Bangalore [www.iiap.res.in/]
Indian National Science Academy,
Scanning
DLI, IISc
Centre
Delhi
Content
DLI, IISc
Partner
www.insa.ac.in/]
Maharashtra Industrial Development
Maharashtra
Scanning
DLI, IISc
Centre
[www.midcindia.org/]
Poornapragna Vidyapeetha,
Karnataka
Bangalore
Scanning
DLI, IISc
Centre
Punjab
[www.ptu.ac.in/]
Scanning
DLI, IISc
Centre
Delhi
[www.presidentofindia.nic.in/]
Scanning
DLI, IISc
Centre
Contd
61
License: CC BY-NC-ND
State
Role
Line
Authority
Tamil Nadu
Scanning
DLI, IISc
Centre
(SASTRA), Tanjavur
[www.sastra.edu/]
Sri Chandrasekharendra Saraswathi
Tamil Nadu
Scanning
DLI, IISc
Centre
University), Kanchi
[www.kanchiuniv.ac.in/]
Sri Sringeri Sharada Peetham,
Karnataka
Sringeri
Scanning
DLI, IISc
Centre
[www.sringerisharadapeetham.org]
University of Pune
Maharashtra
[www.unipune.ernet.in/]
Allahabad High Court
Scanning
DLI, IISc
Centre
Uttar Pradesh
[www.allahabadhighcourt.in/]
Source
RMSC
Library
Allahabad
Andhra
Source
RMSC
Pradesh
Library
Hyderabad
Andhra
Source
RMSC
Madanapalli
Pradesh
Library
Hyderabad
Andhra
Source
RMSC
Pradesh
Library
Hyderabad
Andhra
Scanning
RMSC
Pradesh
Centre
Hyderabad
Andhra
Scanning
RMSC
(BRAOU), Hyderabad
Pradesh
Centre
Hyderabad
[www.apts.gov.in/andhrastatearchive
s/]
[www.btcollege.org]
C.P. Brown Library, Kadapa
[www.braou.ac.in/]
Contd
62
License: CC BY-NC-ND
State
Role
Line
Authority
Madhya
Source
RMSC
[http://eklavya.in/]
Pradesh
Library
Hyderabad
Uttar Pradesh
Source
RMSC
Library
Allahabad
Source
RMSC
Library
Allahabad
Source
RMSC
Library
Allahabad
Source
RMSC
Library
Allahabad
Source
RMSC
Library
Allahabad
Andhra
Source
RMSC
Pradesh
Library
Hyderabad
Andhra
Scanning
RMSC
[www.osmania.ac.in/]
Pradesh
Centre
Hyderabad
Andhra
Vendor
RMSC
Hyderabad
Pradesh
Paropakarini Sabha
Andhra
Source
RMSC
[http://paropkarini.org]
Pradesh
Library
Hyderabad
Andhra
Source
RMSC
Pradesh
Library
Hyderabad
Andhra
Scanning
RMSC
(PSTU), Hyderabad
Pradesh
Centre
Hyderabad
Uttar Pradesh
Source
RMSC
Library
Allahabad
Uttar Pradesh
Uttar Pradesh
Uttar Pradesh
Kanpur [www.iitk.ac.in/]
National Academy of Sciences,
Uttar Pradesh
Allahabad
[www.nasi.org.in/]
Oriental Research Institute, Tirupati
Hyderabad
[www.teluguuniversity.ac.in/]
Prayag Sangeet Samiti, Allahabad
[www.prayagsangeetsamiti.in/]
Contd
63
License: CC BY-NC-ND
Line
Authority
State
Role
Andhra
Scanning
RMSC
Hyderabad
Pradesh
Centre
Hyderabad
Andhra
Source
RMSC
Vetapalem
Pradesh
Library
Hyderabad
Andhra
Source
RMSC
Tirupati [www.padmavatiwomen-
Pradesh
Library
Hyderabad
Andhra
Source
RMSC
Rajahmundry
Pradesh
Centre
Hyderabad
Andhra
Vendor
RMSC
Pradesh
[www.salarjungmuseum.in/]
[http://saraswataniketanam.org/]
univ.org/]
Hyderabad
Tirupati [www.tirtumala.org/]
Sri Venkateswara University, Tirupati
Andhra
Source
RMSC
[www.svuniversity.in/]
Pradesh
Library
Hyderabad
Andhra
Scanning
RMSC
Pradesh
Centre
Hyderabad
Tamil Nadu
Source
RMSC
Library
Hyderabad
Vendor
RMSC
Andhra
Hyderabad [www.thrinaina.com/]
Pradesh
Andhra
Scanning
RMSC
Pradesh
Centre
Hyderabad
Uttar Pradesh
Source
RMSC
Library
Allahabad
Hyderabad
Tirupati
[www.tirumala.org/]
University of Allahabad, Allahabad
[www.allduniv.ac.in/]
University of Hyderabad
Andhra
Scanning
RMSC
[www.uohyd.ernet.in/]
Pradesh
Centre
Hyderabad
Contd
64
License: CC BY-NC-ND
State
Role
Line
Authority
Andhra
Source
RMSC
Pradesh
Library
Hyderabad
Andhra
Source
RMSC
Pradesh
Library
Hyderabad
Uttar Pradesh
Source
RMSC
Library
Allahabad
Andhra
Source
RMSC
Pradesh
Library
Hyderabad
West Bengal
Source
RMSC
Library
Kolkata
Source
RMSC
Library
Kolkata
Source
RMSC
Library
Kolkata
Source
RMSC
Library
Kolkata
Source
RMSC
Library
Kolkata
Source
RMSC
Library
Kolkata
Source
RMSC
Library
Kolkata
Source
RMSC
Library
Kolkata
Source
RMSC
Library
Kolkata
Source
RMSC
Library
Kolkata
West Bengal
Statistics, Kolkata
Dr. Meghnad Saha Collection,
West Bengal
Kolkata [www.saha.ac.in/]
Hooghly Mohsin College, Hooghly
West Bengal
[www.hooghlymohsincollege.org/]
Natya Sodh Sansthan, Kolkata
West Bengal
[http://natyashodh.org/]
North Bengal State Library, Cooch
West Bengal
Behar
Raj Bhavan Library, Kolkata
West Bengal
[http://rajbhavankolkata.gov.in;
http://rajbhavankolkata.nic.in]
Ramakrishna Mission Institute of
West Bengal
Culture, Kolkata
[www.sriramakrishna.org/]
State Central Library, Kolkata
West Bengal
Assam
Jorhat [http://jorhat.nic.in/ttc.htm]
Contd
Line
65
License: CC BY-NC-ND
State
Role
Line
Authority
West Bengal
Source
RMSC
Library
Kolkata
Scanning
RMSC
Centre
Kolkata
Scanning
RMSC
Centre
Kolkata
Source
RMSC
Library
Kolkata
Source
RMSC Noida
[www.asiaticsocietycal.com/]
Uttarpara Jayakrishna Public Library
West Bengal
Hooghly
Visva Bharati, Santiniketan
West Bengal
[www.visva-bharati.ac.in/]
West Bengal Legislative Assembly
West Bengal
House, Kolkata
Arya Samaj New Delhi
Delhi
[www.aryasamaj.com/]
Association of Indian Universities,
Library
Delhi
Source
RMSC Noida
Library
Delhi
[http://jnanpith.net/]
Scanning
RMSC Noida
Centre
Rajasthan
Source
RMSC Noida
Library
[www.bits-pilani.ac.in/]
Delhi College of Engineering, Delhi
Delhi
[www.dce.edu/]
Source
RMSC Noida
Library
Delhi
Source
RMSC Noida
Library
Govind Ballabh Pant University of
Uttarakhand
Source
RMSC Noida
Library
Pantnagar [www.gbpuat.ac.in/]
Gurukul Kangri University, Haridwar
Uttarakhand
[www.gkvharidwar.org/]
Indian Agricultural Research Institute
Scanning
RMSC Noida
Centre
Delhi
Source
RMSC Noida
Library
Contd
66
License: CC BY-NC-ND
State
Role
Line
Authority
Delhi
Source
RMSC Noida
Library
[www.iccrindia.org/]
Kumaun University, Nainital
Uttarakhand
[www.kumaununiversity.org/]
Lal Bahadur Shastri National
Source
RMSC Noida
Library
Uttarakhand
Academy of Administration,
Source
RMSC Noida
Library
Mussoorie [www.lbsnaa.ernet.in/]
Nagari Pracharini Sabha, Varanasi
Uttar Pradesh
[http://tempweb34.nic.in/xnagari/]
National Council of Educational
Scanning
RMSC Noida
Centre
Delhi
Source
RMSC Noida
Library
Rajasthan
Source
RMSC Noida
Library
[www.brahmakumaris.com/]
The Energy and Resources Institute
Delhi
Source
RMSC Noida
Library
Delhi
Source
RMSC Noida
Library
License: CC BY-NC-ND
total number of scanners. Obviously, this RMSC has huge digitization capacity,
unparallel to any other RMSCs. This Figure also depicts that amongst two brands
of book scanners, Zeutschel-branded scanners outnumbered Minolta scanners,
i.e., 100 out of 171. Figure 2.16 provides information on availability of book
scanners across Indian states, where Andhra Pradesh stands first (54.4%), Uttar
Pradesh second (15.8%) and Tamil Nadu third (8.2%). Figure 2.16 provides a
mapping of scanning partners across India, which shows many states of India
are left behind in this project until now. This means, Digital Library of India portals
may contain insignificant number of books, published in those non-participating
states of India or in the states official languages.
50
Zeutschel
Minolta
45
40
No. of Scanners
35
30
25
44
20
15
4
10
15
10
10
10
10
7
4
5
2
v
ni
U
ch
Te an
n. av
Pu .Bh
h
as
R
P
IIA
C
ID iv
M Un
ne iv
Pu Un tt
a u
nn M t
A hi
ut
c
an ri M
K
ge
in v.
Sr ni
U
oa
G
CE
K e
A
t
ko
el d
M -Hy
L
C
C Hyd niv
L- t.U
SC Ce n
.
yd
U
O
v.
ni
M
SJ u U
g
lu
Te TRA
S
SA
D
TT -K a
AC oid
D
C C- N
A
D
c
IIS
A
TIII
H
TIII
License: CC BY-NC-ND
Goa 1.2%
Maharashtra
4.1%
Delhi 2.3%
Punjab 1.8%
West Bengal
5.8%
Karnataka
6.4%
Andhra
Pradesh
54.4%
Tamilnadu
8.2%
Uttar Pradesh
15.8%
License: CC BY-NC-ND
70
License: CC BY-NC-ND
The regional mega scanning centres and local scanning centres of DLI are free
to generate their additional resources for the self-sustainability of the project by
collaborating with the state authorities and non-government trusts including
religious trusts and industrial agencies. Some scanning centres, in collaboration
with private vendors, also undertake out-sourced digitization activities from the
overseas clients as a spin-off of the project. DLI project also accepts donations
from individuals and institutions. Donations are used for activities such as
selection and acquisition of books, packing and shipping costs, and quality
assurance of the digitized contents.
Ministry of
Communicati
on and
Information
Technology,
India
Office of the
Principal
Scientific
Advisor,
India
Digital
Library of
India
U.S.
National
Science
Foundation
Other
Sources
(e.g. donations,
state agencies,
religious
trusts)
71
License: CC BY-NC-ND
Status
No.
1.
of
the
Project
Establishment of Digital Library of India
Project
ongoing.
By ERNET India
Installed servers at
13 Nodal centres.
Provided
the
broadband
connectivity
to
nodal centres.
2.
3.
completed.
By IIIT Allahabad
Web enabled
completed.
completed.
Advaita
Sarada
Project
(Digitization
of
Ancient Project
completed.
completed.
and Old Magazines from mid-19th Century to 1960 Scanned data Web
available with Nagari Pracharini Sabha, Varanasi
enabled.
By CDAC Noida
72
License: CC BY-NC-ND
7.
8.
9.
10.
11.
12.
13.
completed.
enabled.
ongoing.
By University of Goa
enabled.
ongoing.
By University of Hyderabad
enabled.
completed.
enabled.
completed.
By University of Pune
enabled.
Project
By CDAC Noida
Books
scanned,
printed
and
completed.
distributed
in
schools.
14.
completed.
15.
completed.
73
License: CC BY-NC-ND
17.
(Project
initiated
By IIIT Hyderabad
October 2007)
on
19th
(Project
Library of India
initiated
on
By IISc Bangalore
October 2007)
30
Other
Projects,
15%
DLI-linked
Projects,
85%
74
License: CC BY-NC-ND
outsourced
digitization
services,
both
from
Indian
and
overseas
75
License: CC BY-NC-ND
Academic
Institutions
(e.g. Anna
University)
Cultural
Institutions
Research
Agencies
(e.g. Salarjung
Museum)
(e.g.
CDAC- Noida)
Digital
Library of
India
Religious
Institutions
Industrial
Agencies
(e.g. Tirumala
Tirupati
Devasthanam)
(e.g. Thrinaina
Informatics
Ltd.)
Government
Agencies
(e.g.
Rashtrapati
Bhavan)
76
License: CC BY-NC-ND
Multimedia Laptop
Laser Printer
Other than books available in DLI portals, some full-text e-books are also made
available locally in its portal in either one of DJVU, PDF and DOC formats. Some
of the books are meant for neo-literates in spreading functional literary and
lifelong learning in remote areas. The locally available e-books are written in
Hindi language, as this initiative is covered in Hindi-speaking states only.
77
License: CC BY-NC-ND
http://mobilelibrary.cdacnoida.in
License: CC BY-NC-ND
79
License: CC BY-NC-ND
The operator creates the structural meta information for the book he / she
scans;
The operator/ meta data entry operator enters the administrative (admin)
meta information for the books scanned on a particular day;
80
License: CC BY-NC-ND
The scanned books undergo image processing and OCR and also quality
control is done by the contractors/supervisors;
The scanned and processed books are copied onto hard disks and DVDs;
The hard disks are brought to the central server at the RMSC or DLI
project site at IISc for Web enablement.
Structured and admin metadata are entered in online/offline metadata entry form.
For this activity, metadata software is used which is produced by DLI national
coordination office. This software automatically generates XML file, incorporating
Dublin core metadata elements, for each scanned book.
During scanning process, the books are scanned page-wise, one or two pages at
a single exposure. The image of each scanned page is captured in a TIFF file,
where TIFF stands for Tagged Image File Format a loss-less uncompressed
image file format, widely used in digitization work. Original TIFF (OTIFF) file of
each page is preserved for future use. Then image of each page of a book is
processed or edited at the scanning centre to comply with quality control policy of
DLI. The edited image file is then saved in another TIFF file, called processed
TIFF (PTIFF). Scanned images of books written in European scripts such as in
English, French, German, and Spanish, are usually converted into editable text
formats using optical character recognition (OCR) software ABBYY FineReader.
This software delivers superior accuracy (sometimes 99%) in converting any undistorted image file into texts. Output files from the OCR process are .HTML
(simple Web page), .TXT (simple Text file), and .RTF (Rich Text Format).
The process of optical character recognition (OCR) is depicted in Figure 2.27,
where the OCR engine corrects image from any distortion; segments texts into
paragraphs, lines, words and characters; extract features such as character
height, width, horizontal lines, vertical lines, and slope lines; classifies characters
based on SVM (support vector machine) algorithm; performs Unicode mapping;
and finally recognizes texts in the page. The latest version of OCR software
81
License: CC BY-NC-ND
82
License: CC BY-NC-ND
Missing pages
Blank Pages
Automatable Parameters:
Number of files
Table 2.3 provides a list of major quality parameters for processed TIFF images
(Pratha, Ambati & Sankar, 2005). DLI upholds these parameters as a baseline
for maintaining uniformity within the project. These parameters are easily
achievable utilizing existing of hardware, software and manpower supports
available at any DLI scanning centre.
Table 2.3: Major Quality Parameters for processed TIFF images
Name of Parameter
Value
Dimensions
600 or above
Compression algorithm
Margin
Skew
Less than 2o
Blank pages
83
License: CC BY-NC-ND
stages of quality control. Then final digital objects are produced within an
acceptable quality.
Physical Documents
Pre-Scanning Process
Scanning Process
Digitized Documents
Fig. 2.22: DLI Process Workflow
Books Procured
from Source Library to Scanning Centre
Yes
Verify Duplication
Reject
No
Assign Barcode
84
License: CC BY-NC-ND
Fig. 2.24: Book Movement within the Source Library and Scanning Centre
85
License: CC BY-NC-ND
License: CC BY-NC-ND
87
License: CC BY-NC-ND
Book Folder
(123236_Nationalism)
Metadata
meta.xml
OTIFF
PTIFF
HTML
00000001.TIF
00000001.TIF
00000001.HTM
00000001.TXT
00000001.RTF
00000002.TIF
00000002.TIF
:
:
:
:
:
00000241.TIF
00000002.HTM
:
:
:
:
:
00000241.HTM
00000002.TXT
:
:
:
:
:
00000241.TXT
00000002.RTF
:
:
:
:
:
00000241.RTF
:
:
:
:
:
00000241.TIF
TXT
RTF
88
License: CC BY-NC-ND
compensate for page curvature and can eliminate centerline shadows, outside
frames, even fingers holding the outside edges of pages. For books with fragile
bindings, scanners book table bends lower in the centre or on either side to
accommodate the spine. These book scanners also have large format scanning
area scanning area is big enough to accommodate oversized bound volumes,
manuscripts, archival records, artworks, cartographic and legal documents with
ease. These are meant to be easy for anyone to use and seem to offer features
to help those who are less skilled. These book scanners are fast, safe, and
simple to use. The book scanners also provide high image quality - clean images
with 600 dpi or more resolution.
Figure 2.29 provides a sketch of manual book scanner, specifically OmniScan
Zeutschel book scanner that captures images using a digital camera and wideangled light sources. Figures 2.30 and 2.31 provide images of different brands of
book scanners used in DLI project.
Figure 2.30 also shows a workstation, where book scanner is connected. In this
workstation scanning software such as ABBYY FineReader or OmniScan is
installed. The book scanner produces high quality images, and scanning
software processes those images and stores in hard disk of that workstation.
89
License: CC BY-NC-ND
License: CC BY-NC-ND
91
License: CC BY-NC-ND
92
License: CC BY-NC-ND
DLI project has developed several other utility software tools for the convenience
of system administrators in the project sites. These software tools are: (i) Server
Management Tool, which provides server-site applications and maintains its
services; and (ii) Workflow Management Software, which is monitoring and
evaluation tool to determine any bottleneck or any deviation from targets and
deadlines.
A glimpse of software tools developed, customized and used is shown in Figure
2.32. OM i-Trans Editor, a transliteration tool for Indian languages, is an example
of many useful software tools developed in this project. This software is shown in
Figure 2.33.
Metadata Software
OM Transliteration Software
Software Tools
Developed,
Customized
and Used
93
License: CC BY-NC-ND
94
License: CC BY-NC-ND
95
License: CC BY-NC-ND
screen. Figure 2.37 shows tagged metadata displayed in XML file for the book
titled Nationalism, written by Rabindranath Tagore.
In the scanning centre, some metadata fields are not filled up due to many
reasons. These unspecified metadata elements are also displayed blank on
screen. But, key metadata elements are always filled up and checked during
different phases of quality control.
In Digital Library of India portals, some metadata elements are searchable
whereas some others are not searchable. Searchable metadata elements are:
Name of Author
Subject
Year of Publishing
Language, and
Search result produces a list of books matching the search criteria, where
following metadata are displayed:
Title of Book
Name of Author
Subject
Language
The Advanced Search Interface is available in the DLI homepage (Figure 2.38).
This interface also specifies the searchable metadata elements. In advanced
search, combination of two or more elements to narrow the search is possible.
96
License: CC BY-NC-ND
Structural
Metadata
Admin
Metadata
Language
Title
Title
Title
Author
Author
Creator
(Author/
Editor)
Subject
Subject
Language
Language
Keyword
Barcode
Barcode
Subject
Year
Year
Publisher
Date of
Publishing
Format
Publisher
Vendor
Document
Type
Scanning
Centre
Format
Scanning
Location
Identifier
Source
Source
Library
Copyright
Date
Digital RePublisher
Scanning
Centre
Barcode
Digital
Publication
Date
Scanning
Number
Numbered
Pages
Digital RePublisher
Un-Numbered
Pages
Digital
Total Pages
Publication
Date
Fig. 2.34: Metadata Used in Digital Library of India
97
License: CC BY-NC-ND
Whether
Example &
Metadata
Mandatory
Help
Field
Language
Yes
Title
Yes
Creator/Author
Yes
Keywords
No
Description
Subject
Yes
Publisher
Yes
Contributor
No
Date
Yes
YYYY/MM/DD format
E.g., only Year is known, e.g. 1923/00/00
Both Year and Month are known, e.g. 1924/03/00
All three entries are known, e.g. 1926/11/26
Document
Type
Yes
98
License: CC BY-NC-ND
Format
No
TIFF
Identifier
No
URL http://www.nameofsite.org/
ISBN 0304701564
Accession No. DLI334455 (when DLIB India Number is
assigned)
Source
Yes
Relation
No
Coverage
No
Rights
No
Copyright
No
Date
copyright restriction.
Scanning
Yes
Centre
Scanner
No
Number
Digital
Yes
Republisher
Digital
Publication
Yes
Date
99
License: CC BY-NC-ND
100
License: CC BY-NC-ND
Fig. 2.37: Tagged Metadata Displayed in XML Format for the Same Book
101
License: CC BY-NC-ND
102
License: CC BY-NC-ND
Electrical,
Engineering,
General,
Geography,
Health,
Law,
Literature,
License: CC BY-NC-ND
104
License: CC BY-NC-ND
By Author: User can enter full name, surname, first name or partial name of the
author of the book for searching a book. For this field Unicode search is enabled,
similar to Title search.
By Subject:
User can select a subject from given list of the subjects (e.g.,
Advanced Search
In the DLI Homepage advanced search interface is available where user can
search by more than one search parameters at a time. Users utilize advanced
search option to refine a search, i.e., to avoid a large set of data being retrieved
using only a single parameter. When users enters two search parameters in the
advanced search interface, DLI system automatically takes an AND Boolean
105
License: CC BY-NC-ND
Operand for each added parameter. For example, when physics is given in the
subject field, information on all books pertaining to that particular subject is
appeared in search result. For further narrowing the search result, another
parameter can be given in the same search interface such as Bengali language.
Here DLI system interprets it as Subject=Physics AND Language=Bengali, i.e.
physics books in Bengali language to be appeared in the search result. Similarly,
few other parameters can also be added for further narrow down a search result,
e.g. a year (1950), or a range of years (1900-1950), or a scanning centre (CDAC
Kolkata).
Figure 2.41 illustrates how two search parameters are being entered in the
advanced search interface. Figure 2.42 shows a shaded area where Boolean
operand AND finds books matching both parameters.
Similarly, Figure 2.43 illustrates how three search parameters are being entered
in the same advanced search interface. Figure 2.44 shows a shaded area where
Boolean operand AND finds books matching all parameters.
Figure 2.45 illustrates advanced search interface available with DLI Portal of
RMSC Hyderabad. This Figure also indicates that more searchable metadata
elements are available with this portal. DLI main portal provides six searchable
metadata fields, i.e., Title, Author, Subject, Year, Language and Scanning
Centre. Additional searchable fields available with DLI portal in IIIT Hyderabad
are: Vendor, Source Library and Scanning Location.
In advanced search interface, Unicode-based Indian language texts can be
entered only in author and title metadata fields. Other metadata fields are
enabled in English only, thus, do not accept Indian language search strings. If an
end user does not have Indian language typing facility or skill, he can always use
an Indian language transliteration tool for getting Unicode-based Indian language
text inputs, as illustrated in Figures 2.33 and 2.46.
106
License: CC BY-NC-ND
107
License: CC BY-NC-ND
108
License: CC BY-NC-ND
License: CC BY-NC-ND
110
License: CC BY-NC-ND
License: CC BY-NC-ND
Fig. 2.48: Search Result Displayed in Unicode in DLI Portal of RMSC Hyderabad
112
License: CC BY-NC-ND
113
License: CC BY-NC-ND
Fig. 2.51: Security Check for Reading Online in DLI Portal of RMSC
Hyderabad
114
License: CC BY-NC-ND
115
License: CC BY-NC-ND
116
License: CC BY-NC-ND
Language
Number of Books
Number of Pages
English
79,040
26,477,207
Telugu
14,090
2,812,929
Hindi
13,266
3,570,438
Sanskrit
4,854
1,820,761
Urdu
3,811
921,095
Tamil
1,617
411,723
Portuguese
1,220
138,021
Persian
1,186
305,769
Arabic
617
198,552
10
Bengali
554
338,213
Others
3,356
1094,954
Total
123,611
38,089,662
Others, 2.7%
Persian, 1.0%
English
Telugu
Portuguese, 1.0%
Hindi
Bengali, 0.4%
Sanskrit
Arabic, 0.5%
Urdu
Tamil, 1.3%
Tamil
Urdu, 3.1%
Arabic
Sanskrit, 3.9%
Bengali
Hindi, 10.7%
Portuguese
Persian
Others
English, 63.9%
Telugu, 11.4%
117
License: CC BY-NC-ND
Others, 2.9%
English
Persian, 0.8%
Telugu
Portuguese, 0.4%
Hindi
Bengali, 0.9%
Sanskrit
Arabic, 0.5%
Urdu
Tamil, 1.1%
Tamil
Urdu, 2.4%
Arabic
Bengali
Sanskrit, 4.8%
Portuguese
Hindi, 9.4%
Persian
Telugu, 7.4%
Others
English, 69.5%
English
Others, 0.3%
Telugu
Hindi, 2.1%
Urdu
Kannada, 2.4%
Arabic
Tamil, 2.5%
Persian
Persian, 4.3%
Tamil
Arabic, 6.7%
Kannada
Hindi
Sanskrit
M arathi
Urdu, 8.7%
Others
Telugu, 16.2%
English, 54.3%
118
License: CC BY-NC-ND
Marathi, 0.7%
Sanskrit, 1.9%
Others, 0.3%
Hindi, 0.6%
English
Telugu
Kannada, 0.9%
Urdu
Tamil, 1.3%
Arabic
Persian, 3.7%
Persian
Arabic, 7.3%
Tamil
Kannada
Hindi
Urdu, 7.5%
Sanskrit
Marathi
Others
Telugu, 7.7%
English, 68.2%
119
License: CC BY-NC-ND
Although IIIT Hyderabad hosts another DLI portal, it is still a contributing partner
to the main DLI portal. Some scanning centres operating under this regional
mega scanning centre such as City Central Library in Hyderabad, Tirumala
Tirupati Devasthanams in Tirupati, State Central Library in Hyderabad, Salarjung
Museum in Hyderabad, Osmania University in Hyderabad and Sri Venkateswara
Oriental Research Institute in Tirupati are also contributing partners to the main
DLI portal. On the other hand, regional mega scanning centre in CDAC Noida
hosts another DLI portal, but it is not a contributing partner to the main DLI portal.
Two scanning centres operating under regional mega scanning centre at CDAC
Noida, viz. Rashtrapati Bhavan and Bharatiya Jnanpith, are also contributing
content partners of main DLI portal. Figure 2.56 depicts the scanning-centre-wise
distribution of digitized books in main DLI portal.
CDAC Kolkata hosts regional mega scanning centre for eastern region. It does
not have its own DLI portal. It provides digitized contents to main DLI portal. The
search interface of DLI portal provides searchability within the collection of a
scanning centre. Figure 2.57 depicts language-wise distribution of books,
digitized under regional mega scanning centre in Kolkata as on 30th June 2008.
This Figure indicates that English books have been predominantly digitized in
this regional mega scanning centre (70.34%), followed by Bengali books
(12.54%).
Almost all Bengali books available in main DLI portal have been
120
License: CC BY-NC-ND
in the ranked list. State Central Library in Hyderabad stands second and City
Central Library in Hyderabad third in the list. Figure 2.59 depicts the scanningcentre-wise distribution of digitized books in DLI portal hosted by regional mega
scanning centre in Hyderabad.
In regional mega scanning centre at IIIT Hyderabad, three scanning vendors
provide digitization services to the DLI project. The vendors are responsible for
execution of scanning work in mutually agreed timeframe without compromising
quality of work. These vendors represent DLI partners from the small and
medium sized enterprise (SME) segment.
Other regional mega scanning centres and scanning centres across India also
utilize digitization services from the third party vendors. But DLI portal hosted in
Hyderabad only provides access point to search by the name of the vendor.
Table 2.8 provides a ranked list of vendors operated under regional mega
scanning centre at IIIT Hyderabad, in terms of contributed digitized books in DLI
portal hosted by IIIT Hyderabad. This Table shows that Par Informatics Limited, a
Hyderabad-based vendor is the major contributor to digitized contents into DLI
portal and stands first in the ranked list. Thrinaina Informatics Limited in
Hyderabad stands second and SV Infosys in Tirupati third in the list. SV Infosys
is affiliated to Tirumala Tirupati Devasthanams in Tirupati. Figure 2.60 depicts the
vendor-wise distribution of digitized books in DLI portal hosted by regional mega
scanning centre in Hyderabad.
Scanning Centre
Number
Number of
Number of
of
Pages
Pages In
Books
Millions
IIIT Allahabad
37,633 10,669,514
10.67
12,952
2,762,568
2.763
IIIT Hyderabad
11,759
5,121,265
5.121
121
License: CC BY-NC-ND
11,705
3,019,386
3.019
AKCE, Virudhunagar
7095
1,036,014
1.036
6479
2,281,804
2.282
6128
2,339,384
2.339
5779
1,975,659
1.976
4832
2,023,364
2.023
Delhi
10
IISc Bangalore
4418
1,620,729
1.621
11
CDAC Kolkata
4353
2,056,937
2.057
12
SASTRA, Thanjavur
1872
577,050
0.577
13
University of Pune
1686
497,074
0.497
14
1568
560,577
0.561
1297
394,001
0.394
1214
134,481
0.134
Bangalore
15
16
Goa University
17
958
306,439
0.306
18
495
216,271
0.216
492
160,185
0.16
20
292
92,570
0.093
21
247
122,904
0.123
Others
357
121,486
Total
123,611 38,089,662
38.089
122
License: CC BY-NC-ND
37,633
IIIT Allahabad
12,952
CCL Hyderabad
IIIT Hyderabad
11,759
11,705
7,095
6,479
SCL Hyderabad
6,128
Salarjung Museum
5,779
Osmania University
4,832
4,418
IISc Bangalore
4,353
CDAC Kolkata
SASTRA Deemed Univ. Thanjavur
1,872
Pune University
1,686
IIAP Bangalore
1,568
ASR Melkote
1,297
Goa University
1,214
958
495
492
Others
357
292
247
Anna University
0
5,000
123
License: CC BY-NC-ND
Bengali,
12.54%
Others,
17.11%
English,
70.34%
1200
1,173
1,197
1100
No. of Books
1,036
1000
900
800
784
700
650
600
Literature
Religion
Science
Geography
History
124
License: CC BY-NC-ND
27,893
8,993,360
8.993
4,489
1,463,897
1.464
1,436
412,900
0.413
1,318
288,429
0.288
14
6,567
0.007
Others
25,396
8,009,367
8.009
Total
60,546
19,174,520
19.175
As on 1 July 2008
HCU
14
TTD
1,318
CCL
1,436
4,489
SCL
25,396
Others
27,893
IIITH
5000
10000
15000
20000
25000
30000
No. of Books
License: CC BY-NC-ND
Name of Vendor
Number
Number of
of Books
Pages
19,814
6,394,118
9,885
3,709,802
5,904
1,085,890
Pradesh (PAR)
2
5,904
SVI
9,885
TIL
19,814
PAR
2000
4000
6000
8000
10000
12000
14000
16000
18000
20000
No. of Books
126
License: CC BY-NC-ND
127
License: CC BY-NC-ND
Subject
Number of
Books
Number of
Pages
No of Pages In
Millions
(approx)
Literature
27,277
9,066,027
9.066
Science
12,068
4,041,056
4.041
History
7,158
2,381,669
2.382
Religion
5,865
1,981,556
1.982
Geography
5,002
1,535,218
1.535
Philosophy
4,038
1,359,908
1.360
Psychology
3,377
1,186,654
1.187
General
1,855
740,196
0.740
Art
771
220,152
0.220
10
Engineering
676
104,491
0.104
11
Chemistry
365
104,640
0.105
12
Law
313
164,227
0.164
13
Poetry
313
50,637
0.051
14
Economics
289
91,557
0.092
15
Mathematics
245
78,030
0.078
16
Health
244
31,068
0.031
17
Electrical
205
31,202
0.031
18
Astronomy
203
80,802
0.081
19
Education
172
23,251
0.023
20
Biology
126
44,435
0.044
21
Music
126
36,093
0.036
22
Medical
121
30,471
0.030
23
Astrophysics
103
38,613
0.039
24
Politics
87
30,882
0.031
25
Sociology
86
27,136
0.027
26
Marketing
80
3,324
0.003
27
Statistics
73
16,021
0.016
128
License: CC BY-NC-ND
28
Devotional
35
5,759
0.006
29
Autobiography
26
9,900
0.010
30
Architecture
24
9,274
0.009
31
Zoology
12
3,655
0.004
32
Commerce
2,510
0.003
33
Mythology
887
0.001
34
Dance
515
0.001
35
Unknown
513
0.001
36
No Subject
Metadata Available
Total
52,320
14,565,342
14.565
123,675
38,097,671
38.098
27,277
Literature
12,068
Science
7,158
History
5,865
Religion
5,002
Geography
4,038
Philosophy
Psychology
3,377
Other Subjects
3,268
1,855
General
Art
771
Engineering
676
52,320
No Subject Metadata
0
10,000
20,000
30,000
40,000
50,000
60,000
License: CC BY-NC-ND
Subject
Number of
Books
Number of
Pages
No of Pages in
Million
(approx)
Literature
3,291
889,463
0.889
History
3,276
1,749,435
1.749
Religion
1,962
477,262
0.477
General
1,452
454,085
0.454
Education
356
155,335
0.155
Philosophy
349
100,875
0.101
Art
337
106,995
0.107
Poetry
295
79,405
0.079
Chemistry
272
98,052
0.098
10
Sociology
232
84,780
0.085
11
Economics
216
71,085
0.071
12
Mathematics
244
62,842
0.063
13
Psychology
193
72,501
0.073
14
Biology
139
53,669
0.054
15
Science
133
40,750
0.041
16
Politics
126
47,364
0.047
17
Geography
112
30,564
0.031
18
Music
107
34,071
0.034
19
Medical
80
40,766
0.044
20
Law
74
29,996
0.041
21
Engineering
64
21,992
0.022
22
Astronomy
48
19,344
0.019
23
Marketing
38
12,411
0.012
24
Zoology
35
13,720
0.014
25
Architecture
31
12,012
0.012
26
Mythology
25
11,679
0.012
130
License: CC BY-NC-ND
27
Autobiography
18
3,727
0.004
28
Statistics
16
4,381
0.004
Unknown
2,187
670,697
0.671
44,999
13,736,422
13.736
60,707
19,185,680
19.186
No Subject
Metadata Available
Total
Sociology
232
Chemistry
272
Poetry
295
Art
337
Philosophy
349
Education
356
1,452
General
1,962
Religion
3,276
History
3,291
Literature
0
500
1,000
1,500
2,000
2,500
3,000
3,500
131
License: CC BY-NC-ND
132
License: CC BY-NC-ND
The data servers in DLI site have series of servers to store the digital images of
pages from scanned books. Images in TIFF format require a considerable large
storage space, because these are uncompressed loss-less image files. As DLI
maintains other file formats such as TXT (text), RFT (formatted text) and HTML
(webpage), data servers are integrated for storage of all supported file formats.
The hyperlinks from metadata servers to data servers should be routinely verified
in regular interval to address the problem of hyperlink failures, i.e., reducing
page not found notices.
Digital Library of India portals try to address Web accessibility issue, particularly
making DLI portals usable by people of all abilities and disabilities. A new
architecture is proposed to make contents of the portals audible to the visually
impaired persons, by introducing a parallel Text-to-Speech (TTS) server. Visual
impairments include blindness, low vision, poor eyesight and color blindness.
TTS refers to ability of the computer to convert text into spoken words. The TTS
server, with the help of TTS software tools, will store and disseminate converted
spoken words from DLI document repositories to the visually impaired end users.
Figure 2.64 provides a simple data flow diagram for a DLI portal integrated with
Text-to-Speech (TTS) System.
So far DLI network architecture has proved to be as an efficient model, with selfhealing mechanisms to address the network failure. If any of the nodes of DLI
network temporarily breaks down, this architecture allows DLI portal operational
without reducing any efficiency (Ambati & Reddy, 2005).
All DLI portals maintain similar kind of architecture, only number of data servers
varies depending upon the collection size.
133
License: CC BY-NC-ND
Fig. 2.64: Data Flow in a DLI Portal integrated with Text to Speech (TTS)
System
134
License: CC BY-NC-ND
Makes
available
the
classic
literature
originated
from
Indian
Inculcates reading habits to school children and young adults, who have
access to and can interact with digital world in a meaningful way;
Makes locating the relevant information inside of books far more reliable
and much easier;
135
License: CC BY-NC-ND
Supplements
to the
Collections
of Small
Libraries
Supplements
to the Formal
Education
System
Supplements
to the
Lifelong
Learning
System
Digital
Library of
India
Inculcates
Reading
Habits to
Children and
Young Adults
Makes
available the
out-of-print, &
out-ofcirculation
books
Makes
available the
Indian
Classic
Literature
Makes
available uncopyrighted &
out-ofcopyright
books
136
License: CC BY-NC-ND
137
License: CC BY-NC-ND
OCR
(Optical
Character
Recognition)
Software
Indian
Language
Book
Editor
TTS
(Text-toSpeech)
System
DLI
Research
Partners
Indian
Language
Search
Engine
Transliteration
Editor
Multilingual
Machine
Translator
138
License: CC BY-NC-ND
139
License: CC BY-NC-ND
Retrieved
15
May
2008
from
www.valaconf.org.au/vala2006/papers2006/93_Balakrishnan_Final.pdf
Balakrishnan, N., Reddy, R., Ganapathiraju, M. & Ambati, V. (2006). Digital
Library of India: a testbed for Indian language research. TCDL Bulletin, 3(1).
Covey, D.T. (2005). Global cooperation for global access: the Million Book
Project.
Retrieved
15
May
2008
from
www.library.cmu.edu/People/troll/TrollCoveyBelgiumFinal2r.ppt
Ganapathiraju, M., Balakrishnan M., Balakrishnan N. & Reddy R. (2005). Om:
One tool for many (Indian) languages. Journal of Zhejiang University Science,
6A(11),1348--1353.
Michalek, G.V., ed. (2003). Million Book Universal Library Project: manual for
metadata capture, digitization, and OCR. Carnegie: Carnegie Mellon
140
License: CC BY-NC-ND
University.
Retrieved
15
May
2008
from
www.library.cmu.edu/Libraries/MillionBookManual.pdf
Prahallad, K. & Black, A. (2005). A text to speech interface for Universal Digital
Library. Journal of Zhejiang University Science, 6A(11), 1229-1234.
Prahallad, L., Prahallad, K. & Ganapathiraju, M. (2005). A simple approach for
building transliteration editors for Indian languages. Journal of Zhejiang
University Science, 6A(11),1354-1361.
Pratha, L., Ambati, V. & Sankar, P. (2005). Content development for Digital
Library
of
India.
Retrieved
15
May
2008
from
www.ulib.org/conference/genpub/dli.pdf
Seethalakshmi, R. et al. (2005). Optical character recognition for printed Tamil
text using Unicode. Journal of Zhejiang University Science, 6A(11), 12971305.
Shukla, V.N., Arora, K.K. & Gugnani, V. (2004). Digital library: language centered
research, test beds and applications. Retrieved 15 May 2008 from
www.cdacnoida.in/technicalpapers/PaperICDL1.pdf
141
License: CC BY-NC-ND
CHAPTER 3
Digitization Work for Theses and Dissertations
3.1 Introduction
India is an emerging knowledge economy, where an array of specialized
research institutions, advanced research centres and universities exist in almost
all major subject areas. A country worldwide reputed for its IT industries, India is
now hosting corporate R&D centres of some major multinational enterprises
owing to its global reputation for academic and research excellence. The
accreditation and granting agencies for Indian institutions envisaged this
phenomenal growth in knowledge-based industries and service industries in
India. Thus, these agencies created a structured higher education and research
environment in the country in phases at par with global standard. Over the last
few decades the Government strengthened ICT and research infrastructure in
reputed Indian institutions across the country, where access to knowledge
resources and laboratory facilities is made adequate for the students,
researchers and faculty members. The government research agencies
introduced research fellowship schemes for pursuing research degrees to attract
meritorious and talented students into research and teaching professions and for
reversing the brain-drain in developed countries. The establishment of open
access digital archives of scholarly literature, produced in Indian universities and
institutions, is also a kind of ICT intervention that is envisaged to portray Indias
intellectual capabilities and to ascertain quality of research (Ghosh & Das, 2007).
Electronic theses and dissertations (ETD) initiatives and ETD policies in India lie
with Indias international reputation and to streamline into a systematized and
focused research system to generate more social benefits and applications.
141
License: CC BY-NC-ND
142
License: CC BY-NC-ND
Responsible
Agency
CSIR
Amount in
Rupees
(Crore)
870.00
US
Dollar
(Million)
206.41
CSIR
50.00
11.86
CSIR
14.00
3.32
CSIR
16.00
3.80
UGC
1500.00
355.86
UGC
150.00
35.87
Interdisciplinary Areas
Teaching Assistantship for Doctoral Students
(Non Fellowship)
143
License: CC BY-NC-ND
144
License: CC BY-NC-ND
145
License: CC BY-NC-ND
implement INTED and decentralized ETD repositories across the country. This
way the National Knowledge Commissions recommendation on peer-reviewed
research papers resulting from publicly funded research would be validated by
making them available through open access channels (NKC, 2006).
3.1.2.2 INDEST-AICTE Consortium An Open Access Advocacy Group
The Indian National Digital Library in Engineering Sciences and Technology
(INDEST) Consortium is an innovative initiative supported by the Ministry of
Human Resource Development (MHRD) and All India Council for Technical
Education (AICTE). Indian Institute of Technology Delhi (IITD) is the coordinating
agency for this consortium. The INDEST-AICTE Consortium tries to address the
problems of information poverty in the country and more particularly in its
member technical institutions through highly discounted rates of subscription and
better terms of agreement with the publishers of electronic resources such as
full-text e-journals and scholarly databases. The Consortium maintains three tiers
of membership where total membership strength is 741 as on 27 July 2008.
Categories of membership are shown below:
Self-supported
Institutions
(AICTE-accredited
and
UGC-affiliated
institutions) (183)
146
License: CC BY-NC-ND
Core
Members
(Supported
by MHRD)
Self-
INDEST-
supported
AICTE
Institutions
Consortium
AICTEsupported
Members
(Government
Institutions)
Selfsupported
Members
under New
Scheme
147
License: CC BY-NC-ND
The core member institutions of INDEST Consortium are globally reputed for
their academic and research excellence. These institutions have ICT
infrastructure and an access to e-resources at par with the best institutions of the
world. Some of these institutions have already established open access
repositories to make their research literature available globally. INDEST
Consortium holds annual meetings to discuss the policies and operational issues.
In its 2003 Annual Meeting, INDEST core members proposed to establish an
INDEST Digital Theses Archive, where Indian Institutes of Technology and Indian
Institute of Science will be major stakeholders. The proposed Digital Theses
Archive will have following characteristics (INDEST Consortium, 2003;
Venkadesan & Rajashekar, 2003):
Full-text database of the masters and doctoral theses from IITs and IISc
The INDEST will have following roles and tasks as the central coordination
agency for national ETD initiative (INDEST Consortium, 2003; Venkadesan &
Rajashekar, 2003):
- coordinating and cooperating with member institutions on organizational,
technological and educational issues and for developments;
- fund raising, supporting and conceptualizing of decentralized ETD structure,
special interest groups and working groups in which representatives from
different institutions can participate;
- organizing workshops for participating member institutions covering general and
special topics on ETD formats and archiving methods; and
- liaison with other Indian initiatives such as Vidyanidhi and international
initiatives such as NDLTD.
148
License: CC BY-NC-ND
149
License: CC BY-NC-ND
150
License: CC BY-NC-ND
foreign students approved by the foreign universities are also acquired through
this scheme. NASSDOC has planned to digitize and microfilm its theses
collection to address space problem and to device a modern retrieval mechanism
of social sciences information. The integration of its digitized ETD collection into
Vidyanidhi is a right direction to diffuse Indian social science research findings to
worldwide audience. But NASSDOC should keep in mind that Vidyanidhi may be
a platform to start with, but NASSDOC should establish an alternative dedicated
server for any long-term preservation and access.
151
License: CC BY-NC-ND
152
License: CC BY-NC-ND
Table 3.2 provides an indicative list of open access repositories in India having
ETD collections. These ETD repositories provide worldwide access to results of
Indian research. Thus, these repositories are listed in one or many international
directories/harvesters of open access repositories such as OpenDOAR (Directory
of Open Access Repositories), OAIster, ROAR (Registry of Open Access
Repositories), Google Scholar and CASSIR (Cross Archive Search Service for
Indian Repositories).
Table 3.3 provides a quantitative list of collections in different open access
repositories, indicating the number of ETD available in each repository vis--vis
other scholarly items. This Table also provides statistical information as on two
dates with 14 months gap. This time gap also helps us to know growth rate of
ETD collections in the listed repositories. This Table indicates that Vidyanidhi
stores maximum number of full-text e-theses followed by DSpace@TIET,
ETD@IISc and DSpace@NCL as on 15 May 2007. This Table further shows that
Vidyanidhi
stores
maximum
number
of
full-text
e-theses
followed
by
institutional
repository
th
of
National
Chemical
Laboratory
changes in institutional policy. Table 3.3 identifies that top three open access
digital archives are showing a steady rate of growth, i.e., 31.97% , 60.23% and
29.69% respectively in terms of total number of full-text e-theses. This Table also
indicates that Indian repositories experiencing more than 100 percent growth are
RRI Digital Repository, OpenMED@NIC, and DSpace@NITR.
Figure 3.3 graphically depicts ETD collections in Indian open access repositories,
based on Table 3.3. This Figure also compares ETD collections vis--vis other
items available in Indian open access archives. Amongst all open access
archives in India, Vidyanidhi has maximum number of full-text items followed by
RRI Digital Repository, and Open Access Repository of IIAP.
153
License: CC BY-NC-ND
Open Access
ETD
Repository
National Level
ETD Repository
(e.g. Vidyanidhi)
National Level
Subject
Repository,
having an ETD
Collection
(e.g. OpenMed)
Institutional
ETD Repository
(e.g. ETD@IISc)
Institutional
Repository,
having an ETD
Collection (e.g.
DSpace@TIET)
154
License: CC BY-NC-ND
Name of
URL
Institution
Vidyanidhi
Dspace@TIET
ETD@IISc
Software
Used
University of
http://dspace.vidyanidhi.org.in DSpace
Mysore
:8080/dspace/
Thapar
http://dspace.tiet.ac.in:8080/d
University
space/
Indian Institute
http://etd.ncsi.iisc.ernet.in/
DSpace
National
http://dspace.ncl.res.in/dspac
DSpace
Chemical
e/
DSpace
of Science
DSpace@NCL
Laboratory
Open Access Repository of
Indian Institute
http://prints.iiap.res.in/
DSpace
IIAP
of Astrophysics
Indian Institute
http://eprint.iitd.ac.in/dspace/
DSpace
National Institute
http://dspace.nitrkl.ac.in/dspa
DSpace
of Technology
ce/
of Technology
Delhi
DSpace@NITR
Rourkela
OpenMED@NIC
National
http://openmed.nic.in
EPrints
https://drtc.isibang.ac.in/
DSpace
Indian Institute
of Management
Kozhikode
http://dspace.iimk.ac.in/
DSpace
Raman
Research
Institute
http://dspace.rri.res.in/dspace
/
DSpace
DSpace at CUSAT
Cochin
University of
Science &
Technology
http://dspace.cusat.ac.in/
DSpace
Informatics
Centre
Librarians Digital Library
Documentation
Research and
Training Centre
IIMK's Scholarship
Repository
155
License: CC BY-NC-ND
ETD as
ETD as
Other
Other
Total
ETD
Repository
on
on
Items as
Items as
Collection
Growth
21/07/08
15/05/07
on
on
as on
Rate (%)
21/07/08
15/05/07
21/07/08
Vidyanidhi
5478
4151
5478
31.97
Dspace@TIET
415
259
59
33
474
60.23
ETD@IISc
297
229
297
29.69
DSpace@NCL
208
149
-100.00
Open Access
85
83
2590
1389
2675
2.41
30
30
2141
2106
2171
0.00
DSpace@NITR
36
13
635
388
671
176.92
OpenMED@NIC
51
2095
1491
2146
537.50
Librarians Digital
376
364
381
66.67
395
256
397
0.00
95
3383
2191
3478
9400.00
39
1383
1422
Repository of IIAP
Eprints & ETD@IIT
Delhi
Library
IIMK's Scholarship
Repository
RRI Digital
Repository
DSpace at CUSAT
156
License: CC BY-NC-ND
6000
5478
5000
4000
3000
2000
1000
415
297
0
85
30
36
51
95
Vi
dy
an
D
id
sp
hi
ac
e@
TI
ET
O
ET
pe
D
n
@
Ac
D
IIS
Sp
ce
c
ss
ac
e@
R
ep
NC
Ep
os
L
rin
ito
ts
ry
&
of
ET
IIA
D
P
@
IIT
D
D
el
Sp
hi
ac
e@
O
NI
Li
pe
TR
br
n
M
a
IIM
ria
ED
ns
K
@
's
D
N
Sc
IC
i
g
ho
ita
la
lL
rs
ib
hi
ra
p
R
ry
R
R
ep
ID
os
ig
ito
ita
ry
lR
ep
os
ito
ry
ETD as on 15/05/2007
ETD as on 21/07/08
157
License: CC BY-NC-ND
158
License: CC BY-NC-ND
Full-text
E-Thesis
Introduction
Chapter 1
Metadata
of Thesis
Chapter 2
Conclusion
In the following sections of this chapter, three most significant ETD initiatives,
namely, Vidyanidhi Digital Library, ETD@IISc and OpenMED@NIC, are
described along with an analysis of their impact on the research communication,
international visibility and availability of theses literature. These initiatives are
replicable and scalable. Other higher educational institutions in the country can
either collaborate with these two national ETD initiatives, or establish an
159
License: CC BY-NC-ND
160
License: CC BY-NC-ND
Vidyanidhi
Digital Library and E-Scholarship Portal
Objectives
Vision
To evolve as an
Digitally-enabled
scholarship
promoting knowledge,
creativity and
scholarship through
digital libraries.
online archive/
infrastructure and an
repository of doctoral
enabling framework
theses and
for strengthening
dissertations
research capacities
submitted to Indian
in Indian
universities by forming
universities.
a consortium of select
organisational model
To build a digital
archive of Indian
doctoral theses.
information
To develop an
Mission
Goals
Indian universities.
Advocacy and
and a technical
promotion of digitally-
mechanism for
enabled scholarship
creation,
through meetings,
submission,
liaison, conferences
archiving and
and workshops,
accessing of Indian
training programmes.
theses.
161
License: CC BY-NC-ND
Enhancing
Visibility of
University
Enriching
Expressive-
Research
Enlarging
ness
Access
Enduring
Expanding
Vidyanidhi
Audience
Archiving
Extending
Empowering
Dissemination
Students
Enabling
Universities
162
License: CC BY-NC-ND
163
License: CC BY-NC-ND
Figure 3.9 identifies partnership pattern in Vidyanidhi project. This Figure also
indicates that Vidyanidhi has global partnership with Networked Digital Library of
Theses and Dissertations (NDLTD) and strategic partnership with Indias
University Grants Commission (UGC).
NISSAT,
DSIR, GoI
Microsoft
India
Ford
Foundation
University of Mysore
Department of Library and Information Science
Vidyanidhi
Digital Library and E-Scholarship Portal
NASSDOC
JU
JMI
KU
HCU
164
License: CC BY-NC-ND
NISSAT,
DSIR
Ford
Microsoft
Foundation
India
UGC
Vidyanidhi
NASSDOC
Partner
NDLTD
Universities
Participating
Universities
165
License: CC BY-NC-ND
State
Role
Type of
Organization
Karnataka
Project
Academic
Directorate
Mysore [www.uni-mysore.ac.in/]
National Information System for
Science and Technology (NISSAT),
Delhi
Funding
Government
Delhi
Funding
Donor Agency
Karnataka
Funding
Corporate
Delhi
Strategic
Government
Support
United States
Strategic
Global ETD
Support
Initiative
Delhi
Partner
Institution
Government
West Bengal
Partner
Academic
University
Delhi
[www.jmi.nic.in/]
Partner
University
Jammu &
Partner
[www.kashmiruniversity.net/]
Kashmir
University
University of Hyderabad
Andhra
Partner
[www.uohyd.ernet.in/]
Pradesh
University
Content
Other Universities
Academic
Academic
Academic
Academic
Provider
166
License: CC BY-NC-ND
Vidyanidhi
Digital Library and E-Scholarship Portal
Indian Theses Database
(Multilingual Indian Theses
Metadata Records available
Database)
167
License: CC BY-NC-ND
168
License: CC BY-NC-ND
No. of Theses as on
No. of Theses as on
21/07/2008
15/05/2007
University of Mysore
1805
1657
University of Hyderabad
396
394
271
24
Jadavpur University
University of Kashmir
Others
2999
2070
Total
5478
4151
Thesis Title
Yes
Author's Name
Yes
Guide's Name
Yes
Language
Yes
Degree Grantor
Yes
Department Name
Yes
Submitted Year
Yes
169
License: CC BY-NC-ND
Searchable Metadata
Displayed Metadata
Simple Search
Thesis Title
Title
Advanced Search
Creator
Creator
Field Specific
Contributor
Subject
Subject
Contributor
Author Search
Language
Language
Supervisor Search
University Name
Degree Grantor
Subject Search
Year
Year
University Search
Keyword
Search
Boolean
Example
Operator
Find Results
With all of the words
AND
AND
Institutional Repositories
OR
Institutional Repositories in
word
Developing Countries
NOT
170
License: CC BY-NC-ND
Browsable
Searchable
Displayed
Metadata
Metadata
Metadata
Metadata
Recorded in
XML File
dc.title
Communities &
Keyword
Title
dc.creator
Collections
Author
Authors
dc.subject
(Collection by
Title
Keywords
dc.description
Subject /
Subject
Issue Date
dc.publisher
Collection by
Abstract
Abstract
dc.contributor
University)
URI
dc.date
Titles
Appears in
dc.type
Authors
Collections
dc.identifier
By Date
(Name of
Collections)
dc.language
dc.coverage
dc.rights
thesis.degree
3.4.3 Digitization Facility at Vidyanidhi Project
171
License: CC BY-NC-ND
172
License: CC BY-NC-ND
student first submits metadata information related to his thesis using appropriate
metadata entry form. Then he uploads his full-text thesis and related files in
appropriate file formats. After completion of submission process, a record of the
thesis work is stored in the buffer space of the repository. The submitted record
is validated and approved by ETD coordinator and/or his thesis guide, and then
record of thesis gets permanently archived in the ETD repository for worldwide
dissemination.
Table 3.10 shows the subject-wise distribution of full-text theses in the repository
which depicts that the Division of Mechanical Sciences has the maximum
contribution to this open access archive, followed by the Division of Electrical
Sciences and the Division of Biological Sciences. This Table also indicates that
collection from Division of Physical and Mathematical Sciences has the
maximum growth rate, followed by the Division of Chemical Sciences and the
Division of Mechanical Sciences, as calculated within a gap of fourteen months.
Figure 3.13 shows graphically the growth pattern of electronic theses collections
at ETD@IISc.
Figure 3.14 depicts statistical distribution of electronic theses collections in
ETD@IISc by subject divisions that shows the Division of Mechanical Sciences is
the top contributor (35.7%), followed by the Division of Electrical Sciences
(24.2%) and the Division of Biological Sciences (15.8%).
This repository uses a standard set of metadata elements as available in DSpace
software such as Title, Authors, Subject, Keywords, Date of Submission,
Publisher, Abstract, URI (Uniform Resource Identifier), and Name of Collection.
Here the author refers to both researcher (who carried out a thesis work) as well
as supervisors. The record display page of individual thesis provides links to fulltext objects, along with metadata information. Table 3.11 shows metadata
elements used at ETD@IISc such as browsable metadata elements, searchable
metadata elements and displayed metadata elements. This Table also indicates
that ETD@IISc collections can be navigated by name of author, name of guide,
thesis title, subject and issue date.
173
License: CC BY-NC-ND
Indian Institute of Science established an IISc INDEST User Group that has
subgroups on Thesis Format Guidelines, Copyright Issues and Technology
Issues for Archival and Online Hosting. ETD@IISc hosts some operational
guidelines and agreement forms, derived from the recommendations of the User
Group, for the perusal of IISc researchers and members of INDEST Consortium.
This portal also provides IISc Theses Style Templates for wide use by IISc
students and research community. Over the time this ETD initiative has became
a role model amongst the members of INDEST Consortium. The INDEST
Consortium has a strong advocacy role in formulation of policy for establishment
of open access ETD repositories and/or open access institutional repositories in
its respective member institutions.
The ETD@IISc is a flagship initiative for increasing visibility and availability of
research literature produced in Indias elite institutions. This happens to be a
motivational factor for other Indian and Asian research institutions as well as to
the research funding agencies to improve research landscape of emerging
knowledge economies. This effort will obviously attract more talents and research
funding into higher educational institutions in India.
Institutional Repositories of
EPrints@IISc
ETD@IISc
<http://eprints.iisc.ernet.in/>
<http://etd.ncsi.iisc.ernet.in/>
174
License: CC BY-NC-ND
As on
Growth
21/07/2008
15/05/2007
Rate
47
40
17.5
42
25
68.0
72
62
16.1
12
-58.3
106
77
37.7
25
13
Mathematical Sciences
Total
92.3
297
229
29.7
175
License: CC BY-NC-ND
77
Mechanical Sc.
62
Electrical Sc.
40
Biological Sc.
25
Chemical Sc.
13
Information Sc.
106
72
47
42
25
12
20
40
60
80
100
120
As on 15/05/2007
As on 21/07/2008
Div. of Physical
and Mathematical
Sc., 8.4%
Div. of Biological
Sc., 15.8%
Div. of Chemical
Sc., 14.1%
Div. of Mechanical
Sc., 35.7%
Div. of Information
Sc., 1.7%
Div. of Electrical
Sc., 24.2%
License: CC BY-NC-ND
Searchable Metadata
Displayed Metadata
Metadata
Communities &
Collections
(Name of
Collections)
Keyword
Title
Author
Authors
Title
Issue Date
Titles
Subject
Publisher
Authors
Abstract
Abstract
Thesis Guides
URI
Subjects
Appears in Collections
Issue Date
(Name of Collections)
177
License: CC BY-NC-ND
Although this repository has a wide coverage in all major biomedical subject
areas, it has a comprehensive collection of papers on tuberculosis as well as
tropical diseases.
Using this facility, authors/researchers/practitioners can self-archive their
scientific and technical documents including theses and dissertations. They
usually register once in order to obtain a unique user identity for the system. No
registration is however required for searching the archive or viewing the
documents. OpenMED accepts both published and unpublished documents
having relevance to research in biomedical and allied sciences including
paramedical areas. Self-archived contributions can be in the form of theses,
dissertations, journal articles, conference papers, conference proceedings,
posters, presentations, clinical cases, technical reports, working papers, and
policy documents. This repository also accepts documents in Indian languages,
but in this case descriptive metadata is entered in English. The self-archived
document then goes to the moderator and becomes part of the OpenMED
archive on its acceptance. Presently, this archive also serves as a knowledge
repository of journals, institutions and conference organizers in India for longterm preservation of research publications. These include:
Institutional Repositories
College of Veterinary Science and Animal Husbandry, Anand
Agricultural University, Gujarat
National Tuberculosis Institute, Karnataka
Sir Ganga Ram Hospital, New Delhi
Journal Repositories
Anatolian Journal of Psychiatry
Archives of Medical Science
Calicut Medical Journal
Indian Journal of Medical Informatics
Indian Journal of Tuberculosis
Indian Pacing and Electrophysiology Journal
178
License: CC BY-NC-ND
179
License: CC BY-NC-ND
to carry out their doctoral and pre-doctoral research degrees. These funding
agencies can make use of OpenMED to enhance international visibility of Indian
theses literature. They can motivate Indian researchers, who are the recipients of
research fellowships, to self-archive biomedical theses and dissertations into
OpenMED portal.
Open Access
Initiatives at NIC
IndMED:
A Bibliographic
Database of Indian
Biomedical Research
<http://indmed.nic.in>
OpenMED@NIC:
An Open Access
Archive for Medical
and Allied Sciences
<http://openmed.nic.in>
medIND:
Indian Full-text BioMedical Journals
Indexed in IndMED
<http://medind.nic.in>
180
License: CC BY-NC-ND
OpenMED@NIC
Objectives
Goal
Aim
To preserve
To provide a free
facility to authors to
valuable research
self-archive their
publications for
publications.
future medical
To encourage self-
service to
academics,
researchers, and
researchers and
archiving culture
side by side
amongst medical
publicize research
professionals in the
being conducted in
India.
the country.
To provide free
students working in
the area of medical
and allied sciences.
Self-Archived
Theses
Other SelfArchived
Documents
Journal
Repositories
OpenMED
Institutional
Repositories
Conference
Repositories
181
License: CC BY-NC-ND
Thesis, 2.4%
Institutional
Repositories,
26.5%
Journal
Repositories,
57.2%
Searchable Metadata
Metadata
Displayed Metadata
(for Thesis)
Subject
Title
Title
Year
Author/ Creator
Author/ Creator
Type of
Subject
Year
Date
University Name
Repository
Abstract
EPrint Type: Thesis
(PhD)
Additional Information
Uncontrolled Keywords
Subjects
ID Code
Deposited By
182
License: CC BY-NC-ND
Date of Deposit
3.7 DSIR-supported Other ETD Initiative
The Technology Information Facilitation Programme (TIFP) of the Department of
Scientific & Industrial Research (DSIR), Government of India, has a series of
focused activities to support digital and indigenous knowledge base, and more
particularly Indian Digital Library of Theses and R&D Publications. This activity
aims at establishing suitable mechanism for capturing full-text information of
dissertations/theses and R&D reports of Government funded projects and
publishing such material the same through electronic media (DSIR, 2006).
Under the auspices of this TIF Programme, DSIR is supporting a project entitled
Design and Development of Database and Web-Portal of Indian Theses
Covering Technology and Management aspects related to Manufacturing,
undertaken by Department of Mechanical Engineering in the Bannari Amman
Institute of Technology at Sathyamangalam, Tamil Nadu (DSIR, 2006).
The objectives of the project are:
This project was initiated in 2005-06 financial year under the heading Indian
Digital Library of Theses and R&D Publications and is scheduled to be
concluded in 2008-09 financial year. Unfortunately no Web-portal is launched till
date and no significant progress on the project has been reported in journals or
conferences.
On the other hand, the predecessor of this DSIR-supported ETD project Vidyanidhi, is a grand success as University of Mysore had a clear roadmap and
183
License: CC BY-NC-ND
Dspace@NIT Rourkela
DU Eprint Archive
184
License: CC BY-NC-ND
ePrints@IISc
ePrints@IIT Delhi
ETD@IISc
OpenMED
185
License: CC BY-NC-ND
Searchable Metadata
Displayed Metadata
Name of Archive
Name of Archive
Creator
Title
Contributor
Creator
Date
Identifier (Hyperlink)
Language
Description
Identifier
Format
Description
Language
Format
Rights
Publisher
Subject
Rights
Title
Type
Type
Subject
Name of Archive
Relation
Contributor
Source
Coverage
186
License: CC BY-NC-ND
187
License: CC BY-NC-ND
188
License: CC BY-NC-ND
Name of
URL
Level
Institution
Vidyanidhi
University of
www.vidyanidhi.org.in
National
http://csirexplorations.com/
National
Mysore
CSIR e-Thesis
URDIP
ethesis/e_thesis.htm
National Online Union
INFLIBNET
http://unicat.inflibnet.ac.in:8 National
Catalogue of Doctoral
Centre
080/unicat/thesis.jsp
Developing
www.delnet.nic.in
National
Dissertations Database
Libraries
Theses
Network
(DELNET)
ETD Digital Library @ IIT
Indian Institute
http://www.library.iitb.ac.in/
Institution
Bombay
of Technology
~mnj/gsdl/cgi-bin/library
al
Institution
Bombay
PhD Theses @ IIT
Indian Institute
www.cenlib.iitm.ac.in/docs/
Madras
of Technology
library/index.php?page=the al
Madras
ses
189
License: CC BY-NC-ND
Research and Development of Information Products (URDIP) at Pune, India. EThesis database is an authoritative source for information about doctoral theses
and dissertations, submitted in Indian universities by the researchers, who
availed CSIR Junior Research Fellowship (JRF) or Senior Research Fellowship
(SRF) scheme. CSIR research fellows carry out their research work in over 250
national laboratories, universities and research institutes in India. This database
is popularly used by doctoral candidates, faculty members and scientists who
would like to know the topics of already completed researches by other research
fellows in past.
At the time of submission of PhD thesis to an Indian university, a CSIR research
fellow usually submits electronic copy of his/her thesis on CD-ROM to URDIP
along with a descriptive set of metadata for inclusion in this online database.
Figure 3.20 shows contribution pattern as followed in the CSIR e-Thesis
database. This Figure depicts that CSIR research fellows attached with CSIR
laboratories or Indian universities are the main contributors, whereas contribution
from CSIR staff members can also be accepted in this database.
The research fellow receives a token financial incentive (INR 3000.00, about US
Dollar 73.00) to recover the cost of creation of CD-ROM. As CSIR is the largest
supporter of doctoral research work in the country, URDIP in association with
Human Resources Development Group (HRDG) of CSIR is creating a digital
library, containing CSIR supported PhD Theses in the 11th Five Year Plan period
(2007-2012) of the country. Probably, the existing CSIR e-Thesis database will
be scaled up for this purpose.
CSIR e-Thesis database has different searching and browsing options such as
browse, simple search, advanced search and thesis bibliography. This database
can be searched by applying different parameters using Boolean operators such
as Title, Author, Guide, Institute, University, Synopsis, and All Fields. The
190
License: CC BY-NC-ND
suggests that the e-Thesis database probably does not have comprehensive
coverage, but created bibliographic records only from few CSIR institutions.
Table 3.17 indicates a detailed template and metadata schema for submission of
thesis record in the CSIR e-Thesis database. This template is very
comprehensive and meticulously followed by the CSIR research scholars at the
time of submission of e-thesis on CD-ROM. Hyperlink to research papers and
patents is a unique option that facilitates navigation of intellectual contributions of
researcher in the forms of patents and research papers.
Table 3.15: Metadata Elements Used at CSIR e-Thesis
Browsable Metadata
Subject
Keyword
(controlled)
Searchable Metadata
Title
Author
Guide
Institute
University
Synopsis
Displayed Metadata
E-Thesis No.
Title
Author
Position
Guide
University
Institute
Area
Specialization
Submitted Year
Synopsis
Papers (Hyperlink)
Patents (Hyperlink)
191
License: CC BY-NC-ND
CSIR Junior/
Senior
Research
Fellows
(JRF/SRF)
CSIR eThesis
CSIR
Scientists
and Staff
Research
Fellows
affiliated to
CSIR
Institutions
192
License: CC BY-NC-ND
Number of Records
Coverage
(As on 22/07/08)
(Percent)
Chemistry
542
82.25
Biology
54
8.19
Physics
40
6.07
Biotechnology
11
1.67
Pharmaceuticals
0.30
Engineering
0.30
Leather Processing
0.30
Botany
0.15
Earth Sciences
0.15
Environment
0.15
Medicine
0.15
Others
0.30
659
100.00
Total
BioTech, 1.7%
Others, 1.8%
Physics, 6.1%
Biology, 8.2%
Chemistry,
82.2%
193
License: CC BY-NC-ND
Title
Author Details
Guide Details
Degree
University
Institute/Lab
Date of Submission
No. of Pages
Subject/Area
Abstract/Synopsis
Publications
Patents
194
License: CC BY-NC-ND
Online
Union
Catalogue
of
Doctoral
Theses
or
collected
from
various
reliable
sources
and
university
page,
containing
the
metadata
of
title
of
thesis,
name
of
195
License: CC BY-NC-ND
196
License: CC BY-NC-ND
197
License: CC BY-NC-ND
goods and social applications, rather than merely providing research degrees to
the beneficiaries.
3.11 References
Department of Scientific and Industrial Research (2006). DSIR Annual Report
2005-2006. New Delhi: Department of Scientific and Industrial Research.
Ghosh, S.B. & Das, A. K. (2007). Open Access and Institutional Repositories - a
Developing Country Perspective: a Case Study of India. IFLA Journal, 33(3),
229-250.
INDEST Consortium (2003). Minutes of the Meeting held on 7th October, 2003 at
IIT
Delhi.
Retrieved
15
May
2007
from
http://paniit.iitd.ac.in/indest/extended/minutes7thoct.pdf.
National Knowledge Commission (2006). Report to the Nation 2006. New Delhi:
National Knowledge Commission.
National Knowledge Commission (2007). Report to the Nation 2007. New Delhi:
National Knowledge Commission.
Planning Commission (2006). Report of the Working Group on CSIR - Eleventh
Five Year Plan 2007-2012. New Delhi: Planning Commission.
Planning Commission (2006). Report of Working Group on Higher Education 11th Five Year Plan. New Delhi: Planning Commission.
University Grants Commission (2007). Guidelines for Junior Research Fellowship
in Sciences, Humanities and Social Sciences. Retrieved 15 May 2007 from
www.ugc.ac.in/financialsupport/guideline_10.html.
University Grants Commission (2005). UGC (Submission of Metadata and Fulltext of Doctoral Theses in Electronic Format) Regulations, 2005. Retrieved 15
May 2007 from http://www.ugc.ac.in/new_initiatives/etd_hb.pdf.
Venkadesan, S. & Rajashekar, T.B. (2003). Operational Guidelines for INDEST
Digital Collection of Dissertations in Science and Engineering. Retrieved 15
May 2007 from http://paniit.iitd.ac.in/indest/extended/indestmeet_oct03.pdf.
Virginia Tech (2003). Proceedings of the Indo-US Workshop on Open Digital
Libraries and Interoperability, held at Ballston, USA, 23-25June 2003;
198
License: CC BY-NC-ND
organized by Virginia Tech & Old Dominion University, USA and University of
Mysore & Indian Institute of Science, India. Retrieved 15 May 2007 from
http://fox.cs.vt.edu/IndoUSdl/tabl.htm
-*-
199
License: CC BY-NC-ND
CHAPTER 4
Digitization of Documentary Heritage Collections:
Cases of Indira Gandhi National Centre for the Arts and National
Mission for Manuscripts
200
License: CC BY-NC-ND
with
help
of
digital
technologies.
Modern
information
and
201
License: CC BY-NC-ND
Text Box 4.1: Selected Main Lines of Action Plan for the Implementation of
the UNESCO Universal Declaration on Cultural Diversity
202
License: CC BY-NC-ND
203
License: CC BY-NC-ND
IGNCA
Kalanidhi
Kalakosa
Janapada
Sampada
Kaladarsana
Cultural
Informatics
Lab
Sutradhara
204
License: CC BY-NC-ND
205
License: CC BY-NC-ND
Over the time, CIL becomes a focal digitization and interactive multimedia
documentation centre for institutions supported by Ministry of Culture. Ministry of
Communication and Information Technology also recognized CIL as a dedicated
national centre for cultural informatics, capable to handle large-scale digitization
projects in documenting Indian cultural heritage and long-term preservation of
precious cultural materials on the verge of extinction.
While centuries old priceless manuscripts, paintings, archival records and rare
books available with state-own memory institutions are slightly better than those
artifacts available with private collections, their nature of brittleness deters the
digitization process to be undertaken by novice technicians at vendors locations.
Thus, there is a need of super specialty centre that will have sophisticated
technologies and technical expertise to handle rare materials with extreme care.
In one decade, CIL became most sought-after national digitization centre for
documentary artifacts and antiquities.
Having technical expertise in handling digitization of visual images, CIL
developed some online utility tools for viewing of digital images by the end users.
The utility tools developed by the CIL can be summarized as:
Decorative Motifs
Iconography
Architecture
206
License: CC BY-NC-ND
CIL has technical expertise to train digitization staff of partner institutions and
similar projects on handling of digitization hardware and software tools.
207
License: CC BY-NC-ND
208
License: CC BY-NC-ND
IBM Linux
Server
Fiber Optic
Storage
System
50
Terabytes
Workstations
/ PCs
(28 Nos)
Expandable
Storage
System
upto 320
Terabytes
Laptops
(3 Nos)
CIL
Infrastructural
Facilities
Router
Network
Printer
Ethernet
D-Link
Ethernet
Vivid
209
License: CC BY-NC-ND
Wicks &
Wilson
Microfilm
Scanner
Video
Digitization
Station
Nikon Slide
Scanner (2)
Minolta
PS7000 B/W
Book
Scanner (2)
CIL
Digitization
Facilities
Bookeye
A1/A2 size
Colour
Book
Scanner (5)
Audio
Digitization
Station
Nikon D2X
Digital
Camera
Umax
Flatbed
Slide
Scanner
Minolta
Microfiche
Scanner
210
License: CC BY-NC-ND
Rare Books
Digitization
Rare
Photographs
Digitization
Manuscripts
Digitization
Microfilm
Digitization
CIL
Digitization
Activities
Microfiche
Digitization
Journals/
Magazines
Digitization
Audio
Digitization
Slides
Digitization
Video
Digitization
211
License: CC BY-NC-ND
Ministry of Communication
and Information Technology (MCIT)
Ministry of Culture
Indira Gandhi National Centre for the Arts (IGNCA)
Kalasampada:
Digital Library
Resources for
Indian Cultural
Heritage
(DL-RICH)
Cultural
Heritage Digital
Library in Hindi
(CHDLH)
National
Databank on
Indian Art and
Culture
National
Databank on
Indian Art and
Culture
MCITsupported
Projects at
IGNCA
Kalasampada
CHDLH
(DL-RICH)
Completed
Ongoing
212
License: CC BY-NC-ND
Kalasampada: Digital
Cultural Heritage
National Databank
Digital Library in
Hindi (CHDLH)
Culture
(DL-RICH)
(NDIAC)
10,000+
pages
rare
5000+ Images of
hours
on
art
and
Indian
culture
antiquities
100+
books
of 100,000+ images
Coverage
of antiquities
recordings.
recordings
(Gaudiya Grantha
Back Volumes of
ASI
Indian
video recordings.
Archaeology A
Selected IGNCA
Review
Journal
Virtual
publications; back
Walkthroughs of
volumes of IGNCA
selected
1000+ hours of
monuments and
heritage sites.
of selected monuments
and heritage sites.
Status
Completed
Ongoing
213
License: CC BY-NC-ND
License: CC BY-NC-ND
Allahabad
manuscripts, over 1500 sculptures and terracotta, and over 1000 paintings and
photographs.
Recently, IGNCA extended its digitization solutions to Asiatic Society in Kolkata,
Department of Historical and Antiquarian Studies in Guwahati, and Sri Chandra
Shekharendra Saraswathi Viswa Maha Vidyalaya in Kanchipuram for digitizing
their antiquity collections including manuscripts.
Table 4.2 provides an indicative list of turnkey projects undertaken by CIL. All of
these turnkey projects were beyond the scopes of MCIT-supported digitization
projects. This Table also indicates that IGNCA is very keen to take up
consultancy projects related to digitization of documentary heritages utilizing its
infrastructure and resources.
Archaeologic
al Survey of
India
Cultural
Academy,
Srinagar
Department of
Historical and
Antiquarian
Stu., Guwahati
Oriental
Research
Library,
Srinagar
National
Museum,
New Delhi
CIL
Digitization
Clienteles
Visva
Bharati,
Santiniketan
Asiatic
Society,
Kolkata
National
Library of
Mongolia
Allahabad
Museum,
Allahabad
Established
Proposed
License: CC BY-NC-ND
Type of Materials
Coverage/ Status
Digitization of
manuscripts
2700+ manuscripts
covering over 0.9 million
folios.
Digitization of Sanskrit
and Persian manuscripts
on Yogavashistha,
Mahabharata, Saivism,
Ayurveda, etc. in Sanskrit
(Sharada and Devnagari
scripts)
10,581 manuscripts
covering over 2.1 million
folios.
Digitization of antiquities,
covering rare
manuscripts of
Rabindranath Tagore, his
paintings, photographs
and letters
Digitization of
manuscripts and other
antiquities
Archaeological Survey of
India (ASI), New Delhi
National Library of
Mongolia, Ulaanbaatar
Digitization of Ganjuur
and Danjuur manuscripts
Completed
Digitization of
manuscripts and other
antiquities
Ongoing
Department of Historical
and Antiquarian Studies,
Guwahati, Assam
Digitization of
manuscripts and other
antiquities
Ongoing
Cultural Academy,
Srinagar
Digitization of
manuscripts and other
antiquities
Ongoing
Sri Chandra
Shekharendra
Saraswathi Viswa Maha
Vidyalaya, Kanchipuram,
Tamil Nadu
Digitization of
manuscripts and other
antiquities
Ongoing
Allahabad Museum,
Allahabad
216
License: CC BY-NC-ND
for
Indian
Languages
Programme
(TDIL)
and
Content
institutions,
manuscript
libraries,
oriental
institutions,
religious
217
License: CC BY-NC-ND
Archaeological
Survey of India
Manuscript
Libraries
(e.g., Allama
Iqbal Library)
Oriental
Institutions
(e.g. Oriental
Research Library)
CILs
Partner
Institutions
Government
Agencies
(e.g. Asiatic
Society)
Museums
(e.g. National
Museum)
Academic
Institutions
(e.g. VisvaBharati)
218
License: CC BY-NC-ND
Guidelines for Digitization Projects for Collections and Holdings in the Public Domain,
Particularly Those Held by Libraries and Archives; jointly prepared by UNESCO, IFLA and
International
Council
on
Archives,
March
2002.
Online
accessible
at:
http://www.ifla.org/VII/s19/pubs/digit-guide.pdf
219
License: CC BY-NC-ND
Slides
Photographs
Audio
Video
Photo CD: a system designed by Kodak for digitizing and storing photos in a CD.
TIFF: Tagged Image File Format an uncompressed storage mode for digital images
WAV: Waveform audio format, a Microsoft and IBM audio file format standard for storing an
Sample rate: number of samples of a sound that are taken per second to represent sound
digitally. The more samples taken per second, the better the reproduction of the sound. The
current sample rate for CD-quality audio is 44,100 samples per second, or 44.1 kHz.
6
MPEG-2: Video compression standard for creation of broadcast-quality television videos that can
be distributed on DVD and similar disks.
220
License: CC BY-NC-ND
221
License: CC BY-NC-ND
Aims
To use multimedia computer technology integrating variety of cultural
information, and
222
License: CC BY-NC-ND
DL-RICH can be navigated through the hyper-linked indexes in the main page,
thematic categories, and geographic categories as indicated in Table 4.5. The
index to thematic categories gives further links to documents pertaining to
literature, personalities, catalogues, performances, studies, events, heritage,
collections, religious categories, etc. The index to geographic categories gives
further links to states and union territories of India, and other countries. Although
DL-RICH portal does not provide a comprehensive search facility, user-friendly
interfaces in this portal make navigation convenient to the end users.
License: CC BY-NC-ND
Fig. 4.12: DL-RICH Knowledge Base A Mix of Closed Access and Open
Access Resources
224
License: CC BY-NC-ND
Textual Collections
Multimedia Documentation
Newsletter of IGNCA
Gita Govinda
Brhadisvara Temple
Kutiyattam
Research Reports
Classical
(Hindustani)
Electronic Books
Audio Recordings
Back volumes
Indian
Back volumes
Journal of IGNCA
Video Recordings
Bibliographies
Devnarayan
Manuscript Catalogues
Catalogue
of
Microfilmed
Manuscripts
ROMs
Ajanta
Devadasi Murai
Muktesvara
Rockart
Databases
Catalogue
of
Catalogues
(CATCAT)
Temple
Digital Images
Archaeological Sites
Exhibitions of Paintings
Photograph Collections
Slide Collections
225
License: CC BY-NC-ND
Geographical Categories
States of India
Andhra Pradesh
Arunachal Pradesh
Assam
Bihar
Delhi
Gujarat
Haryana
Himachal Pradesh
Jammu & Kashmir
Karnataka
Kerala
Madhya Pradesh
Manipur
Meghalaya
Mizoram
Nagaland
Orissa
Punjab
Rajasthan
Tamil Nadu
Tripura
Uttar Pradesh
West Bengal
Union Territories
Andaman and Nicobar Islands
Lakshadweep
Other Countries
Australia
Bangladesh
China
Indonesia
Japan
Korea
Mongolia
Nepal
Sri Lanka
Thailand
Turkey
Vietnam
226
License: CC BY-NC-ND
Metadata Creation
Digitization of Materials
Post-Digitization Editing
Checking of Quality
Permission/
Approval
Obtained from
Source
Libraries?
No
Hosting to
Intranet Server
Yes
Web Hosting of
Digitized Objects to
DL-RICH Portal
227
License: CC BY-NC-ND
DL-RICH is one of the highly visible digital library portals in South Asia. DL-RICH
received the Golden Icon: Award for Exemplary Implementation for eGovernance Initiatives under the category, Best Documented Knowledge and
case study, given by Indias Department of Administrative Reforms and Public
Grievances in 2005.
228
License: CC BY-NC-ND
229
License: CC BY-NC-ND
benefits of IT revolution to the common citizen and more specifically bridging the
existing digital divide in the Hindi-speaking states of India. Figure 4.17 shows the
aims and objectives of CoIL-Net Programme. Out of many objectives of CoIL-Net
Programme, objectives emphasising need of digitization and digital library
development in an Indian language are:
(i)
(ii)
TDIL Programme is the primary collaborator of CHDLH initiative through its CoILNet Programme. Figure 4.15 provides a schematic structure of CHDLH, where
TDIL Programme is placed in top layer in terms followed by CoIL-Net and
implementing agency IGNCA.
CoIL-Net is a national network of eleven intervention initiatives for IT localization
and content creation. Out of them four initiatives are national in scope and seven
initiatives are state-specific i.e., each initiative caters to a particular state out of
seven Hindi-speaking target states. CHDLH is one of the successful national
level intervention initiatives within CoIL-Net, where more contents are created for
Web dissemination than any other network member. Figure 4.18 shows a list of
national and state level intervention initiatives affiliated to CoIL-Net Programme,
where CHDLH is placed very prominently not only in India but also to the Hindispeaking diasporas abroad.
Figure 4.19 indicates that CHDLH aims at offering carefully selected thoughtfully
compiled and contextually integrated multimedia content on cultural heritage, folk
literature and lifestyles of Hindi-speaking region. This Figure also indicates that
CHDLH is engaged in development of a reusable model design and
230
License: CC BY-NC-ND
TDIL Programme
CoIL-Net Programme
Cultural Heritage Digital Library in Hindi
(CHDLH)
Cultural Informatics Lab, IGNCA
(Implementing Agency)
TDIL Programme
Aim
Objectives
Promotion of
IT tools for
Indian
Languages.
knowledge
tools:
representation,
integration,
231
License: CC BY-NC-ND
CoIL-Net programme
Immediate Goals
Broad Objectives
Providing a boost
to IT localization
for socioeconomic
developments.
Helping bridge
the existing digital
divide by
appreciably
improving IT
penetration and
awareness levels,
using Hindi as
medium of
delivery, in the
Hindi-speaking
states of Madhya
Pradesh,
Chhattisgarh,
Uttar Pradesh,
Uttaranchal,
Bihar, Jharkhand,
and Rajasthan.
232
License: CC BY-NC-ND
Coil-Net Activities
233
License: CC BY-NC-ND
offer
Main Objectives
carefully
selected,
thoughtfully
compiled
and
contextually
integrated
heritage,
folk
for Hindi-speaking
of
multimedia content on
cultural
Hindi-speaking
region.
to
socioeconomic
contribute
towards
development
of
the
Hindi-
speaking region.
Source: About CoIL-Net <www.ignca.nic.in/mmd_body.htm#CoIL>
Fig. 4.19: Aim and Main Objectives of CHDLH
4.4.2 Digitized Collections in CHDLH
CHDLH represents ethos of Indian heritage in general and Hindi-speaking target
states in particular. In this project most of the contents has been compiled by the
project investigators and scholars from the respective region covering the major
topics. CHDLH disseminates information and traditional knowledge related to:
Architectural heritage;
234
License: CC BY-NC-ND
Figure 4.20
indicates the focused states of India that become both beneficiaries as well as
content providers of this portal.
Similar to DL-RICH portal, CHDLH portal provides user-friendly browsing
facilities through the hyper-linked index in the main page, index to thematic
categories, and index to geographic categories. Similar to DL-RICH portal,
CHDLH portal does not have search facility.
Table 4.6 indicates the available navigation options in the main page of this
portal. This Table also indicates that different kinds of multimedia materials are
integrated in this portal.
235
License: CC BY-NC-ND
Traditional Literature
Digital Images
Oral Epics
Folk Literature
Geographic Class
Documentation
Audio
Video
Books
Articles
Uttar
Pradesh
Madhya
Pradesh
Rajasthan
CHDLH
Chhattisgarh
Jharkhand
Uttarakhand
Bihar
236
License: CC BY-NC-ND
Traditional
Literature
Articles
Folk
Literature
Books
Oral Epics
CHDLH
Collections
Poets and
Writers
Digital
Images
Geographic
Video
Class
Audio
Documentation
237
License: CC BY-NC-ND
238
License: CC BY-NC-ND
Vision
Main Objectives
authoritative
sources of information
on
Indian
culture.
art
and
239
License: CC BY-NC-ND
Visual archive in this project includes over 0.1 million digital photographs,
photographic slides and negatives on various aspects of Indian art and culture
collected from the various institutions and individuals. The photographic slides
and negatives collected from different sources are digitized using appropriate
digitization method and later stored in JPEG format. Respective groups of
owners of the visual images share their digital copyrights with the project team
and allow to make these visual images freely available through NDIAC Web
portal for wide use by the researchers, archaeologists, art historians, students,
and academics.
Virtual walkthrough is a kind of simulated interactive video tour to allow a user to
virtually visit a heritage site. In this project five virtual walkthroughs will be
created for worldwide access. Some of the selected sites for creation of virtual
walkthroughs are inscribed on World Heritage List, maintained by UNESCO
World Heritage Centre. The heritage sites selected for creation of virtual
walkthroughs are:
Table 4.7 indicates present status of the project as on 31st March 2008 vis--vis
project targets. This Table also shows that creation of digital corpus is on its way
and will meet the target hopefully by the end of project period.
240
License: CC BY-NC-ND
Project Targets
Achievement till
31st March 2008
Textual Pages
25000 books
(5 Million pages)
(2157 Books)
Digital Images
0.1 Million
33000
300 hours
93 hours
700 hours
35 hours
Walkthroughs
05
Nil
241
License: CC BY-NC-ND
IGNCA
Digitized
Corpus in
NDIAC
State
Archaeology
Departments
ASI
Content Providers
Content Aggregator
242
License: CC BY-NC-ND
Travelling
Multimedia
Exhibition
Outreach
Modes
ASI Web
IGNCA Web
Portal
Portal
243
License: CC BY-NC-ND
Project Document: National Mission for Manuscripts. New Delhi: Department of Culture, 2002.
244
License: CC BY-NC-ND
245
License: CC BY-NC-ND
of Indian manuscripts, presently having more than 1.5 million records. After
documenting all revealed data on Indian manuscript collections, NMM targets
physical preservation as well as digital preservation of most significant as well as
most endangered pieces of manuscripts. Figure 4.29 illustrates broad and
digitization objectives of NMM.
Digitization of manuscripts is one of the major intervention areas of the Mission.
NMM envisages that digitization will improve access to significant Indian
manuscript collections. In its digitization objectives, NMM states that digitization
helps in long-term preservation of manuscripts. Digitization also helps in reducing
physical handling of rare and fragile original manuscripts. Digital surrogates of
manuscripts are much easier to share and use than a unique copy of original
manuscripts archived in special condition. National Manuscripts Library, having a
digital library component, will store digitized collections of Indian manuscripts and
eventually will be accessible online. NMM also envisaged to create standards
and procedures for digitization of manuscripts, which NMM already documented
in its booklet titled Guidelines for Digitization of Manuscripts, published in 2005.
246
License: CC BY-NC-ND
Ministry of Culture
Indira Gandhi National Centre for the Arts (IGNCA)
(National Nodal Agency)
Manuscript
Manuscript
Manuscript
Resource
Conservation
Partner
Conservation
Centres
Centres
Centres
Partner Centres
(MRCs)
(MCCs)
(MPCs)
(MCPCs)
National
Empowered
Committee
Project
Monitoring
Committee
Executive
Committee
National
Mission for
Manuscripts
Finance
National
Committee
Nodal Agency
247
License: CC BY-NC-ND
License: CC BY-NC-ND
NMM undertakes national surveys in each state and union territory for locating
manuscripts in both known and unknown, private and public, catalogued and
non-catalogued collections through a standard questionnaire form as shown in
Annex 3. After completion of national surveys, NMM undertakes extensive postsurvey exercises to document each manuscript in Manus Data Sheets (Annex 4).
National survey addresses documentation of whole manuscript collection of an
institution or individual, whereas post-survey addresses documentation of
detailed information of each manuscript available with surveyed institution or
individual.
In the exercises of national surveys and post surveys, state and district
coordinators provide local support in enumeration of data in structured formats.
Usually, state and district coordinators are drawn from existing Manuscript
Resource Centres of NMM.
During post-survey, local coordinator receives filled in Manus Data Sheets from
the surveyors. Manus Data Sheet describes each manuscript with detailed
information such as title, author, commentary, script, name of repository, name of
scribe, date of manuscript, number of folios and pages, and other relevant
information. Detailed information collected using Manus Data Sheets are then
entered into decentralized national online catalogue of manuscripts using eGranthavali software. This catalogue database is also known as National
Electronic Register of Manuscripts. After cataloguing all surveyed manuscripts,
NMM will select most significant manuscripts for digitization. Then the process of
digitization will be started in selected centres of NMM.
Figure 4.30 illustrates the lifecycle of documentation activities in NMM for Indian
manuscripts. Activities carried out in the first phase of lifecycle are national
survey and post-surveys across the country. In the second phase of lifecycle,
NMM incorporates surveyed data in the National Electronic Register of
Manuscripts an online catalogue of manuscripts. In the third phase of lifecycle,
NMM undertakes digitization activities and incorporates digitized collections in
249
License: CC BY-NC-ND
the National Manuscripts Library hybrid library having both online digital library
as well as offline physical and digital collections housed in IGNCA premises.
Figure 4.31 more specifically indicates that NMM maintains two types of online
databases, namely Kritisampada: National Database of Manuscripts and
National Manuscripts Library. Kritisampada is the Web-based search engine of
National Electronic Register of Manuscripts, presently available from the NMM
portal (Arora, 2007). Digitized collections of manuscripts have been incorporated
in the National Manuscripts Library, which is a kind of hybrid library having both
physical and digital collections. Digital collections of National Manuscripts Library
are now available on intranet only at the IGNCA premises.
National
Surveys for
Manuscripts
National
Electronic
Register of
Manuscripts
Digitization
of Most
Valuable
Manuscripts
Ministry of Culture
National Manuscripts
Database of Manuscripts
Library
(Bibliographic)
(Hybrid)
250
License: CC BY-NC-ND
Post-Survey
More Extensive National Survey
Compilation of CAT-CAT
A database of all printed catalogues of
Indian manuscripts
License: CC BY-NC-ND
License: CC BY-NC-ND
License: CC BY-NC-ND
records are sent to NMM office in New Delhi for integrating into the national
database. NIC provides technical support to the state coordinators of national
surveys, MRCs and MPCs for smooth implementation of Manus Granthavali at
their locations. It is estimated that approximately 5 million manuscripts have to be
catalogued using this software. Presently, Web-based National Database of
Manuscripts: Kritisampada provides information on about 1.5 million manuscripts.
Fig. 4.33: Brochure of Manus Granthavali the Software Used for Creation
of National Electronic Register of Manuscripts
254
License: CC BY-NC-ND
255
License: CC BY-NC-ND
Author
(Free Text
Search)
Search
Criteria
Title
(Free Text
Search)
52
Scripts
21
States/UTs of
India
276
Subject
Headings
States/ UT Coverage
Bamboo Leaves
Papers
States
Maharashtra
Bhoj Patra
Parchments
Andhra Pradesh
Orissa
Birch-bark
Sanchipats
Assam
Punjab
Clay tablets
Scrolls
Bihar
Rajasthan
Cloth
Tamra Patra
Chhattisgarh
Tamil Nadu
Deer Skin
Tortoise shell
Gujarat
Uttar Pradesh
Ivory Covers
Tulapats
Haryana
Uttarakhand
Leather
Wooden Beads
West Bengal
Palm Leaf
Wooden Covers
Jharkhand
UTs/ NCT
Pamera Leaves
Others
Karnataka
Chandigarh
Kerala
Delhi
Madhya Pradesh
Pondicherry
256
License: CC BY-NC-ND
257
License: CC BY-NC-ND
Centre for
Digital Imaging
Technology,
Trivandrum
Digitizing
Mahabharata
Samshodhana
Pratishthanam,
Bangalore
Partners in
NMM Project
National
Informatics
Centre,
Bhubaneswar
INTACH,
Lucknow
258
License: CC BY-NC-ND
(ii)
(iii)
(iv)
This guideline also suggests a list of metadata elements set, similar to metadata
elements used in the Manus Data Sheet or Manus Granthavali software
application.
259
License: CC BY-NC-ND
260
License: CC BY-NC-ND
261
License: CC BY-NC-ND
City
State
Existing Role(s)
Vrindavan
Uttar Pradesh
MRC1, MCC2
Aligarh
Uttar Pradesh
MCPC3
Rampur
Uttar Pradesh
MRC, MCC
Himachal Academy of
Shimla
Himachal
MRC
North Zone
Vrindavan Research
Institute
Aligarh Muslim
University
Pradesh
Language
Vishveshvaranand
Hoshiarpur
Punjab
MRC, MCC
Haridwar
Uttarakhand
MRC
Leh
Jammu &
MRC
Kashmir
Jammu
Research Institute
Jammu &
Kashmir
Jodhpur
Rajasthan
MRC
Udaipur
Rajasthan
MPC4
Ahmedabad
Gujarat
MRC
Kohlapur
Maharashtra
MPC
Research Institute
Institute of Rajasthan
Studies
Lalbhai Dalpatbhai
Institute of Indology
Shivaji University
Cont
1
262
License: CC BY-NC-ND
Cont
Name of Repository
City
State
Existing Role(s)
Mumbai
Maharashtra
MPC
Ujjain
Madhya Pradesh
MRC, MCC
Sagar
Madhya Pradesh
MRC
Indore
Madhya Pradesh
MRC
Darbhanga
Bihar
MRC
Nalanda
Bihar
MRC
Arrah
Bihar
MRC, MCC
Imphal
Manipur
MRC, MCC
Guwahati
Assam
MRC
Calcutta University
Kolkata
West Bengal
MRC, MCC
Sanskrit Sahitya
Kolkata
West Bengal
MCPC
Pondicherry
Pondicherry
MRC
Chennai
Tamil Nadu
MPC
Sabha
Scindia Oriental
Research Institute
Dr. Harisingh Gour
University
Kunda-Kunda Jnanpitha
East Zone
Kameshwar Singh
Darbhanga Sanskrit
University
Nav Nalanda
Mahavihara
Sri Dev Kumar Jain
Oriental Research
Institute
Library
Parishad
South I Zone
French Institute of
Pondicherry
Institute of Asian
Studies
Cont
263
License: CC BY-NC-ND
Cont
Name of Repository
City
State
Existing Role(s)
Government Oriental
Chennai
Tamil Nadu
MCPC
Thanjvur
Tamil Nadu
MRC, MCC
Tamil University
Thanjvur
Tamil Nadu
MPC
Chennai
Tamil Nadu
MPC
Chennai
Tamil Nadu
Calicut University
Calicut
Kerala
Oriental Research
Trivandrum
Kerala
MRC
Tirupati
Andhra Pradesh
MRC, MCC
Salarjung Museum
Hyderabad
Andhra Pradesh
MCC
Idara-e-Adabiyat-e-Urdu
Hyderabad
Andhra Pradesh
MPC
Oriental Manuscripts
Hyderabad
Andhra Pradesh
MRC
Shravanbelgola
Karnataka
MRC
Keladi Museum
Keladi
Karnataka
MRC
Kannada University
Hampi
Karnataka
MRC
Oriental Research
Mysore
Karnataka
MRC
Manuscripts Library
(GOML)
Foundation
Muhammadan Public
Library
Institute
South II Zone
Oriental Research
Institute
Institute
264
License: CC BY-NC-ND
265
License: CC BY-NC-ND
266
License: CC BY-NC-ND
4.8 References
Arora, A. (2007). Online catalogue of one million manuscripts Kritisampada.
Kriti Rakshana, 2(4), 3-5.
Department of Culture (2002). Project document: National Mission for
Manuscripts. New Delhi: Department of Culture.
Gaur, R.C. (2006). Digital libraries in India with special reference to experiences
at IGNCA, New Delhi, India. Retrieved from http://publikationen.ub.unifrankfurt.de/volltexte/2006/2445/.
Indira Gandhi National Centre for the Arts (2003). CoIL-Net Centre: Cultural
Heritage Digital Library in Hindi. Viswabharat @TDIL, 9, 47-55.
Indira Gandhi National Centre for the Arts (2007). Cultural informatics:
information brochure of Cultural Informatics Lab. New Delhi: IGNCA.
Jha, P. (2005). Digital Library @ IGNCA. Viswabharat @TDIL, 17, 76-77.
Jha, P. (2006). Digitization and documentation of manuscripts and other cultural
heritage at the IGNCA. Kriti Rakshana, 2(3), 25-26.
Microsoft India (2005). NIC helps document and digitize Indian manuscripts:
Microsoft .NET customer solution case study. Retrieved 15 July 2008 from
www.microsoft.com/india
Ministry of Communication and Information Technology (2004). Digitization of
culture - background note for Asia IT Ministers 2nd Summit, Hyderabad.
Retrieved 15 June 2008 from www.asiaitsummit.nic.in/Digitisation.pdf
National Mission for Manuscripts (2004). Guidelines for digitization of
manuscripts. New Delhi: National Mission for Manuscripts.
National Mission for Manuscripts (2005). Basic minimum standards for
conservation of manuscripts. New Delhi: National Mission for Manuscripts.
National Mission for Manuscripts (2007). Report of the fourth year: 7 February
2006 7 February 2007. New Delhi: National Mission for Manuscripts.
Paliwal, N. (2007). Vijnananidhi: manuscript treasures of India. Kriti Rakshana,
2(4), 6-7.
267
License: CC BY-NC-ND
Paul, S. & Jain, P.K. (2008). Digitization of special libraries in India: with special
reference to Indira Gandhi National Centre for Arts (IGNCA) library.
Presented in the ICIKM Conference 2008, Kathmandu, Nepal.
UNESCO (2001). Universal Declaration on Cultural Diversity. Retrieved 15 July
2008 from www.un-documents.net/udcd.htm
UNESCO (2002). Guidelines for digitization projects for collections and holdings
in the public domain, particularly those held by libraries and archives.
Retrieved 15 July 2008 from www.ifla.org/VII/s19/pubs/digit-guide.pdf
268
License: CC BY-NC-ND
CHAPTER 5
5.1 Introduction
With the emergence of information and communication technologies (ICT) in
academic and research environment not only in developed countries but also in
developing nations scholars have become more active and empowered in the
processes of scholarly communication and information absorption. Modern ICTenabled research environment helps the institutions in realization of research
goals by electronically communicating results of research in the form of research
papers to journal publishers. Similarly, institution-wide access to primary and
secondary periodicals is implemented utilizing ICT infrastructure available within
the institutions. This enables researchers to get timely information on published
literature as well as forthcoming literature just before publishing in print journals.
Now-a-days e-journals are more familiar to academic communities than even a
decade before. This is due to their far-wider impact on the research
communication process as well as to the research communities (Canessa &
Zennaro, 2008; European Commission, 2008).
While commercial journal publishers have quite impressive journal access
mechanisms due to their early adoption of electronic publishing model, noncommercial journal publishers have started lately to provide full-text access to
their journal contents.
To the journal publishers, electronic journal publishing is more cost-effective
option than its print counterpart, particularly for journals published by non-profit
institutions and societies. In India, most of the primary journals are published by
non-profit bodies (Sen, 2002). However, these journals were not properly
promoted and merchandised to the international markets when only print
versions were available.
268
License: CC BY-NC-ND
269
License: CC BY-NC-ND
Figure 5.1 indicates that digitization of back volumes of journals lead to four kinds
of products, namely:
(i)
(ii)
(iii)
(iv)
Digitized
Journal
Volumes
Open
Access
Journals
Subscriptionbased eJournals
(web-based)
(web-based)
CD-ROM
Collection of
Journal
Articles
Intranet
Collection of
Journal
Articles
(offline)
(Campus-wide)
270
License: CC BY-NC-ND
XML (Extensible Markup Language) is an open standard suitable for web publishing.
271
License: CC BY-NC-ND
Figure 5.3 indicates other two channels of archiving digitized journal articles, i.e.,
(i) through subject repository and (ii) through institutional repository, in addition to
journal gateway. For example, digitized contents of Indian Journal of
Tuberculosis and some other journals are archived in the popular subject
repository in India - OpenMED@NIC. Similarly, the institutional repository of
Indian Institute of Astrophysics in Bangalore provides full-text access to digitized
articles of Bulletin of the Astronomical Society of India.
As indicated earlier, digitized articles can also be part of online full-text research
databases as well as publisher-specific online journal gateways. The full-text
database service providers aggregate contents from different journal publishers.
These contents are not current literature publishers usually provide one year
backward journal contents. On the other hand publisher-specific journal gateway
provides access to current as well as back volumes of journal contents. While
research database can be available both as CD-ROM collections and online
collections, this service is fee-based and subscribers can only opt for bouquet of
journals not individual journals. Similarly, publisher-specific online journal
gateway is fee-based information service. However, some publishers provide
flexibility in choosing a set of journals, based on institutional requirements of the
subscribers.
Figure 5.4 provides a subscription-based diffusion model for digitized journal
articles. There are some online secondary databases originated from India that
also provide web-links to full-text research contents along with index to journal
articles. However, these databases do not exactly provide aggregation service,
instead they act as metadata harvesters or indexers for facilitating full-text
access to journal articles. Examples of these services are INDmed of National
Informatics Centre, Open J-Gate of Informatics India Limited, and Open Index
Initiative of Indira Gandhi Institute of Development Research (IGIDR).
272
License: CC BY-NC-ND
Journal
Gateways
Subject
Repositories
Institutional
Repositories
(National/
International)
273
License: CC BY-NC-ND
Subscriptionbased
Journal
Articles
Online
Research
Databases
(web-based)
Publisherspecific
e-Journals
Gateway
(web-based)
274
License: CC BY-NC-ND
provides support for publication of about twenty five important journals published
by scientific societies in India through its ongoing scheme called Assistance to
Professional Bodies. Similarly, Department of Scientific and Industrial Research
(DSIR) of Ministry of Science and Technology provides support for e-publishing
of existing scientific periodicals through its ongoing scheme called Electronic
Publishing of Selected Indian S&T materials. Indian research councils such as
Council of Scientific and Industrial Research (CSIR) and Indian Council of
Medical Research (ICMR) also have mandates to support publications of Indian
research periodicals.
With the realization of
275
License: CC BY-NC-ND
276
License: CC BY-NC-ND
Number of e-Journals
11
11
Nil
Nil
Nil
JACEP recommended:
Digitization of back-volumes of all journals published by INSA and IAS.
Web-archiving of back-volumes of all journals published by INSA and IAS.
Open Access to all journals published by INSA and IAS.
Each Academy is to set up a web server for hosting journal contents online.
Fig. 5.6: Schematic Structure of JACEP, a Joint Effort of INSA and IAS
277
License: CC BY-NC-ND
To support and research activities by providing all time access to the fulltext scientific information globally.
In later stage, few other project objectives emerged to make this project more
invasive and inclusive.
278
License: CC BY-NC-ND
279
License: CC BY-NC-ND
navigation of the portal, searching the full-text database, and displaying the
retrieved documents after successful search queries.
Although this project is one of first open access journal gateways in India that
promoted the concept of free access to scientific literature, this gateway is not
compliant to the Open Archives Initiative-Protocol for Metadata Harvesting (OAIPMH) - the international standard for web content interoperability. Thus,
metadata harvesting and indexing service providers for open access journals
such as Google Scholar, Directory of Open Access Journals (DOAJ) and Open JGate, cannot index INSA journals archived in this gateway as indicated in Table
5.5. Worldwide scientists and researchers prefer to use secondary services such
as metadata harvesting and indexing services for open access journals to identify
relevant literature in their areas of interest. Thus, INSA loses crucial international
visibility of their journals by not adopting OAI-PMH-compliant journal publishing
model.
e-Journals @ INSA
Digitization Objectives
server;
projects/ activities;
280
License: CC BY-NC-ND
e-Journals @ INSA
Digital Publishing Solutions (DPS), Pune
NISSAT = National Information System for Science and Technology
DSIR = Department of Scientific and Industrial Research
GoI = Government of India
Digitization
of back
volumes
Webenablement
of digitized
contents
INSA
e-Journals
System
Creation of
metadata for
each article
Development
of web-based
ISAR
281
License: CC BY-NC-ND
No. of
Full-Text
Contents
Born Digital
issues
Availability
Digitized
Contents
a year
from Year
Since
1970-
Since
1970
2002
2003
Since
1966-
Since
1966
2002
2003
Since
1970-
Since
1970
2002
2003
Since
1970-
Since
1970
2002
2003
12
- Physical Sciences
Proceedings of the Indian
National Science Academy Part B
- Biological Sciences
282
License: CC BY-NC-ND
Full-Text
Contents
Digitized
1935-1954
1955-69
Physical Sciences
(Re-christened as Proceedings of the Indian National Science
Academy Part A & B)
1955-69
Biological Sciences
(Re-christened as Proceedings of the Indian National Science
Academy Part A & B)
Searchable Metadata
Displayed Metadata
Metadata
Current Journal
Journal Name
Article Title
Keywords
Author
Article Title
Keywords
Issues (Name of
Author
Initial Page
Journal,
Journal Volume
Journal Name
Volume, Year,
Issue
Volume
and Issue
Year
Issue
Number)
Initial Pages
Year
Issue
Journal Back
Read Article
(Hyperlink to Full-text)
More Articles in This
Issue (Hyperlink)
283
License: CC BY-NC-ND
DOAJ
Open
Scholar
J-Gate
0%
0%
25%
284
License: CC BY-NC-ND
285
License: CC BY-NC-ND
All of the articles in current issues of Academy journals are born-digital from 2000
onwards. The articles of back volumes, which were not born-digital, were
digitized through a government-supported project. IAS has already archived all
articles of eight journals from the first volume whereas full-text contents of other
three journals are available from some old volumes. Born digital current journal
issues are made available immediately after publishing the journal issues. Table
5.7 indicates full-text availability of Academy journals in IAS journal gateway and
SpringerLink portal .
The Academy publications portal provides easy navigation facility to the users.
The current journal issue as well as back volume of a journal can be browsed
from the main page of each journals website. The user has to first select year or
volume of a journal then proceed to navigate table of contents of an issue of the
selected volume. When user selects an article from the table of contents, the
PDF version of that article appears on the screen and can be downloaded into
the user workstation.
Unlike INSA journals, IAS journals are compliant to the Open Archives InitiativeProtocol for Metadata Harvesting (OAI-PMH). Many indexing service providers
for open access journals effectively index and harvest contents of almost all IAS
journals archived in its portal. Table 5.8 shows a matrix of coverage of IAS
journals by online journal indexing and metadata harvesting services such as
Google Scholar, Directory of Open Access Journals (DOAJ) and Open J-Gate.
This Table also indicates availability of IAS journals in SpringerLink gateway that
has very efficient searching and retrieval interface.
Open access to IAS journals increases their global visibility and worldwide
acceptance. Internationally acclaimed online research databases such as Web of
Knowledge and Scopus have considered this fact and now index most of the IAS
journals. Indian scientific periodicals are being considered in these citation
databases due to worldwide recognition of publications quality maintained by the
Academy.
286
License: CC BY-NC-ND
Ministry of
Science and
Technology
IAS
Journals
Indian
Institute of
Sciences
Springer Link
Current
Science
Association
Fig. 5.11: Partnership Pattern for Electronic Publishing of IAS Journals
Table 5.6: Indian Academy of Sciences Published Open Access Journals
Journal Name
No. of
Web Address of
issues a
Journal Portal
year
Current Science
24
www.ias.ac.in/currsci/
www.ias.ac.in/chemsci/
www.ias.ac.in/mathsci/
www.ias.ac.in/jess/
www.ias.ac.in/sadhana/
12
www.ias.ac.in/pramana/
Journal of Biosciences
www.ias.ac.in/jbiosci/
www.ias.ac.in/matersci/
www.ias.ac.in/jaa/
Journal of Genetics
www.ias.ac.in/jgenet/
12
www.ias.ac.in/resonance/
Engineering Sciences)
287
License: CC BY-NC-ND
IAS e-Journal
Springer Link
Portal
Vol.
Year
Vol.
Year
Current Science
1932
Nil
Nil
86
1977
88
1979
87
1978
87
1978
87
1978
87
1978
1978
1978
1973
1973
Journal of Biosciences
1979
1979
1979
1979
1980
1980
Journal of Genetics
1910
64
1985
1996
1996
Engineering Sciences)
Education
Table 5.8: Coverage of IAS Journals in OA Journal Indexing Gateways
Journal Name
DOAJ
Open
Springer
Scholar
J-Gate
Link
Current Science
Engineering Sciences)
Pramana - Journal of Physics
288
License: CC BY-NC-ND
Journal of Biosciences
Journal of Genetics
10
10
10
10
90.9%
90.9%
90.9%
90.9%
Education
Total
Coverage
289
License: CC BY-NC-ND
database. This Figure also shows other full-text and reference services
maintained by Bibliographic Services Division of NIC.
Figure 5.13 shows a schematic structure of medIND@NIC project. This Figure
indicates that Indian Council of Medical Research (ICMR) has functional
collaboration in terms of financial support, while Indian MEDLARS Centre of
Bibliographic Services Division at NIC implements this open access initiative.
medIND aims at providing online access to full-text Indian biomedical periodicals
to the users within and outside India. Different publishers, mainly learned
societies in the respective specialized areas, publish medIND covered journals in
print-on-paper format. Figure 5.14 shows the linkages between publishers, ICMR
and NIC in the process of implementation of this project. Figure 5.15 identifies
different categories of publishers that provide full-text journal contents to this
initiative. The content providers are drawn from professional societies, medical
schools, R&D institutions and a medical council ICMR.
Figure 5.16 shows a digitization workflow in medIND project that starts with
identifying journals for digitization. IndMED covered journals are only considered
for this digitization project as these journals maintain high quality, usually publish
in time and have fewer backlogs. After identifying journals, publishers of these
journals are invited to establish an official agreement through memorandum of
understanding (MOU). The publishers who signed up legal agreements send
their journal back volumes to NIC for digitization. NIC digitizes the contents of
back volumes of these journals. NIC also establishes linkages of digitized articles
with IndMED database, if the bibliographic records are already available there.
Otherwise, new bibliographic record for each digitized article is created in
IndMED database. Finally NIC web-enables full-text contents of digitized journals
to make them freely accessible through medIND open access journal gateway.
Initially, medIND project digitized some back volumes of journals. But now NIC is
mostly receiving born-digital contents of current issues from the publishers.
Some of these publishers also maintain separate journal websites for providing
full-text access to journal contents.
290
License: CC BY-NC-ND
Table 5.9 provides full list of Indian journals archived in medIND gateway. This
Table also shows a status of the availability of full-text contents of different
journals. This Table also indicates that in the medIND project only fewer volumes
were digitized mostly after year 2000 period until the born digital contents were
made available.
Table 5.10 provides names of publishers along with respective journal titles that
belong to different categories of journal publishers as indicated in Figure 5.15. A
close observation of names of publishers suggests that a majority of journal
publishers belongs to the professional society category and other publishers
belong to institutional categories. Figure 5.17 shows a distribution pattern of
different kinds of journals in medIND project. This Figure also indicates that
professional societies contribute 76.9% of total number of journals whereas
professional institutions including biomedical schools, R&D institutions and
medical council, contribute 23.1% journals to the medIND gateway.
Similar to IAS journals, medIND journals are compliant to the Open Archives
Initiative-Protocol for Metadata Harvesting (OAI-PMH). Many indexing service
providers for open access journals effectively index and harvest contents of
almost all medIND journals archived in its portal. Table 5.11 shows a matrix of
coverage of medIND journals by online journal indexing and metadata harvesting
services such as Google Scholar, Directory of Open Access Journals (DOAJ)
and Open J-Gate. This Table also indicates availability of medIND journals in
MedKnow gateway that is a private sector professional initiative to make Indian
biomedical periodicals internationally visible through open access and widely
indexed in secondary research databases. Table 5.11 also indicates that about
35.90% medIND journals are listed in DOAJ, about 25.64% medIND journals are
indexed by DOAJ Content service. Indexing services of Google Scholar and
Open J-Gate have better coverage for medIND journals with 84.62% and 79.49%
coverage respectively.
Table 5.11 and Figure 5.18 show that MedKnow covers about 30.8% medIND
journals. While medIND offers merely web-archiving facility to Indian biomedical
291
License: CC BY-NC-ND
Full-text
Products
Reference
Databases
medIND:
Full-text of 38 Journals
Indexed in IndMED
OpenMED@NIC:
an Open Access Archive
for Medical and Allied
Sciences
IndMED: Bibliographic
Database of Peer
Reviewed Indian
Biomedical Journals
Union Catalogue: a
Database of the Serials
Holdings of Major
Medical Libraries
292
License: CC BY-NC-ND
medIND
ICMR
BID, NIC
(Funding
Agency)
(Implementing
Agency)
Professional
Societies
Biomedical
Schools
medIND
Journals
R&D
Institutions
ICMR
293
License: CC BY-NC-ND
No
IndMED
covered
journals?
Stop
Yes
MOU signed with Journal
Publishers
Digitization of Back
Volumes
Linking of Full-text Articles
with IndMED Database
294
License: CC BY-NC-ND
Full-text Availability
Year
Vol. No.
2005
Endodontology
2000
12
Health Administrator
2000
2000
44
2000
14
2002
46
2000
42
2002
17
2000
25
2004
23
2001
22
2001
19
2004
119
2001
11
2002
17
2003
2002
33
2001
53
2005
72
2000
32
2003
34
2004
14
2004
25
2003
19
Medicine
Surgery
Cont...
295
License: CC BY-NC-ND
Cont...
Journal Name
Full-text Availability
Year
Vol. No.
2003
19
2000
47
Indian Pediatrics
2004
41
J.K. Practitioner
2001
2000
2000
46
2004
26
2002
10
2004
54
2001
50
2001
2000
18
Lung India
2004
21
2000
56
NTI Bulletin
2000
36
2001
15
Preventive Dentistry
296
License: CC BY-NC-ND
Journal Publisher
Indian Association
of Cardiovascular Thoracic
Anaesthesiologists
Endodontology
Health Administrator
Immunology
Immunology
Sciences
Oncology
Oncology
Environmental Medicine
Indian Journal of Occupational Therapy
Cont...
297
License: CC BY-NC-ND
Cont...
Journal Name
Journal Publisher
Medicine
University
Diseases
Cardiovascular Surgery
Thoracic Surgeons
Indian Pediatrics
J.K. Practitioner
Medicine
Journal of Indian Rheumatology Association
India
Societies of India
Surgeons
Journal of Indian Society of Pedodontics and
Preventive Dentistry
Preventive Dentistry
Lung India
NTI Bulletin
298
License: CC BY-NC-ND
Institutional
Journals ,
23.1%
Society
Journals,
76.9%
MedKnow
Covered, 30.8%
Not Covered,
69.2%
299
License: CC BY-NC-ND
MedKnow
DOAJ
Pub.
DOAJ
Open
Content Scholar
J-Gate
Endodontology
Health Administrator
Immunology
Allied Sciences
Paediatric Oncology
Environmental Medicine
Cont
300
License: CC BY-NC-ND
Cont
Journal Name
MedKnow
DOAJ
Pub.
DOAJ
Open
Content Scholar
J-Gate
Cardiovascular Surgery
Indian Pediatrics
J.K. Practitioner
Medicine
Lung India
NTI Bulletin
Organs
Total
12
14
10
33
31
30.8%
35.90%
25.64%
84.62%
79.49%
Diseases
Indian Journal of Thoracic and
Coverage
301
License: CC BY-NC-ND
302
License: CC BY-NC-ND
Literature
Search
PubMed
IndMED
medIND
Full-Text
Access
PubMed Database
303
License: CC BY-NC-ND
304
License: CC BY-NC-ND
Searchable
Metadata
Metadata
Current Journal
Displayed Metadata
Title
Name of Author(s)
Author
Keywords
Article Title
Issues (Name of
ISSN
Journal Name
Journal, Year,
Source
Year
and Issue
Journal Title
Volume No.
Number)
Year
Issue No.
Issue
Journal Back
Pages
Abstract
Keywords
No. of References
Record Identifier
Read Article (Hyperlink to Fulltext)
Explore Related Information
(Hyperlink)
License: CC BY-NC-ND
306
License: CC BY-NC-ND
Linking from other sites so that visitors can reach journal site.
Good ranking with search engines which enables people to find its journal
sites.
Medknow also offers post-print services to its clienteles that include Indexing with
bibliographic databases and providing bibliographic records to indexing agencies
and secondary aggregating agencies.
Figure 5.25 shows categories of Medknow clienteles availing e-publishing
solutions. Its clienteles are drawn from the professional societies, academic
institutions and research institutions. Most of these institutions already have
scholarly periodicals in their portfolios and collaborated with Medknow to enlarge
their international visibility and subscriptions. Whereas, few institutions are just
starting publishing scholarly journals and collaborating with Medknow to adapt a
sustainable and technologically advanced publishing model.
Figure 5.26 shows country profile of Medknow clienteles. Its clienteles are mainly
drawn from India. Few institutions from countries such as Nigeria, Saudi Arabia
and United Kingdom also have collaborated with Medknow for co-publishing their
research journals.
Table 5.13 provides a comprehensive list of Medknow Publications hosted open
access journals. In this Table a list of 64 journal titles along with their full-text
availability status is shown. Co-publishing institutions having existing journals
provide digitized as well as born digital contents of old volumes to Medknow for
web-archiving in the respective journals websites. Thus, very old volumes of few
journals are made accessible through their websites. Medknow also helps in
creation of metadata and meta-tags for digitized collection of old volumes and
later these metadata records are merged with existing searchable database of
published papers.
Medknow Publications specially designed its journal portals to make all Medknow
journals compliant to the Open Archives Initiative-Protocol for Metadata
307
License: CC BY-NC-ND
Harvesting (OAI-PMH). All major indexing service providers for open access
journals effectively index and harvest contents of almost all Medknow journals
archived in their respective portals. Table 5.14 shows a matrix of coverage of
Medknow journals by online journal indexing and metadata harvesting services
such as Google Scholar, Directory of Open Access Journals (DOAJ), DOAJ
Content and Open J-Gate. Table 5.14 also indicates that about 85% Medknow
journals are listed in DOAJ. Indexing services of DOAJ Content service, Google
Scholar and Open J-Gate have outstanding coverage for Medknow journals as
compared to other Indian journal gateways with 78.3%, 85% and 79.49%
coverage respectively. This kind of outstanding coverage by secondary
databases is due to its special initiative in providing bibliographic records to
indexing agencies and secondary aggregating agencies.
Publishing and Online Services
Journals' Websites (Open Access)
Online Manuscript Submission and Peer-Review
System
Subscription and Advertisement Management
308
License: CC BY-NC-ND
Professional
Societies
MedKnow
Publications
Biomedical
Schools
R&D
Institutions
India
Saudi Arabia
MedKnow
Publications
United
Kingdom
Nigeria
309
License: CC BY-NC-ND
Journal Name
No.
Full-text Availability
(Year and Vol. No.)
2007 (v.4)
2005 (v.8)
2006 (v.9)
2008 (v.1)
2006 (v.1)
2008 (v.1)
2008 (v.2)
2007 (v.1)
CytoJournal
2008 (v.5)
10
Hepatitis B Annual
2004 (v.1)
11
2003 (v.39)
12
2007 (v.32)
13
2003 (v.7)
14
2006 (v.17)
15
2005 (v.50)
2001 (v.56)
16
and Leprology
17
2002 (v.8)
18
2001(v.19)
19
2001 (v.55)
20
21
2008 (v.23)
2003 (v.7)
22
Environmental Medicine
23
1975 (v.23)
24
2007 (v.41)
25
2004 (v.9)
310
License: CC BY-NC-ND
26
2008 (v.51)
27
2006 (v.68)
28
1969 (v.1)
29
2001 (v.34)
30
2005 (v.47)
31
2008 (v.30)
32
1999 (v.9)
33
34
2003 (v.65)
35
2005 (v.21)
2001 (v.21)
36
Countries
37
2008 (v.2)
38
2007 (v.1)
39
2007 (v.1)
40
2005 (v.1)
41
Journal of Carcinogenesis
2008 (v.7)
42
2007 (v.10)
43
2008 (v.1)
44
Journal of Cytology
2008 (v.25)
45
2008 (v.1)
2009 (v.1)
46
Surgery
47
2008 (v.1)
2005 (v.10)
48
Surgeons
Journal of Indian Society of Pedodontics and
49
Preventive Dentistry
50
2003 (v.21)
2009 (v.1)
311
License: CC BY-NC-ND
51
2006 (v.31)
52
2005 (v.1)
53
2007 (v.11)
54
2006 (v.1)
55
1980 (v.26)
2009 (v.1)
56
Gender
57
Lung India
2008 (v.25)
58
2008 (v.1)
59
2003 (v.1)
60
Neurology India
1999 (v.47)
61
1998 (v.1)
62
2006 (v.12)
63
2005 (v.5)
64
Urology Annals
2009 (v.1)
DOAJ
Google
DOAJ
Content Scholar
X
X
312
License: CC BY-NC-ND
X
X
X
X
X
X
X
X
313
License: CC BY-NC-ND
51
85%
47
78.3%
51
85%
314
License: CC BY-NC-ND
Searchable
Metadata
Metadata
Table of
Displayed Metadata
Author
Article Title
Contents of
Keywords
Name of Author(s)
Current Journal
ISSN
Journal Name
Issues
Journal Name
Year
Journal Back
Volume No.
Issues (Name of
Issue No.
Journal, Year,
Abstract
and Issue
Article Type
Number)
315
License: CC BY-NC-ND
The publisher took up a huge task of digitizing this esteemed magazine to make
available this important collection of Ramakrishna-Vivekananda literature to the
Bengali knowing computer users all over the world, which took a few years to
complete this digitization project, as indicated in Figure 5.27. In September 2005,
Ramakrishna Math released the e-version of all the Udbodhan articles in 21 CDROMs covering contents published in its first centenary. A portion of the digitized
contents is web-enabled in the magazines web portal. Udbodhan magazine
portal provides open access to full-text contents of this magazine since July 2000
issue. Inaugural issue of this magazine published in January 1899 is also made
online in this portal. Presently, current issues of this magazines are born-digital
and web-enabled immediately publishing its print counterpart. Although this
magazine is freely available online, its print version is subscription based. So is
the CD-ROM collection of digitized articles.
Ramakrishna Math is also engaged in digitizing other important literature
collections such as Prabuddha Bharata, published by Advaita Ashrama a wing
of Ramakrishna Math, and other important works on Ramakrishna-Vivekananda
ideology and Vedanta.
316
License: CC BY-NC-ND
License: CC BY-NC-ND
Searchable
Displayed Metadata
Metadata
Range of Years
Category
Year
Subject
Title of Article
Month
Title
Volume Number
Issue Number
Author Type
Issue
Table of Contents of
Author Name
Year of Publishing
(Displayed in Transliterated
(Displayed in Bengali)
English)
Fig. 5.28: Image Viewer A Navigation Tool for Udbodhan Centenary CDROM Collection
318
License: CC BY-NC-ND
5.9 References
Canessa, E. & Zennaro, M., eds. (2008). Science dissemination using open
access: a compendium of selected literature on open access. Trieste: Abdus
Salam International Centre for Theoretical Physics.
319
License: CC BY-NC-ND
320
License: CC BY-NC-ND
CHAPTER 6
6.1 Prologue
Digitization of library materials has quickly become common practice in a great
many institutions. It provides an excellent opportunity to widely disseminate our
documentary heritages and greatly increases access to library collections of rare
documents as well as current research literature. In this connection I would like to
provide an anecdote based on a true story:
One professor of Jawaharlal Nehru University (JNU) went to Kolkata from
New Delhi to consult National Library of India (NLI) and locate a specific
book there. NLI neither has any web-based online public access catalogue
(OPAC) nor an email-based service to inform a distant user about
availability of a document. Fortunately, he came to know about availability
of the book in NLI collections from his Kolkata-based friend. After two days
of follow up, he could locate the book in the library and took photocopy of
about 50% of the said book, although he needed full copy of the book for
his research purpose. This was an out of copyrighted book, but lack of
understanding by NLI staff on copyright norms forced him to settle for a
partial content. After returning back to JNU, he asked his colleague for
obtaining remaining part of the book from the NLI. Instead of going to NLI,
his colleague referred him to Digital Library of India (DLI) portals. Both of
321
License: CC BY-NC-ND
them found full text content of the book at a DLI site within a few minutes
and took print outs on the same day.
Digital Library of India has digitized many historically important documents and
those are now available in DLI portals. Many researchers and scholars need to
consult old literature for their advanced studies. However, many researchers in
India either are not competent in Internet searching or not aware of online
resources originated in India. Thus, when we see usage statistics of any digital
library or digital archive site originated from India including DLI sites, we observe
that the top ranking country is either United States or any other developed
country in terms of number of visitors in a particular period from a country, but
not India. This reflects the fact that Indian researchers and scholars need training
on functional information literacy and digital literacy, which is very crucial at this
digital age.
6.2 Summary of Findings
Indian digitization initiatives as discussed in the Chapters 2 to 5 aim at producing
a vast amount of digitized documents pertaining to different forms of recorded
human knowledge, ranging from the rare manuscripts to current research
literature. Digital Library of India is the largest digitization initiative in India
spreading across states of India and involving over ninety organizations to
ensure several thousands of rare books written in Indian languages as well as
non-Indian languages are accessible through Internet channel. Chapter 2
critically appraises different aspects of the DLI project such as collaboration
pattern, digitization processes, metadata practices, information retrieval, etc.
The higher education system in a country produces advanced research literature
in the form of theses and dissertations in both thrust areas of the country as well
as in the emerging academic fields. The researchers in the country need to
consult results of past researches in order to expand frontiers of knowledge.
Theses and dissertations are constant in demand for our academic research,
although availability of the same is the matter of concern. Chapter 3 critically
322
License: CC BY-NC-ND
323
License: CC BY-NC-ND
License: CC BY-NC-ND
No Public Access
Partial Access
Intranet channel)
Full Access
Full Access
OAI-PMH)
OAI-PMH)
325
License: CC BY-NC-ND
License: CC BY-NC-ND
are also added in this gateway at regular interval through the process of new
journal acquisition, bifurcation or initiation. Best known example for this phase is
the Medknow publications. Medknow is new age digital publisher, publishing
open access journals in collaboration with biomedical professional societies and
institutions. Most of the journal issues of Medknow journals are published online
earlier than their print counterparts.
Model 2: Growth of Open Access Journals in India
Static Growth
Slow Growth
Dynamic Growth
327
License: CC BY-NC-ND
available through Internet channel whereas full text ETD collection is made
available in intranet for the internal users. Best examples in this phase are
national initiatives such as CSIR e-Thesis and institutional initiatives of IIT
Chennai, IIT Mumbai, IIT Kanpur, etc.
In the third phase of this model, a partial full text collection is made accessible
through Internet channel, whereas same is fully accessible in intranet for the
internal users. The best example of this phase is ETD collection of IIT Delhi
which is partially accessible through its institutional repository - ePrints@IIT Delhi
and fully accessible in campus-wide Intranet. In the fourth and final phase of this
model, entire ETD collection is made accessible in Internet through a web-based
ETD repository. The best examples of this kind of collections are Vidyanidhi ETD
repository and ETD@IISc repository.
Figure 6.1 depicts a chain of transitional phases from no online public access to
full-text open access to ETD collection, based on Model 3.
No Public Access
Partial Access
accessible in intranet)
Full Access
Partial Access
channel)
328
License: CC BY-NC-ND
Full-text
Fully
Accessible
in Intranet
Full-text
Fully
Accessible
in Internet
Full-text
Partially
Accessible
in Internet
No Public
Access
Full-text
Partially
Accessible
in Intranet
Metadata
Accessible
in Intranet
Metadata &
Abstract
Accessible
in Internet
Metadata
Accessible
in Internet
Metadata &
Abstract
Accessible
in Intranet
Fig. 6.1: Transition from No Online Public Access to Full-text Open Access
to ETD Collection
329
License: CC BY-NC-ND
License: CC BY-NC-ND
The fourth and final phase of this model takes care of the self-sustainability of a
digitization or open access project. In this phase, the project implementing
agency generates revenues from different sources such as online/print
advertisements, sales promotions, copyright transfers, reprint permissions, print
subscription, and consultancy services. While this phase is most dynamic, it
requires a visionary leadership to formulate robust marketing strategies.
The best example of this phase is the open access initiative of Medknow
Publications having interoperable collection of biomedical open access journals
published from India and other countries. Medknow ensures its journal
collections to be indexed in major secondary databases and metadata harvesting
services. Medknow also tries to generate sustainable revenues by initiating
innovative strategies in the emerging open access domain.
Model 4: Sustainability of Digitization Projects
Self-Sponsored Digitization
Government-Sponsored
Project
Digitization Project
Institutionally Supported
Self-Sustainable Digitization
Government-Sponsored
Project,
Digitization Project,
and Services.
Agencies
331
License: CC BY-NC-ND
332
License: CC BY-NC-ND
ACTORS
NATIONAL
ENVIRONMENT
Socio-Cultural
Economic and
Scientific
Environment
Emerging
Markets
Emerging
Knowledge
Economies
Emerging
Information
Society
GDP
Rate
ICT-enabled
Learning
Environment
Lifelong
Learning and
Workplace
Learning
Growth
IT Localization
National
Information
Infrastructure
National
Knowledge
Network
National
Education
Grid
National
Information
Economy
GLOBAL
ENVIRONMENT
Socio-cultural
Economic and
Scientific
Environment
Global
Information
Systems
Scholarly
Communication
Systems
Development of
Open Contents
Development of
Creative
Contents
Dissemination of
Contents
through Creative
Commons
Licences
Society
Fig. 6.2: National Digital Information System integrating Indigenous Digital Libraries
333
License: CC BY-NC-ND
6.4 Recommendations
This thesis tries to help in establishing a theoretical framework in the area of
indigenous digital library development in a country where a wide array of
documentary heritages coexists along with scholarly literature collections. A
broad-based inclusive National information system integrating indigenous digital
libraries is also proposed in the previous Section of this Chapter. This National
information system integrates many aspects of knowledge society such as
participatory development, IT localization, participatory content creation, creative
content creation, and adoption of Creative Commons Licences for wide
dissemination of open contents. All these are discrete elements having
proposition to serve the knowledge society. Further studies can be carried out to
examine the impact of their integration and convergence on the national
information system.
For any advanced research, some performance indicators as well as
environmental indicators are needed to be defined. These indicators can be
emerged if any kind of national baseline survey is undertaken preferably by a
national agency.
The sustainability of the Indian digitization initiatives is the major concern for
many professionals. Long term sustainability of any information system is
supplemented by innovative practices, visionary leadership and entrepreneurship
capabilities of individuals as well as organizations. Thus, leadership traits and
innovative practices can be measured in a comparative study that may help in
elevating national capacities and capabilities.
This thesis further recommends that Indian digitization and open access projects
should initiate appropriate strategic measures towards attainment of selfsustainability as depicted in the Model 4 theoretical framework.
_*_
334
License: CC BY-NC-ND
ANNEX 1
Questionnaire on Content Digitization
in the Library/ Information Centre/ Archive
Introduction: This questionnaire is part of a research study titled An evaluative study of some
selected libraries in India undergoing the process of digitization being conducted under the aegis
of the Department of Library and Information Science, Jadavpur University, Kolkata. Your reply to
this questionnaire is highly valuable and extremely important to assess and evaluate the state of
the digitization activities in India. Information given here will be kept confidential and used for
academic purposes only. Individuals identity will be kept confidential, if so desired.
ADMINISTRATIVE QUESTIONS
(I) About the Library / Institution
1.1) Name of the Library/ Information Centre/ Archive: ......................................................
1.2) Name of the Parent Body: ...........................................................
1.3) Name of Director/ Chief Librarian/ Librarian/ In-charge: ......................................
1.4) Name of the Person responsible for digitization/ electronic collection: ........................
1.5) Address: ........................................................................................................
1.6) Telephone Number: ............................................................................................................
Fax Number: ......................................................................................................................
E-mail: ....................................................................................................
URL: .................. ...... ...... ..............................................................................................................
(II) Digitization programme and policy
2.1) Do you have a programme for digitizing collections?
Yes /No
Yes /No
Yes / No
Yes/ No
335
License: CC BY-NC-ND
Yes / No
336
License: CC BY-NC-ND
__ the Library, or
__ an outside body
__ library staff, or
__ outside staff
__ the original, or
__ reproductions
__ photocopies
__ microfilm
__ slides
__ photographs
__ microfiche
__ grey level
__ colour
__ TIFF
__GIF
__ JPEG
__BMP
__ others (specify)..........................
__PAL
__ PDF
black/white...............
colour....................
grey....................
337
License: CC BY-NC-ND
8.10) Please give details of any special software used to digitize non-Latin scripts/ Indian scripts?
................ ................................................................................... ..................................................
8.11) Do you digitize sound recordings?
Yes /No
8.12) If Yes what methods are used? ................................................................................
................................................... ................................................... ...............................................
8.13) Do you digitize film or video?
Yes / No
Yes / No
Yes / No
338
License: CC BY-NC-ND
photographs
others (specify)......................
10.3) Do you digitize any of the following:
_____%
_____%
__ engravings
__ prints
__ lithographs
__ posters
__ postcards
__ drawings and water-colours
__ three-dimensional objects
__ sculptures
__ fabrics and textiles
__ sound recordings
__ films and videos
__ others (specify)..........................................
(XI) Catalogues
11.1) Are the catalogue records for digitized material
__ included in the main catalogue
__ in a separate catalogue
11.2) Is the catalogue
__ in paper form
__ electronic form
__ on an intranet server
__ available on the Internet or Websites
11.3) Are digitized materials catalogued according to a recognised standard?
Yes / No
339
License: CC BY-NC-ND
Yes /No
12.7) What approaches for digitized documents searching are available to users?
__ Author
__ Title
__ Subject
__ Keyword
__ Series
__ Place
__ Publisher
__ Year
__ Free search
__ Boolean search
(XIII) Charges
13.1) Are users to pay to use the digitized material?
13.2) If Yes, tick the appropriate items
Yes / No
__ on-site
__ outside the library
__ when accessed through the Website
Yes / No
340
License: CC BY-NC-ND
14.5) Are users allowed to do any of the following with the digitized documents:
__ make printouts
__ download to a PC
__ download to a local network (LAN)
__ download to a general network (WAN)
__ download to a floppy/ CD-ROM
14.6) Is any electronic management system used to control copying?
Yes / No
Yes / No
Yes / No
Yes / No
341
License: CC BY-NC-ND
342
License: CC BY-NC-ND
ANNEX 2
Request for Participation (RFP) in the Digital Library of India Project
1. Name and Address of the Institution:
2. Name and address of the Coordinator:
3. Brief write up about the Institution:
4. Size of the local library:
5. List of books and other contents available locally:
(Please enclose in separate sheets)
6. Preferred Language for Indian language contents and research:
7. Number of scanners needed:
8. Number of researchers who could participate in joint research with Indian and
the US side:
9. Statement of Participation:
We agree to participate in the Digital Library of India Project and agree to ensure
that all the responsibilities outlined in the RFP for DLI will be fully met from our
side. We also nominate ........................................................................... as the
coordinator from our side.
343
License: CC BY-NC-ND
ANNEX 3
National Mission for Manuscripts National Survey for Manuscripts
Questionnaire
Record No.
Name of the Institute
Individual Collection
Place/ Village
District
State
Pin Code
Head of Institution
Telephone
Email
Website
No. of total manuscripts
No. of paper manuscripts
No. of palm leaf manuscripts
No. of manuscripts in other materials
No. of manuscripts catalogued categorywise
State of preservation of manuscripts
Subject
Language
Script
No. of
Manuscripts
Signature of Surveyor
344
License: CC BY-NC-ND
ANNEX 4
National Mission for Manuscripts
Manus Data Sheet
Record No.
Date of Data Collection:
Name of the Institute
Personal Collection
Village
Block
District
State
Pin Code
Title of the Text
Other Title
Author
Language
Script
Commentary
Commentator
Date of Manuscript
Scribe
Subject
Bundle No.
Manuscript No.
No. of Folios
Size of Manuscript
Material
Illustrations
Complete/Incomplete
Missing portion
Condition
Source of Catalogue
Remarks
345
License: CC BY-NC-ND
ANNEX 5
Form for Selection of Journal for Indexing in IndMED
http://indmed.nic.in
I.
Journal Particulars
1. Title:
2. Former Title (if any):
3. ISSN:
4. Language:
5. Frequency:
6. Editor:
7. Postal Address for Correspondence:
8. Telephone/Fax:
9. Email:
10. Homepage (Web Site): http://
11. Year of First Publication and history of publication including non-publication:
12. Indexing Agency (if already indexed):
13. Circulation:
14. Parent body (if any):
15. Subscription Rate:
16. Whether Newsletter/In-house Publication:
346
License: CC BY-NC-ND
II.
Content Particulars
Enclosures Required:
1.
2.
In case the journal is selected for indexing, back volumes and latest
issues would be required free of cost without delay.
3.
Enclose letter (on journal's letterhead) authorising NIC to index the journal.
(Signature)
Name:
Position:
347
License: CC BY-NC-ND
ANNEX 6
List of Abbreviations
CASSIR: Cross Archive Search Services for Indian Repositories
CDAC: Centre for Development of Advanced Computing
CHDLH: Cultural Heritage Digital Library in Hindi
CIL: Cultural Informatics Lab
CoIL-Net: Content Development and IT Localization Network
CSIR: Council of Scientific and Industrial Research
DIT: Department of Information Technology
DLI: Digital Library of India
DL-RICH: Digital Library- Resource for Indian Cultural Heritage
DOAJ: Directory of Open Access Journals
DRTC: Documentation Research and Training Centre
DSIR: Department of Scientific and Industrial Research
DST: Department of Science and Technology
ETD: Electronic Theses and Dissertations
FOSS: Free and Open Source Software
GoI: Government of India
HTML: Hypertext Markup Language
IAS: Indian Academy of Sciences
ICMR: Indian Council of Medical Research
ICT: Information and Communication Technologies
IGNCA: Indira Gandhi National Centre for the Arts
IIIT: International Institute of Information Technology
IISc: Indian Institute of Sciences
IIT: Indian Institute of Technology
INSA: Indian National Science Academy
JACEP: Joint Academys Committee for Electronic Publishing
JPEG: Joint Photographic Experts Group
MCC: Manuscript Conservation Centre
348
License: CC BY-NC-ND
349
License: CC BY-NC-ND
http://in.linkedin.com/in/dranupkumardas/
http://orcid.org/0000-0001-9490-7938
www.researcherid.com/rid/F-3923-2011
http://impactstory.org/AnupKumarDas
www.researchgate.net/profile/Anup_Das7
http://jnu.academia.edu/AnupKumarDas
www.slideshare.net/anupkd
License: CC BY-NC-ND