Agrobiodiversity Indicators and Measurement Using R For Description, Monitoring, Comparison, Relatedness, Conservation and Utilization
Agrobiodiversity Indicators and Measurement Using R For Description, Monitoring, Comparison, Relatedness, Conservation and Utilization
Agrobiodiversity Indicators and Measurement Using R For Description, Monitoring, Comparison, Relatedness, Conservation and Utilization
the agrobiodiversity better, to plan programs and activities, and to monitor the trends
(Sthapit et al., 2017; PAR, 2018).
For the conservation of forest biodiversity (non-agrobiodiversity), different
indicators and approaches have been used, for example red listing of the species. Many
types of species have been defined and given due attention. Different types of species
include Alien, Charismatic, Dominant, Emblematic, Endangered, Endemic, Exotic,
Flagship, Focal, Foundation, Indicator, Indigenous, Invasive, Keystone, Landscape,
Priority, Rare, Specialty, Substitute, Surrogate, Target, Threatened, Tourism, Umbrella
and Vulnerable species. Similar approaches can be applied at species and landrace level
to support AGRs. Quantification of AGRs is another aspect that identifies such species
or landraces.
Different types of scores and indices along with coefficients can be estimated and
used as indicators (Joshi et al., 2005; Jarvis et al., 2000; Grum and Atieno, 2007)).
Several statistical tools can be applied using computer software to quantify
agrobiodiversity. Quantifications (measurements) of agrobiodiversity are generally done
at different levels e.g., at the agroecosystem, species, varieties, and administrative units.
Agrobiodiversity in any area should be estimated properly that leads to choosing the
conservation approaches effectively. This paper, therefore, describes different
operational agricultural units (OAU) for estimating diversity indices using R packages.
Among the various components under agrobiodiversity statistics, this paper focuses on
the measurement of agrobiodiversity. With the approaches described in this paper, one
can rank any household, community, district, or the country and can locate a center of
the diversity. A hotspot of agrobiodiversity and red zone for agrobiodiversity can be
identified, in addition to identifying the indicator species and landraces.
Agrobiodiversity covers all genetic resources that have value for food, nutrition,
health, and other economic uses to human beings. It has six components, and they are
crops, forages, livestock, insects, microorganisms, and aquatic genetic resources (Joshi
et al., 2020c). Insects and microorganisms include only economic and beneficial species.
Under aquatic genetic resources, only economically important species are included e.g.,
fish. Each of these components can further be divided into four sub-components. They
are cultivated/ domesticated, semi-domesticated, wild relatives, and wild edible species
(Joshi and Shrestha, 2017; Joshi and Shrestha, 2019).
Based on the economic uses, agricultural genetic resources can be grouped into
25 groups. They are 1. cereals, 2. pseudocereals, 3. millets, 4. sugar and starch crops, 5.
grain legumes, 6. oilseed crops, 7. summer vegetables, 8. winter vegetables, 9. roots and
tubers, 10. winter fruits, 11. summer fruits, 12. spices, 13. beverages and narcotics, 14.
fibers, 15. forage trees, 16. forage grasses, 17. ornamental plants, 18. medicinal plants,
19. supportive plants, 20. economic and beneficial (EB) insects, 21. EB microorganisms,
22. fish/aquatic animals, 23. aquatic plants, 24. poultry, and 25. livestock (Joshi and
Shrestha, 2019, Joshi and Shrestha 2017). Supportive plants include green manuring
crops, cover crops, pesticide plants, and other economically important plants that are not
included in the above groups.
These components, sub-components, and economic groups (Joshi et al., 2020c;
Joshi and Shrestha, 2019) are very useful to estimate different types of diversity indexes,
indicators, and scores of a particular site, community, or household over a certain period.
The AGRs may be of exotic and native types and both types can be considered for
agrobiodiversity measurement, but measurement based on only native AGRs would be
more valuable and important. There are many other grouping systems of AGRs (Joshi
and Shrestha, 2019), and these groups can also be considered to quantify
agrobiodiversity.
Cultivated, semi domesticated, wild relatives and wild edible genetic resources
Agrobiodiversity
within an area , part of biodiversity
Agrobiodiversity levels
1. Agro-ecosystem Diversity, Variety of different agro-ecosystems within an area, different growing seasons,
Agro-ecozone Diversity cropping pattern, agro-ecology and agro-ecozones
2. Agrobiodiversity All components and sub components and groups of agricultural genetic
components and groups resources within agro-ecozone
3. Species Diversity, Inter and intra level species and sub species and crops diversity within a given
Crop Diversity area
4. Varietal Diversity Intra and inter varietal diversity, landrace or cultivar diversity within a species
Genetic Diversity
6. Allelic Diversity Variation within genes, traits and among alleles within genotypes
Agrobiodiversity
types
Diversity
Diversity
Continuous Nominal
Data Type
1 https://news.mongabay.com/2016/05/top-10-biodiverse-countries/
pulses and legumes, milk and milk products, oil/fat and ghee2, sugar and honey, and
miscellaneous) at 0 or 1 scale on half-year basis with maximum 15 score
8. Social agrobiodiversity score (number of religious or culturally associated
landraces, considering all 6 agrobiodiversity components)
9. Food diversity score (number of food items/recipes eaten per meal, average of
morning, day, and evening foods)
10. Food component score (number of species in food per meal, average of morning,
day, and evening foods)
11. The average area per species (crops and forages) in square meter
12. HH agrobiodiversity score: sum from above 1 to 10 scores.
2
It is made by melting butter.
Bal Krishna Joshi 53
ISSN 2564-4653 | Agrobiodiversity & Agroecology | vol.01, No.01 (November 2021): 47-64 | Doi: https://doi.org/10.33002/aa010103
Different types of data are generated and collected for the measurement and other
studies of agrobiodiversity. Different data types for agrobiodiversity study are given in
figure 4. Data could be agro-morphological, molecular, and perception, which can be
generally collected from on-station research, on-farm trial, surveys, and lab research.
Several methods and techniques can be used to collect data and information (see Joshi
et al., 2005 for detail).
Apps and software are available for collecting data and information electronically
both online as well offline. FieldLab is an application for Android tablets that are used
for data collection in the field. It is developed by IRRI3 and is available freely. Field
Book is a simple app for taking phenotypic notes. It is an open-source application for
field data collection on Android4 and is available from Google Play5. The Fieldbook2020
software developed by CIMMYT6 provides offline capabilities for managing pedigrees,
phenotypic data, seed stocks, and field books for a breeding program. It provides
integrated management of global information on genetic resources, crop improvement,
and evaluation for individual crops. R Package7 included in this software is useful for
statistical analyses. Biologer8 is a simple and free software designed for collecting data
on biological diversity.
Perception data is generally collected from a survey. Along with the advancement
of information technology, many data collections survey tools are available. These
3 http://bbi.irri.org/products/fieldlab
4 http://dx.doi.org/10.2135/cropsci2013.08.0579
5 https://play.google.com/store/apps/details?id=com.fieldbook.tracker&hl=en&gl=US
6 https://www.cimmyt.org/
7 https://data.cimmyt.org/dataset.xhtml?persistentId=hdl:11529/10548370
8 https://biologer.org/
online tools are very useful to minimize errors and speed up data processing. Some
electronic media-based survey tools are given below.
▪ Surveymonkey9: A cloud-based survey tool that helps users create, share, collect
and analyze surveys.
▪ Google forms10: It is used to create online forms and surveys.
▪ SoGoSurvey11: A cloud-based platform that enables creation, distribution, and
multilingual analysis of surveys, forms, polls, quizzes, and assessments.
▪ mWater Portal12: Free platform for data collection, data visualizations, and data-
driven management of infrastructure in emerging economies.
▪ ODK13: It is an Open Data Kit, open-source software for collecting, managing,
and using data in resource-constrained environments.
8. Measurement Objects
Many software are available for agrobiodiversity statistics. The general and
molecular software are given below.
9 https://www.surveymonkey.com/
10 https://www.google.com/forms/about/
11 https://experience.sogosurvey.com/
12 https://portal.mwater.co/#/
13 https://opendatakit.org/
14 https://www.agronomix.com/AGROBASE.aspx
15 http://bbi.irri.org/products
16 https://www.diva-gis.org/
17 https://www.vsni.co.uk/software/genstat
18 http://ggebiplot.com/
19 https://instat.software.informer.com/3.3/
20 https://www.minitab.com/en-us/
21 https://www.microsoft.com/en-ww/microsoft-365/excel
22 https://www.canr.msu.edu/afre/projects/microcomputer_statistical_package_mstat._1983_1985
23 http://www.appliedbiostat.com/ntsyspc/ntsyspc.html
24 https://www.nhm.uio.no/english/research/infrastructure/past/index.html
25 https://www.r-project.org/
26 https://www.rstudio.com/
27 https://www.sas.com/en_us/home.html
28 https://www.ibm.com/analytics/spss-statistics-software
29 http://bbi.irri.org/products
30 https://www.statistica.com/en/
31 https://www.statistix.com/
32 http://www.cibiv.at/software/pda/
33 https://www.sams.ac.uk/science/outputs/
34 http://cmpg.unibe.ch/software/arlequin35/
Most of the software and R packages used in biodiversity analysis can be used for
agrobiodiversity analysis. Past is simple and free software that can be used for
agrobiodiversity data. It is good for generating a graph, doing multivariate analysis,
estimating different diversity indices, and analyzing time-series data. Some of the R
packages useful for analysis of agrobiodiversity data are:
▪ adiv42: Analysis of Diversity, with functions, data sets, and examples for the
calculation of various indices of biodiversity including species, functional and
phylogenetic diversity.
▪ agricolae43: Statistical Procedures for Agricultural Research, offers extensive
functionality on experimental design especially for agricultural and plant
breeding experiments and other statistical analysis.
▪ analogues44: To calculate the climatic similarity between a reference site and a
prescribed area, helps identifying locations with similar climates.
▪ BAT45: Biodiversity assessment tools, assess alpha and beta diversity in all their
dimensions (taxonomic, phylogenetic and functional).
▪ BiodiversityR46: For statistical analysis of biodiversity and ecological
communities.
▪ BioFTF47: To study biodiversity with the functional data analysis.
▪ BIO-R48: Biodiversity analysis using molecular data.
▪ GGEBiplotGUI49: A graphical user interface for the construction of, interaction
with, and manipulation of GGE biplots.
35 https://phylogeny.uconn.edu/software/
36 https://biology-assets.anu.edu.au/GenAlEx/Welcome.html
37 https://www.megasoftware.net/
38 https://evolution.genetics.washington.edu/phylip.html
39 https://sites.ualberta.ca/~fyeh/popgene.html
40 https://brcwebportal.cos.ncsu.edu/powermarker/
41 https://web.stanford.edu/group/pritchardlab/structure.html
42 https://cran.r-project.org/web/packages/adiv/index.html
43 https://cran.r-project.org/web/packages/agricolae/index.html
44 https://github.com/CIAT-DAPA/analogues
45 https://biodiversityresearch.org/software/
46 https://www.worldagroforestry.org/output/tree-diversity-analysis
47 https://cran.r-project.org/web/packages/BioFTF/index.html
48 https://data.cimmyt.org/dataset.xhtml?persistentId=hdl:11529/10820
49 https://cran.r-project.org/web/packages/GGEBiplotGUI/index.html
A very common data frame in agrobiodiversity study is a data matrix that contains
information about the properties, traits, characters, variables of several OAU
(individuals, samples, specimens and population). For example, data is a household data
matrix (household by several landraces within a species) and it is a count data set. The
first column is household name or number, and it may be a community, site, household,
species, agrobiodiversity component, agrobiodiversity group, or any other OAU. Other
columns are the number of landraces under different crop species, and it may be species,
cultivars, or any other variables. Data is generally prepared in MS Excel, and it is good
to cross-check and verify the data before importing it into the R environment. The useful
commands in Excel for data check are freezing or splitting panes, filter, sort, text to a
column, data validation, exploratory data analysis, scatter plot, etc.
RStudio is more user-friendly, and the following analysis and process are based
on RStudio. RStudio has four windows, script/editor window, data import/workspace
window, console/ command window, and file/plot/package window. Among many R
packages, vegan and BiodiversityR are more useful for estimating agrobiodiversity
indices (Kindt and Coe, 2005), and, therefore, methods including R script are described
below. To import data, the import dataset menu under environment is used. Here
example data file is hhdata. The followings are the R scripts to import, view data, and
converting imported data into a data frame.
library(readxl)#loading readxl package
hhdata <- read_excel("C:/Users/BK Joshi/Downloads/canada
training/ram/hhdata.xlsx")#importing data from given drive and saving this data
into hhdata
View(hhdata)#to see the data
hhdata<- as.data.frame (hhdata)#converting imported excel data into R data
frame
rownames(hhdata) <- hhdata[,1] #assigning row names from 1st column
hhdata[,1] <- NULL #removing the first column
hhdata #to display data contents
Followings are the R script for installation and estimating diversity indices using
R package, vegan
#install vegan package from a menu, Package then install in RStudio
S=apply(hhdata>0,1,sum)# estimate species richness (S) without loading vegan
50 https://www.rdocumentation.org/packages/stats/versions/3.6.2/topics/hclust
51 https://www.rdocumentation.org/packages/stats/versions/3.6.2/topics/prcomp
52 https://cran.r-project.org/web/packages/psych/index.html
53 https://cran.r-project.org/web/packages/rich/index.html
54 https://cran.r-project.org/web/packages/vegan/index.html
S # to display a richness
library(vegan) #loading vegan package
H=diversity(hhdata)#estimate Shannon diversity index
help(diversity)# look for description of function diversity
simp=diversity (hhdata, index="simpson") #estimate simpson index
J = diversity (hhdata, index =”simpson”)/log(S) #estimate Pielou’s evenness (J)
diversity(hhdata[-1], index="shannon")#exclude first column in case of data file
with first column as row name
barplot(simp) #plot simpson index
pairs(cbind(H, simp), pch="+", col="blue") #plot all
## Species richness (S) and Pielou's evenness (J):
S <- specnumber(hhdata) #estimate richness
cor(H,simp) #correlation coefficient between the Shannon and Simpson indices
A useful picture of diversity across several units is the function anosim() in the
package, vegan. This analysis ranks all the dissimilarities among accessions and
produces a boxplot of the ranks of dissimilarities within a given unit e.g., household. As
an example, iris data set within this package is given below.
data(iris) #loading data in R memory
distiris<-dist(iris[,1:4]) #distance matrix computed by using the specified
distance measure to compute the distances between the rows of a data matrix
anoiris<-anosim(distiris,iris$Species) #analysis of similarities (anosim)
provides a way to test statistically whether there is a significant difference
between two or more groups of sampling units.
plot(anoiris) #produces a boxplot of the ranks of dissimilarities within a given
unit.
Another useful R package is BiodiversityR, which is a graphical user interface for
statistical analysis of biodiversity and ecological communities, including species
accumulation curves, diversity indices, Renyi profiles, GLMs for analysis of species
abundance and presence-absence, distance matrices, Mantel tests, and cluster,
constrained and unconstrained ordination analysis. It is menu-driven built within Rcmdr
package. BiodiversityR analyzes two datasets simultaneously as does the vegan
community ecology package. These data sets are the community datasets (rows
correspond to sample units and columns correspond to species) and the environmental
datasets.
It is suggested to install the package in R following the guidelines55 as described
in the installation guide. The manual56 can also be accessed.
Followings are the commands and steps for analysis in BiodiversityR. An analysis
can be carried out either through menu driven or using commands:
library (BiodiversityR) #load BiodiversityR package
library (Rcmdr) #load Rcmdr package
BiodiversityRGUI() #open graphical interface
help("BiodiversityRGUI", help_type="html") #to see details.
These are the steps for doing analyses with the menu options of BiodiversitR. To
select the species and environmental matrices, follow these menu-driven steps:
BiodiversityR > Environmental Matrix > Select environmental matrix
Select the dune.env dataset as an example
Biodiversity > Community Matrix > Select community matrix
Select the dune dataset as an example.
55 https://www.worldagroforestry.org/sites/default/files/users/admin/Installation%20of%20BiodiversityR%202018.pdf
56 http://apps.worldagroforestry.org/downloads/Publications/PDFS/b13695.pdf
11.1 Interpretation
Richness (S) is a number of species, landraces, particular traits in household,
community, sites, or landrace. It quantifies types of the dataset. Shannon index (Shannon
diversity index or Shannon Weaver index, H’) includes both species number and
evenness, where a greater number of species increase diversity, as does a more equitable
distribution of individuals among species. High H’ is representative of a diverse and
equally distributed community. H’ is strongly influenced by species richness and by rare
species. Simpson index (D) is a measure of diversity, which takes into account both
richness and evenness. The value of D ranges from 0 to 1, the greater the value the
greater the diversity. The Simpson index gives more weight to evenness and common
species. Evenness (Pielou’s evenness, E) is a measure of the relative abundance of the
different species making up the richness of an area. A community dominated by one or
two species is considered to be less diverse than one in which several different species
have a similar abundance. Its value ranges from 0 to 1 and 1 is complete equitability.
12. Conclusion
13. Acknowledgments
14. References
Bajracharya, J., Rana, R., Joshi, B.K., Subedi, A. and Sthapit, B. (2012). Measuring on-
farm crop diversity. In: B.R. Sthapit, P. Shrestha and M.P. Upadhyay (eds.), On-
farm management of agrobiodiversity in Nepal: Good practices. Kathmandu:
NARC, LIBIRD and Bioversity International, pp. 9-12.
Bioversity International (2017). Agrobiodiversity Index. Program Oversight. A paper
presented in 49th Board Meeting of Bioversity International, Rome, Italy, 10 – 12
May 2017.
Borcard, D., Gillet, F. and Legendre, P. (2011). Numerical ecology with R. LLC:
Springer Science+Business Media.
Grum, M. and Atieno, F. (2007). Statistical analysis for plant genetic resources:
clustering and indices in R made simple. Handbooks for Genebanks, No. 9. Rome:
Bioversity International.
Gurung, R., Sthapit, S.R, Gauchan, D., Joshi, B.K. and Sthapit, B.R. (2016). Baseline
Survey Report: II. Ghanpokhara, Lamjung. Integrating Traditional Crop Genetic
Diversity into Technology: Using a Biodiversity Portfolio Approach to Buffer
against Unpredictable Environmental Change in the Nepal Himalayas. Pokhara:
LI-BIRD, NARC and Bioversity International. Available online at
https://cgspace.cgiar.org/handle/10568/81039 [Accessed on 11 August 2021]
Jarvis, D.I., Myer, L., Klemick, H., Guarino, L., Smale, M., Brown, A.H.D., Sadiki, M.,
Sthapit, B. and Hodgkin, T. (2000). A Training Guide for In Situ Conservation
On-farm. Version 1. Rome: International Plant Genetic Resources Institute.
Joshi, B.K. and Baniya, B.K. (2006). A diversity in qualitative traits of Nepalese
cultivated buckwheat species. Fagopyrum, 23:23-27.
Joshi, B.K. and Shrestha, B.B. (2017). Notes on plant and crop classification. In: B.K. Joshi,
H.B. KC and A.K. Acharya (eds.), Conservation and Utilization of Agricultural Plant
Genetic Resources in Nepal. Proceedings of 2nd National Workshop, 22-23 May 2017,
Dhulikhel: NAGRC, FDD, DoA and MoAD, pp. 17-20.
Joshi, B.K. and Shrestha, R. (eds.) (2019). Working Groups of Agricultural Plant
Genetic Resources (APGRs) in Nepal. Proceedings of National Workshop, 21-22
June 2018, Kathmandu: NAGRC, NARC.
Joshi, B.K., Ghimire, K.H., Gurung, R., Pudasaini, N., Pant, S., Paneru, P., Gauchan, D.,
Mishra, K.K. and Jarvis, D. (2020b). On-farm Agrobiodiversity Measurement and
Conservation. In: B.K. Joshi, D. Gauchan, B. Bhandari and D. Jarvis (eds.), Good
Practices for Agrobiodiversity Management. Kathmandu: NAGRC, LI-BIRD and
Alliance of Bioversity International and CIAT, pp. 15-24.
Joshi, B.K., Gorkhali, N.A., Pradhan, N., Ghimire, K.H., Gotame, T.P., KC, P., Mainali,
R.P., Karkee., A. and Paneru, R.B. (2020c). Agrobiodiversity and its Conservation in
Nepal. Journal of Nepal Agricultural Research Council 6: 14-33. DOI:
https://doi.org/10.3126/jnarc.v6i0.28111
Joshi, B.K., Gurung, S.B., Mahat, P.M., Bhandari, B. and Gauchan, D. (2018). Intra-
Varietal Diversity in Landrace and Modern Variety of Rice and Buckwheat. The
Journal of Agriculture and Development, 19: 1-8. Available online at
https://cgspace.cgiar.org/handle/10568/97576 [Accessed on 30 August 2021]
Joshi, B.K., Shrestha, P., Upadhyay, M.P., Chaudhary, B., Mudwari, A., Baniya, B.K.,
and KC, H.B., (2007). On-farm variation and household diversity of pigeon pea
landraces in Kachorwa, Nepal. Nepal Agric. Res. J., 8:28-34. Available online at
https://www.nepjol.info/index.php/NARJ/article/view/11567 [Accessed on 30
August 2021]
Joshi, B.K., Upadhyay, M.P., Bimb, H.P., Gauchan, D. and Baniya, B.K. (2005). Data
analysis methods adopted under in-situ global project in Nepal. Nepal Agric. Res. J.,
6: 99-109. Available online at
https://www.nepjol.info/index.php/NARJ/article/view/3371 [Accessed on 30 August
2021]
Kindt, R. and Coe, R. (2005). Tree diversity analysis. A manual and software for
common statistical methods for ecological and biodiversity studies. Nairobi:
World Agroforestry Centre (ICRAF).
PAR (Platform for Agrobiodiversity Research) (2018). Assessing Agrobiodiversity: A
Compendium of Methods. Platform for Agrobiodiversity Research, Rome.
Pudasaini, N., Sthapit, S.R., Gauchan, D., Bhandari, D., Joshi, B.K. and Sthapit, B.R.
(2016). Baseline Survey Report: I. Jungu, Dolakha. Integrating Traditional Crop
Genetic Diversity into Technology: Using a Biodiversity Portfolio Approach to
Buffer against Unpredictable Environmental Change in the Nepal Himalayas.
Pokhara: LI-BIRD, NARC and Bioversity International.
Sthapit, B.R., Gauchan, D., Joshi, B.K. and Chaudhary, P. (2017). Agrobiodiversity
index to measure agricultural biodiversity for effectively managing it. In: B.K.
Joshi, H.B. KC and A.K. Acharya (eds.), Conservation and Utilization of
Agriculture Plant Genetic Resources in Nepal. Proceed. 2nd National Workshop,
May 22-23, 2017, Dhulikhel: NAGRC/FDD/MoAD.
Funding
No funding was available for the research conducted for and writing of this paper.
(Optional) PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses)
Has author complied with PRISMA standards? Yes
Open Access. This article is licensed under a Creative Commons Attribution 4.0
International License, which permits use, sharing, adaptation, distribution and
reproduction in any medium or format, as long as you give appropriate credit to the
original author(s) and the source, provide a link to the Creative Commons license, and
indicate if changes were made. The images or other third-party material in this article
are included in the article's Creative Commons license, unless indicated otherwise in a
credit line to the material. If material is not included in the article's Creative Commons
license and your intended use is not permitted by statutory regulation or exceeds the
permitted use, you will need to obtain permission directly from the copyright holder. To
view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.