TY - JOUR
T1 - Objective Definition of Rosette Shape Variation Using a Combined Computer Vision and Data Mining Approach
AU - Camargo-Rodriguez, Anyela
AU - Papadopoulou, Dimitra
AU - Spyropoulou, Zoi
AU - Vlachonasios, Konstantinos
AU - Doonan, John H.
AU - Gay, Alan P.
A2 - Candela, Hector
N1 - Funding Information:
We are grateful to Paula Kover for supplying the accessions and to the European Commission (Erasmus program) for supporting PD and ZS. We are also grateful to Drs John Lane and Ruth Sanderson for their advice on the statistical analysis of the data.
PY - 2014/5/7
Y1 - 2014/5/7
N2 - Computer-vision based measurements of phenotypic variation have implications for crop improvement and food security because they are intrinsically objective. It should be possible therefore to use such approaches to select robust genotypes. However, plants are morphologically complex and identification of meaningful traits from automatically acquired image data is not straightforward. Bespoke algorithms can be designed to capture and/or quantitate specific features but this approach is inflexible and is not generally applicable to a wide range of traits. In this paper, we have used industry-standard computer vision techniques to extract a wide range of features from images of genetically diverse Arabidopsis rosettes growing under non-stimulated conditions, and then used statistical analysis to identify those features that provide good discrimination between ecotypes. This analysis indicates that almost all the observed shape variation can be described by 5 principal components. We describe an easily implemented pipeline including image segmentation, feature extraction and statistical analysis. This pipeline provides a cost-effective and inherently scalable method to parameterise and analyse variation in rosette shape. The acquisition of images does not require any specialised equipment and the computer routines for image processing and data analysis have been implemented using open source software. Source code for data analysis is written using the R package. The equations to calculate image descriptors have been also provided.
AB - Computer-vision based measurements of phenotypic variation have implications for crop improvement and food security because they are intrinsically objective. It should be possible therefore to use such approaches to select robust genotypes. However, plants are morphologically complex and identification of meaningful traits from automatically acquired image data is not straightforward. Bespoke algorithms can be designed to capture and/or quantitate specific features but this approach is inflexible and is not generally applicable to a wide range of traits. In this paper, we have used industry-standard computer vision techniques to extract a wide range of features from images of genetically diverse Arabidopsis rosettes growing under non-stimulated conditions, and then used statistical analysis to identify those features that provide good discrimination between ecotypes. This analysis indicates that almost all the observed shape variation can be described by 5 principal components. We describe an easily implemented pipeline including image segmentation, feature extraction and statistical analysis. This pipeline provides a cost-effective and inherently scalable method to parameterise and analyse variation in rosette shape. The acquisition of images does not require any specialised equipment and the computer routines for image processing and data analysis have been implemented using open source software. Source code for data analysis is written using the R package. The equations to calculate image descriptors have been also provided.
KW - Algorithms
KW - Arabidopsis/genetics
KW - Data Mining
KW - Ecotype
KW - Image Processing, Computer-Assisted
KW - Plant Leaves/genetics
KW - Software
UR - http://www.scopus.com/inward/record.url?scp=84900525783&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0096889
DO - 10.1371/journal.pone.0096889
M3 - Article
C2 - 24804972
SN - 1932-6203
VL - 9
SP - e96889
JO - PLoS One
JF - PLoS One
IS - 5
M1 - e96889
ER -