Conference PaperPDF Available

Estimation of band level resolutions of human chromosome images

Authors:

Abstract and Figures

Current estimation methods for band level resolutions of human chromosome images in cytogenetic laboratories are time consuming and required experienced specialists to manually perform. To alleviate this problem, in this paper, a computerized approach to estimate band level resolution is proposed. The intensity gradient profile and sign profile of chromosome images are utilized to count the number of bands. Then band level resolutions of chromosome images are classified into three categories: 400-, 500-, and 550-band levels by using k-nearest neighbor algorithm. The experimental results show the accuracy of the proposed algorithm. We also provide a discussion on how to improve the overall accuracy.
Content may be subject to copyright.
Estimation of Band Level Resolutions
of Human Chromosome Images
W. Sethakulvichaia, S. Manitpornsutb, M. Wiboonratc, W. Lilakiatsakund, A. Assawamakine, S. Tongsimaf
IT Graduate School, Mahanakorn University of Technology, Bangkok, Thailanda, d
Department of Computer and Multimedia Engineering, University of the Thai Chamber of Commerce, Bangkok, Thailandb
College of Graduate Study in Management, Khon Kaen University, Thailandc
National Center for Genetic Engineering and Biotechnology (BIOTEC), Klong Luang, Pathumthani, Thailande, f
wisan@sriracha.coma, suparerk_man@utcc.ac.thb, montwi@kku.ac.thc, woraphon@mut.ac.thd,
anunchaiice@gmail.come, sissades@biotec.or.thf
Abstract—Current estimation methods for band level resolutions
of human chromosome images in cytogenetic laboratories are
time consuming and required experienced specialists to manually
perform. To alleviate this problem, in this paper, a computerized
approach to estimate band level resolution is proposed. The
intensity gradient profile and sign profile of chromosome images
are utilized to count the number of bands. Then band level
resolutions of chromosome images are classified into three
categories: 400-, 500-, and 550-band levels by using k-nearest
neighbor algorithm. The experimental results show the accuracy
of the proposed algorithm. We also provide a discussion on how
to improve the overall accuracy.
Keywords; image processing, band level resolution, feature
extraction, medial axis determination, k-nearest neighbor.
I. INTRODUCTION
Automated systems are now widely integrated to almost
everywhere surrounding our everyday life. Most of the systems
are aiming to reduce a time consumption of repeated and
tedious tasks. Chromosome analysis is one of those laborious
and time-intensive activities, routinely conducted in
cytogenetic laboratories [1] – [5]. The evolution of computer
graphic in the 21st century has changed the way of human
chromosome analysis. Integrating computerized system such as
image processing into this area has been an open research topic
since 1960s [2]. However, there are plenty of aspects for
research in this area required for improvement, e.g. information
from the chromosome images, processing speed, accuracy rate,
reliability, easy to adaption, and competitive cost.
The band level of resolution of each human chromosome
image is one of the additional information that laboratories
need to specify in karyotype report. Actually this information
represents the total number of bands on chromosomes
approximately half of the cell [6]. Most laboratories determine
this number by identifying the specific landmarks on the
specific chromosomes. From many literature reviews,
automatic estimation system for band level resolution of human
chromosome image has not been mentioned much in this
research field. Furthermore, it is expected that knowing the
band level resolution before the classification process will help
managing the proper dataset for the chromosome classification
system.
The objective of this study is to demonstrate the method for
estimating the band level resolution of human chromosome
image automatically. This paper is organized as following.
Section II provides a background of the cell cycle, the
karyotype analysis and the important of the levels of resolution
of the chromosome images. The source of dataset and the
proposed algorithm are explained in Section III.
Experimentation and results are revealed in Section IV. Finally,
the explanation of results and discussion are summarized in
Section V.
II. BACKGROUND
Cytogenetic laboratories are one of those specialized
laboratories where many special tests for study micro-
biological behavior of all living things are usually conducted
[7]. Karyotype analysis is a special activity, consisting of
several steps, i.e. collecting cell samples, culturing cell
samples, staining chromosomes, acquiring image of
chromosomes, segmenting image into individual chromosomes,
classifying chromosomes into their classes, rearranging and
representing into karyogram and finally analyzing the
karyogram [8].
The results of the karyotype analysis give valuable
information in many aspects such as, for parental screening, sex
determination, identification of abnormality in chromosomal
diseases such as Down’s syndrome, Klinefelter syndromes,
malignant tumor, cancer, leukemia, ect [7].
G-banding is the technique that cytogenetic laboratories use
to stain the specific content of chromosomes so that they
express their distinctiveness pattern on their chromosome
images [1]. This information is the main information that
cytogenetic staff use to classify the chromosome into their
correct classes. Although, there are many more others staining
techniques available for karyotype analysis in this present, and
some of them give more information of individual
chromosomes, g-banding technique is the most widely used for
the karyotype analysis due to its low cost and short preparation
duration. In addition, information achieved after the analysis is
adequate for medial practitioner to make decisions. Therefore,
this work will be focused on the images of the g-banding
chromosome only.
276
978-1-4673-1921-8/12/$31.00 ©2012 IEEE
2012 Ninth International Joint Conference on Computer Science and Software Engineering (JCSSE)
During the cell culturing process, cells go through their
cells cycle processes as shown in Figure 1 [8]. Chromosomes
in each cells, during mitosis process, has different stages of
condensation [9]. Most of currently automatic karyotype
analysis research is focusing on the images of metaphase
chromosomes because the chromosome at this stage is in the
most condensed form. However, in reality, cytogenetic need to
conduct further examination on the pro-metaphase or even
prophase chromosomes in many cases [10] – [11]. At the
earlier stages of mitosis, the structures of chromosomes are less
condensed and the overall length of all chromosomes in one
cell is longer than in metaphase cell. Furthermore, the total
numbers of bands along the length of all chromosomes are also
different. The more number of bands the earlier the stage of
mitosis.
The level of resolution of human chromosome image is an
important number to indicate the stages of chromosomes being
analyzed. It is normally determined by counting the number of
bands seen in a haploid set (22 autosomes + X and Y) [6].
However, this is really difficult to perform in reality. Most
laboratories currently evaluate the level of resolution by
identifying the land marks on the chromosome images and
scoring the karyotype image by using the scoring table as
shown in Table I. At least three of the land marks criteria to be
obtained to indicate the band levels resolution [10]. There is
also a requirement to specify the number of band resolution in
the karyotype report to confirm that the analysis is appropriate
for the referral reason. Table II shows lowest standard
acceptable for a given reason for referral in constitutional
TABLE I G -BANDING RESOLUTION EVALUATION SCORE [10].
Band
resolution
Land marks Criteria
300 band 2 dark bands on 8p (8p12 & 8p22)
3 dark bands on 10q (10q21, 10q23, 10q25)
20p12 visible
22q12 distinct
400 band 3 dark bands on mid-4q (q22-28)
3 dark bands mid-5q (5q14, 5q21, 5q23)
2 dark bands on 9p (9p21 & 9p23)
13q33 distinct
500 band 7q33 & 7q35 distinct
3 dark bands on 11p (11p12, 11p14, 11p15.4)
14q32.2 distinct
4 dark bands on 18q (18q12.1, 18q12.3, 18q21.2,
18q22)
550 band 5q31.2 distinct
8p21.2 visible
2 dark bands on 11pter (11p15.2 & 11p15.4)
22q13.2 distinct
700 band 2p25.2 distinct
2q37.2 distinct
10q21.1 and 10q21.3 resolve
17q22-q24 resolves into 3 dark bands
TALBE II MINIMUM G -BANDING RESOLUTION FOR REGERRAL REASON
[10].
Reason for referral Minimum level of
resolution of G-
banding images
quality
Confirmation of aneuploidy e.g. direct
lymphocyte, direct CV or solid tissue culture
preparation
< 300 band
Exclusion of known large structural
rearrangements e.g. lymphocyte, solid tissue, CVS
direct preparation or amniotic fluid cell
preparation
300 band
Identification and exclusion of small expected
structural rearrangements e.g. lymphocyte, solid
tissue, CVS culture or amniotic fluid preparation
400 band
Routine amniotic fluid and CV culture
preparations
400 band
Abnormal ultrasound scan associated with AF, CV
and solid tissue referrals
500 band
Blood referrals, not covered by exclusion criteria 550 band
For microdeletion syndroms (when no FISH probe
is available)
700 band
Figure 1 Cell cycle and stages in cell division
Figure 2 Ideograms of G-banding patterns for normal human chromosome
number 1 at five different levels of resolution, approximately 300-,400-,
550-, 700-, and 850-band levels [6].
277
analysis as recommended by the Association for Clinical
Cytogenetics [10].
The standard ideograms of chromosomes class 1 shown in
Figure 2 provide schematic representations of chromosomes
corresponding to approximately 300, 400, 550, 700 and 850
bands [6]. It is clearly noticeable that different resolution
yielding different banding or intensity profile.
Apart from this, different levels resolution of G-banding
images acquired from different staining process creates a
tedious problem to most laboratories. Sometime, a slightly
different of period when capturing the metaphase moment
could cause a different in level resolution of banding patterns.
Figure 3 shows the different banding levels of chromosome
number 1, acquired at the different moments during the cell
division process. The longer chromosomes give many more
bands than the shorter one [11].
Since the shape and the number of bands of the same class
chromosomes are varied according to their relative mitosis
stages, classification of chromosomes at the level resolution
different from the resolution of trained database is prone to a
huge number of error rates. Estimation the band levels
resolution of chromosomes prior to the classification stage is a
promising approach to enhance the accuracy of the
classification when incorporate this module into the system to
help selecting the proper band level database to the analyzing
subject. In the next section, automatic band level resolution
estimation algorithm is described in detail.
III. PROPOSED METHODOLOGY
Figure 4 shows the proposed automatic system for band
level resolution estimation of human chromosome images.
There are several steps as described below.
A. Image Enhancement
The first step is to eliminate noises, enhance image
contrast and adjust the brightness of the images by using the
following methods:
Median filtering – to get rid of pepper noises.
Contrast enhancement – to adjust a low-contrast gray-
scale image.
Figure 4 Diagram shows the proposed automatic chromosomes band counting. system.
Figure 3 G-banded chromosomes of the class no.1 arranged in increasing
order of resolution from approximately the 400- to the 700-band levels
G-bands per haploid karyotype
400 450 500 550 600 650 700
Metaphase Prometaphase Prophase
278
Intensity enhancement – to compensate non-uniform
brightness and contrast behavior of the images by using
histogram equalization.
Segmentation – to convert the image to binary by using
Otsu’s thresholding algorithm. Then, closing, opening
and filling image operations are performed to mask out
the individual chromosomes region and smooth the
image boundary.
B. Medial axis determination
A simple thinning algorithm is applied here to determine an
axis of an individual chromosome, as shown in Figure 6(a).
However, the length of thinning line is shorter than that of the
chromosome, thus requiring line extension. Bresenham’s line
algorithm is applied to extend the tip points of the thinning
lines to tip point of the chromosome, as illustrated in Figure
6(b). Finally, as displayed in Figure 6(c), the medial axis of
individual chromosome image is formed.
C. Intensity profile extraction
Intensity profile is extracted from the individual
chromosome image on the perpendicular lines of the medial
axis. In Figure 7(a), the perpendicular lines So are generated
from the local slopes of the medial axis. However these
perpendicular lines might overlap and therefore result in
incorrect intensity profile. The orientations of the perpendicular
lines So are rearranged to eliminate the overlapping lines as
shown in Figure 7(b).
Let is the 1 vector of rearranged slopes. The So is
rearranged by using a moving mean value as:
    1  
·1,,·1   2,3,4, …,   1
(1)
The and are the smoothing weights calculated from
the difference between the current slope and previous slope and
the current slope and the next slope respectively.
The sampling is done by collecting all the intensity value on
each perpendicular line and calculating the average value along
this line except three pixels that closed to the medial axis. This
is because most of the chromosome images appeared to be
“white” in the middle of the chromosome body along its medial
axis as noticeable in Figure 7 (b). The original intensity profile
as shown Figure 8 (a) is then smoothed by using a moving
average filter. Let is the original intensity profile and is the
smoothed intensity profile.
The span of the moving average of this work is set to 5
(experimentally selected). Transformation from the original to
the smoothed profile can be expressed as;
     1
1,,1   2
2,1,,1,2   3,4,5, . .,   2
1,,1      1
     
(2)
D. Intensity gradient determination
The intensity gradient profile is the rate of changes in
intensity value along the medial axis line defined as:

 (3)
(a) (b)
Figure 5 Preprocessing process; (a) original chromosome image,
(b) binary image.
(a) (b) (c)
Figure 6 Finding medial axis; (a) Thinning, (b) Extending tips, (c) Medial
axis.
(a) (b)
Figure 7 Finding the perpendicular lines along the medial axis;
(a) perpendicular lines from the local slopes, (b) rearranged perpendicular
lines.
279
where, is a vector returns the one-dimensional numerical
gradient of .  corresponds to 
, the differences in x
(chromosome medial axis) direction. Figure 8(c) shows a
sample of intensity gradient profile derived from the intensity
profile.
E. Band counting
The bands are counted by detecting the changes in direction
of the gradient intensity profile. The band is divided into 2
types; white bands and dark bands. The rate of change in
intensity between the edge of white and dark band should be
high. The algorithm is set to count the changes in sign (+/-) of
the gradient profile along the medial axis. The gradient profile
is converted to the sign profile as
 1,   0
0,   0
1,   0 (4)
where, is the sign profile calculated from a gradient profile. Figure
8(d) shows the sample of sign profile.
F. K-Nearest Neighbors
The k-NN is one of the supervised data mining techniques
for classifying data into classes. The method is based on the
majority of the k (k = number of neighbors) nearest distances
between the features of the training dataset to the features of
the testing data [13]. Four features which are number of bands,
length, width and aspect ratio are used in the proposed system.
The number of neighbors was set to vary from k = 1 to 15. The
features of test dataset are then compared to the nearest trained
features.
IV. EXPERIMENTS AND RESULTS
The dataset of chromosome images are from the Center for
Medical Genetics Research, Rajanukul institute, Thailand. In
this research, 210 karyogram images are collected according to
the availability of information of the band level resolution. All
chromosomes were stained using G-banding technique. These
chromosomes were cultured, stained, captured by cytogenetic
specialists. Karyograms were also prepared by using the current
analysis system in the laboratory. However, image intensity
and scaling are very diverse. The band level resolution of each
images were identified by experienced staff. Three band levels
resolutions are investigated, i.e. 400, 500 and 550 band levels.
TABLE III SUMMARY OF THE DATASETS
DESCRIPTION BAND RESOLUTION TOTAL
400 500 550 cell %
Training 56 56 56 168
80
Testing 14 14 14 42 20
Total 70 70 70 210 100
Seventy metaphase cell images for each band level
resolution are employed in experiment, or 210 images in total.
Each metaphase cell image consists of 46 individual
chromosomes. These samples are divided into two sets, training
and testing dataset as shown in table III. All data in each band
level resolution are randomly selected and divided into 80% for
training and 20% for testing, respectively. The experimentation
is divided into 2 sections, band counting and band estimation.
A. Band counting
The test is conducted to count number of bands of
individual chromosomes from the training dataset of 400, 500
and 550 band level resolutions, respectively. The output of each
run contains the following features.
1) The computed bands:
This feature is computed by the technique as described in
the previous section.
2) The Aspect ratio (AR):
This feature is defined by the ratio of length and the width
of each chromosome.
Figure 8 Output profiles from the band estimation algorithm, (a) intensity and smoothed intensity profiles, (b) original input, (c) gradient intensity profile, and
(d) sign profile.
(a)
(b)
(c) (d)
280
  
 (5)
This feature indicates the morphological shape of the
chromosome. The high value of AR means long and/or narrow
chromosome, thus implying the high value of band level
resolution. On the other hand, the low AR implies the low value
of band level resolution.
The results are plotted as shown in figure 9, which is the
plot of counted number of band vs. the aspect ratio of all
chromosomes from each band level datasets. This graph shows
the three clusters of these dataset in a 2D feature space spanned
by the relation between numbers of bands counted from the
proposed algorithm and theirs relative aspect ratios. The border
areas of these clusters are distributed dispersedly but the center
of populated areas of each cluster can be observed.
B. Estimation of Band Level Resoultion
The band level resolution is estimated by implementing k-
nearest neighbor (k-NN) classification model. The features are
number of bands, length, width and aspect ratio, respectively.
The number of neighbors was set to vary from k = 1 to 15. For
simple image representation, Figure 10 displays the
relationship of the target node and the neighbor nodes in the 2D
feature space of the Count Number and the Aspect Ratio
features.
Figure 11 shows the accuracy percentage of the
classification results of each band level and the overall test set.
The maximum accuracy percentage is 72%, 93%, and 79% for
classification of 400-, 500-, and 550-band level resolutions.
The overall accuracy is 76% at k = 3, 9 and 11, respectively.
V. DISCUSSION AND FUTURE WORK
This work has shown the method for extracting features
from individual chromosomes, i.e. intensity profile, gradient
intensity profile, sign profile and aspect ratio. By using k-NN
technique, the classification of human chromosome band level
resolution can be identified with the maximum accuracy of
76%. The main issues that affect the accuracy of the system are
the chromosome image quality and the accuracy of band level
resolution from experts.
The actual chromosomes in the sample glass slides are in
three dimensions. When they are captured by a camera, they
are projected to two-dimensional images. Some of them are
overlapped or crossed by others. The more of chromosomes are
overlapped, the lower quality of the chromosome images is.
The low quality of chromosome image affects the sign profile
and therefore decreasing the classification accuracy of the
proposed system.
Another effect on the system accuracy is the classification
results from experts. Experts do not count the actual number of
bands in the chromosome image since actual counting takes too
much time for human. Instead, they notice the landmark of
some chromosomes to identify the band level resolution. This
is an estimation approach. Unfortunately, the chromosome
images have wide variation in terms of landmarks, length,
width, and band intensity.
To account for the aforementioned issues, our future work
is considering more features from the chromosome images, e.g.
the total number of bands in one cell (instead of individual
chromosome), the normalized band intensities, scaling effect
and compensation of band counting in the overlapped areas.
Other classification techniques are also under investigation. In
addition, the interactive cross validation with the experts will
be conducted to reduce human errors.
Figure 9 Aspect Ratio VS number of bands counted from chromosome
number 1 from 400-,500-, and 550- band levels.
Target node
5 neighbors
Figure 10 The 5 neighbor nodes that have distances of features closet to the
target node.
Figure 11 The classification results when vary the k number from 1 to 15
281
ACKNOWLEDGMENT
Special thanks to Dr. Verayuth Praphanphoj, M.D., Center
for Medical Genetics Research, Rajanukul Insititute, Thailand
and all his staff for supporting data and information.
REFERENCES
[1] X. Wang, B. Zheng, S. Li, J.J. Mulvihill, and H. Liu, “Automated
classification of metaphase chromosomes: optimization of an adaptive
computerized scheme,” Journal of Biomedical Informatics, vol. 42, pp.
22-31, 2009.
[2] F.C.A. Groen, T. K. Kate, A.W.M. Smeulders and I.T. Young, “Human
chromosome classification based on local band descriptors”, Pattern
Recognition Letters, vol. 9, pp. 211-222, April 1989.
[3] B. Legrand, C.S. Chang, S.H. Ong, S.Y. Neo and N. Palanisamy,
“Chromosome classification using dynamic time warping”, Pattern
Recognition Letters, vol. 29, pp. 215-222, 2008.
[4] J.H. Kao, J.H. Chuang and T. Wang, “Chromosome classification based
on the band profile similarity along approximate medial axis”, Pattern
Recognition, vol. 41, pp. 77-89, 2008.
[5] J. Piper and E. Granum, “On Fully Automatic Feature Measurement For
Banded Chromosome Classification”, Cytometry, vol. 10, pp. 242-255,
1989.
[6] ISCN2009: An internationalsystem for human cytogenetics
nomenclature in D.G. Harnden, H.P. linger (Eds.), Cytogenetics and Cell
Genetics, Karger, Basel, 2009
[7] H.C. Macgregor, An Introduction to Animal Cytogenetics, Chapman &
Hall, 1993.
[8] C. Starr and B. McMillan, Human Biology, 5th ed., Books/Cole,
Thomson Learning, 2003.
[9] G. Karp, Cell Biology, International edition, 2nd ed., McGraw-Hill
International Book Co., 1984.
[10] Association for Clinical Cytogenetics, Professional Guidelines for
Clinical Cytogenetics; General Best Practice Guidelines (2007) v1, 04,
March. 2007.
[11] A. Carothers and J. Piper, “Computer-aided classification of human
chromosome: a review”, Statistics and Computing, vol. 4, pp. 161-171,
1994.
[12] B. Lerner, H. Guterman, I. Dinstein and Y. Romem, “Medial axis
transform-based features and neural network for human chromosome
classification”, Pattern Recognition, vol. 28, no. 11, pp. 1673-1683,
1995.
[13] M.S. Serbanescu, “A k-nearest neighbor approach for chromosome
shape classification”, Annals of the University of Craiova, Mathematics
and Computer Science Series, vol. 37(3), pp. 142-146, 2010.
282
... Nowadays, several efforts have been done to create automatic systems for dealing with computer-based karyotyping [2,[6][7][8][9][10][11][12][13]. Several studies implement techniques from machine learning such as Support Vector Machines [7,8,[14][15][16], Nearest Neighbor Algorithms [17,18], Wavelets [19], Bayesian techniques [20,21] and mainly, Artificial Neural Networks [19,[22][23][24][25]. Nevertheless, the automatic computer image-based karyotyping is an open research topic. One of the main problems arises when the image contains overlapping or touching chromosomes, because each chromosome must be cut individually to present the karyogram. ...
... It is important to define the optimal chromosome features to obtain a good accuracy. In the literature, shape description, length, centromere position, and the banding pattern have been used as chromosomes descriptors [12,18,30,31,[34][35][36]. In Reference [8,33], the authors annotate that these characteristics are useful to determine if the chromosome was correctly segmented, but they can not be used as the only descriptors of the chromosome. ...
... Through the analysis of the medial axis it is possible to obtain the length of the chromosome, and some other features, as the gray level intensities along this axis, and its corresponding band profile [18,34]. In Reference [8], a feature vector is constructed using the chromosome length, the gray intensity levels along 98 transversal lines traced over the medial axis, and the chromosome area, computed by counting the active pixels of the binary chromosome image. ...
Article
Full-text available
This work proposes the evaluation of a set of algorithms of machine learning and the selection of the most appropriate one for the classification of segmented chromosomes images acquired using the Giemsa staining technique (G-banding). The evaluation and selection of the best classification algorithms was carried out over a dataset of 119 Q-banding chromosomes images, and the obtained results were then applied to a dataset of 24 G-band chromosomes images, manually classified by an expert of the Laboratory of Cytogenetic of the Children’s Hospital of Tamaulipas. The results of evaluation of 51 classifiers yielded that the best classification accuracy for the selected features was obtained by a backpropagation neural network. One of the main contributions of this study is the proposal of a two-stage classification scheme based on the best classifier found by the initial evaluation. In stage 1, chromosome images are classified into three major groups. In stage 2, the output of phase 1 is used as the input of a multiclass classifier. Using this scheme, 82% of the IGB bank samples and 88% of the samples of a bank of images obtained with a Q-band available in the literature consisting of 119 chromosome studies were successfully classified. The proposed work is a part of an desktop application that allows cytogeneticist to automatically generate cytogenetic reports.
... Depending on the stage of metaphase arrest, the degree of chromosome condensation can differ 13,14 . ...
... Prometaphase cells have longer chromosomes, are less rigid, exhibit greater overlap and less well-defined centromere constrictions, all of which pose a significant challenges for automated chromosome classifiers 14,15 . Metaphase images with longer, thinner chromosomes (roughly corresponding to >500band level 14 ) were also excluded. ...
Preprint
Full-text available
Software to automate digital pathology relies on image quality and the rates of false positive and negative objects in these images. Cytogenetic biodosimetry detects dicentric chromosomes (DCs) that arise from exposure to ionizing radiation, and determines radiation dose received from the frequency of DCs. We present image segmentation methods to rank high quality cytogenetic images and eliminate suboptimal metaphase cell data based on novel quality measures. Improvements in DC recognition increase the accuracy of dose estimates, by reducing false positive (FP) DC detection. A set of chromosome morphology segmentation methods selectively filtered out false DCs, arising primarily from extended prometaphase chromosomes, sister chromatid separation and chromosome fragmentation. This reduced FPs by 55% and was highly specific to the abnormal structures (≥97.7%). Additional procedures were then developed to fully automate image review, resulting in 6 image-level filters that, when combined, selectively remove images with consistently unparsable or incorrectly segmented chromosome morphologies. Overall, these filters can eliminate half of the FPs detected by manual image review. Optimal image selection and FP DCs are minimized by combining multiple feature based segmentation filters and a novel image sorting procedure based on the known distribution of chromosome lengths. Applying the same segmentation filtering procedures to both calibration and test sample image data reduced the average dose estimation error from 0.4Gy to <0.2Gy, obviating the need to first manually review these images. This reliable and scalable solution enables batch processing multiple samples of unknown dose, and meets current requirements for triage radiation biodosimetry of high quality metaphase cell preparations.
... In chromosomes treated with 0 mM Mg 2+ and 1 mM EDTA, new bands were formed as compared to those treated with 5 mM Mg 2+ . The qualitative evaluation of the band level was done by identifying the landmark of the chromosome that can be seen, later it was converted to the estimation of quantitative scale classification based on the assessment criteria (Sethakulvichai et al., 2012). The identification of the optimum band number in chromosomes treated with 1 mM EDTA was relatively more difficult due to the poor banding. ...
Article
Magnesium ion (Mg2+ ) plays a fundamental role in chromosome condensation which is important for genetic material segregation. Studies about the effects of Mg2+ on the overall chromosome structure have been reported. Nevertheless, its effects on the distribution of heterochromatin and euchromatin region have yet to be investigated. The aim of this study was to evaluate the effects of Mg2+ on the banding pattern and ultrastructure of the chromosome. Chromosome analysis was performed using the synchronized HeLa cells. The effect of Mg2+ was evaluated by subjecting the chromosomes to three different solutions, namely XBE5 (containing 5 mM Mg2+ ) as a control, XBE (0 mM Mg2+ ), and 1 mM EDTA as cations-chelator. Chromosome banding was carried out using the GTL-banding technique. The ultrastructure of the chromosomes treated with and without Mg2+ was further obtained using SEM. The results showed a condensed chromosome structure with a clear banding pattern when the chromosomes were treated with a buffer containing 5 mM Mg2+ . In contrast, chromosomes treated with a buffer containing no Mg2+ and those treated with a cations-chelator showed an expanded and fibrous structure with the lower intensity of the banding pattern. Elongation of the chromosome caused by decondensation resulted in the band splitting. The different ultrastructure of the chromosomes treated with and without Mg2+ was obvious under SEM. The results of this study further emphasized the role of Mg2+ on chromosome structure and gave insights into Mg2+ effects on the banding distribution and ultrastructure of the chromosome.
... In chromosomes treated with 0 mM Mg 2+ and 1 mM EDTA, new bands were formed as compared to those treated with 5 mM Mg 2+ . The qualitative evaluation of the band level was done by identifying the landmark of the chromosome that can be seen, later it was converted to the estimation of quantitative scale classi cation based on the assessement criteria 29 . The identi cation of the optimum band number in chromosomes treated with 1 mM EDTA was relatively more di cult due to the poor banding. ...
Preprint
Full-text available
Magnesium ion (Mg ²⁺ ) plays a fundamental role in the chromosome condensation which is important for genetic material segregation. Studies about the effects of Mg ²⁺ on the overall chromosome structure have been reported. Nevertheless, its effects on the distribution of heterochromatin and euchromatin region have yet to be investigated. This study was aimed to evaluate the effects of Mg ²⁺ on the banding pattern of the chromosome structure. Chromosome analysis was performed using the GTL-banding technique on synchronized HeLa cells. The effect of Mg ²⁺ was evaluated by subjecting the chromosomes to three different solutions, namely XBE5 (5 mM Mg ²⁺ ) as a control, XBE (0 mM Mg ²⁺ ), and 1 mM EDTA as cations chelator. The results showed a condensed chromosome structure with a clear banding pattern when it was treated with a buffer containing 5 mM Mg ²⁺ . In contrast, chromosomes treated with a buffer containing no Mg ²⁺ and those treated with an ions chelator showed an expanded and fibrous structure with the lower intensity of the banding pattern. Elongation of the chromosome caused by decondensation resulted in the band splitting. The results of this study further emphasized the role of Mg ²⁺ on chromosome structure and gave insights into Mg ²⁺ effects on the banding distribution.
Preprint
Genetic abnormalities constitute a considerable share of all the existing societal healthcare issues. There has been a dire need for the automation of chromosomal analysis, hence supporting laboratory workers in effective classification and identifying such abnormalities. Nevertheless, many modern image processing techniques, like karyotyping, have improved the life expectancy and quality of life of such cases. The standard image-based analysis procedures include pre-processing, segmentation, feature extraction, and classification of images. When explicitly considering karyotyping, the processes of segmentation and classification of chromosomes have been the most complex, with much existing literature focusing on the same. Various model-based machine learning models have proven to be highly effective in solving existing issues and building an artificial intelligence-based, autonomous-centric karyotyping system. An autonomous karyotyping system will connect the pre-processing, segmentation, and classification of metaphase images. The review focuses on machine-learning-based algorithms for efficient classification accuracy. The study has the sole motive of moving towards an effective classification method for karyotype metaphase images, which will eventually predict the fetus's abnormalities more effectively. The study's results will benefit future researchers working in this area.
Article
Full-text available
Accurate digital image analysis of abnormal microscopic structures relies on high quality images and on minimizing the rates of false positive (FP) and negative objects in images. Cytogenetic biodosimetry detects dicentric chromosomes (DCs) that arise from exposure to ionizing radiation, and determines radiation dose received based on DC frequency. Improvements in automated DC recognition increase the accuracy of dose estimates by reclassifying FP DCs as monocentric chromosomes or chromosome fragments. We also present image segmentation methods to rank high quality digital metaphase images and eliminate suboptimal metaphase cells. A set of chromosome morphology segmentation methods selectively filtered out FP DCs arising primarily from sister chromatid separation, chromosome fragmentation, and cellular debris. This reduced FPs by an average of 55% and was highly specific to these abnormal structures (≥97.7%) in three samples. Additional filters selectively removed images with incomplete, highly overlapped, or missing metaphase cells, or with poor overall chromosome morphologies that increased FP rates. Image selection is optimized and FP DCs are minimized by combining multiple feature based segmentation filters and a novel image sorting procedure based on the known distribution of chromosome lengths. Applying the same image segmentation filtering procedures to both calibration and test samples reduced the average dose estimation error from 0.4 Gy to <0.2 Gy, obviating the need to first manually review these images. This reliable and scalable solution enables batch processing for multiple samples of unknown dose, and meets current requirements for triage radiation biodosimetry of high quality metaphase cell preparations.
Thesis
Full-text available
By clinical evaluation of a patient they can be detected characteristics indicating a possible structural chromosome aberration. The present study is an evaluation or reevaluation of a group of patients with these characteristics, using the technique of high resolution and FISH. We proceeded to implement a technique of high-resolution karyotype, achieving a resolution chromosomal bands 550- 850. The FISH technique was used according to the availability of probes in the laboratory. 54 individuals identifying critical rearrangements in 21 of them were evaluated. It was possible to establish the genotype-phenotype correlation taking into account the variable expressivity of each syndrome and magnitude of each rearrangement. 94% of patients showed a correlation between the cytogenetic finding and the observed clinical features. 14 families was characterized by establishing four rearrangements transmitted from one generation to another. We establish the utility score De Vries in identifying possible cases cryptic rearrangements, and the need for the study patients with a value equal to or greater than 5. The use of microarrays techniques is recommended for better diagnostic accuracy.
Article
Full-text available
In this paper several new techniques for automated chromosome analysis are described: one for piecewise-linear chromosome stretching and projection, two for accurately localizing the centromere and one for two-dimensional local band pattern description. A classification procedure is described that is based upon local band descriptors. Classification results obtained with this method are compared with results obtained with the global band description method (WDD functions). Data sets from two different laboratories are used to investigate the influence of the preparation. Results show the suitability of the local description method in its ability to visualize the image processing technique at the level of the chromosome image.
Article
The aim of this paper is to use a k-nearest neighbor algorithm for chromosome shape classification. To achieve this task a free web database of 117 normal karyotypes (for both men and women) was used. The classifier, in spite of its simplicity proved very reliable, provides a very good accuracy.
Article
Procedures for fully automatic location of chromosome axis and centromere in metaphase chromosomes are described for a practical interactive chromosome analysis system that omits the usual stages of interactive axis and centromere correction. Accuracy of centromere finding and consequential determination of a chromosome's polarity, i.e., which end is which, is measured experimentally. The saving in interaction by not correcting centromeres is compared to the increase in errors at the classification stage and the consequent increase in interaction needed to correct these errors. Some previously unreported features for banded chromosome classification are described, and in particular a set of global shape features is introduced. The discrimination capability of the feature measurements is evaluated by use of simple statistics and by reference to the performance of classifiers trained with various feature subsets. Class discrimination capability of the global shape feature set is shown to be comparable to that of centromere position, a widely used local shape feature. The variability of feature measurements that might occur in data from different laboratories on account of differing tissue, preparation methods, and digitiser hardware is assessed using three data bases of G-banded human metaphase cells. It is shown that the differences can be considerable and that appropriate feature selection and classifier training substantially improve classification performance.
Article
Computer-aided imaging systems are now widely used in cytogenetic laboratories to reduce the tedium and labour-intensiveness of traditional methods of chromosome analysis. Automatic chromosome classification is an essential component of such systems, and we review here the statistical techniques that have contributed towards it. Although completely error-free classification has not been, nor is ever likely to be, achieved, error rates have been reduced to levels that are acceptable for many routine purposes. Further reductions are likely to be achieved through advances in basic biology rather than in statistical methodology. Nevertheless, the subject remains of interest to those involved in statistical classification, because of its intrinsic challenges and because of the large body of existing results with which to compare new approaches. Also, the existence of very large databases of correctly-classified chromosomes provides a valuable resource for empirical investigations of the statistical properties of classifiers.
Article
Automated chromosome classification is an essential task in cytogenetics and has been an important pattern recognition problem. Numerous attempts were made in the past to characterize chromosomes for the purposes of clinical and cancer cytogenetics research. It is important to determine good features and develop feature extraction schemes for chromosome classification. In this paper we propose efficient approaches for medial axis determination and profile matching of human chromosomes without identifying centromeres. The medial axis determination is based on simple cross-section analysis. The features of the band profile obtained along the axis are then used to classify a chromosome based on a subsequence matching technique. Using a special indexing structure, we are able to perform fast similarity search and dynamic insertion and deletion over the established subsequence database of chromosome profiles. According to the experimental results, the developed adaptive system can automatically and efficiently determine the medial axis of a given chromosomes, and achieve satisfactory classification results.
Article
A classifier based on dynamic time warping (DTW) has been developed to perform the classification of human chromosomes. DTW is used in speech recognition applications to compare two time-sequences. This paper describes a method to adapt the DTW technique in order to deal with the length and the density profile, which are common features used in classifying chromosomes. The DTW classifier is able to compare chromosomes with different elongations. Since chromosomes are non-rigid objects, the proposed classifier has the main advantage of requiring only a small training set in comparison with the conventional methods based on Bayesian classifiers or neural networks. For the same classification accuracy of 81.0%, we achieve a reduction of 88% of the size of the training set in comparison with a Bayesian classifier.
Article
Medial axis transform (MAT) based features and a multilayer perceptron (MLP) neural network (NN) were used for human chromosome classification. Two approaches to the MAT, one based on skeletonization and the other based on a piecewise linear (PWL) approximation, were examined. The former yielded a finer medial axis, as well as better chromosome classification performances. Geometrical along with intensity-based features were extracted and tested. The probability of correct training set classification of five chromosome types was 99.3–99.6%. The probability of correct test set classification was greater than 98% and greater than 97% using features extracted by the first and second approaches, respectively. It was found that only 5–10, out of all the considered features, were required to correctly classify the chromosomes with almost no performance degradation.
Article
We developed and tested a new automated chromosome karyotyping scheme using a two-layer classification platform. Our hypothesis is that by selecting most effective feature sets and adaptively optimizing classifiers for the different groups of chromosomes with similar image characteristics, we can reduce the complexity of automated karyotyping scheme and improve its performance and robustness. For this purpose, we assembled an image database involving 6900 chromosomes and implemented a genetic algorithm to optimize the topology of multi-feature based artificial neural networks (ANN). In the first layer of the scheme, a single ANN was employed to classify 24 chromosomes into seven classes. In the second layer, seven ANNs were adaptively optimized for seven classes to identify individual chromosomes. The scheme was optimized and evaluated using a "training-testing-validation" method. In the first layer, the classification accuracy for the validation dataset was 92.9%. In the second layer, classification accuracy of seven ANNs ranged from 67.5% to 97.5%, in which six ANNs achieved accuracy above 93.7% and only one had lessened performance. The maximum difference of classification accuracy between the testing and validation datasets is <1.7%. The study demonstrates that this new scheme achieves higher and robust performance in classifying chromosomes.
Center for Medical Genetics Research, Rajanukul Insititute, Thailand and all his staff for supporting data and information
  • Special
  • To Dr
  • Verayuth
  • M D Praphanphoj
Special thanks to Dr. Verayuth Praphanphoj, M.D., Center for Medical Genetics Research, Rajanukul Insititute, Thailand and all his staff for supporting data and information. REFERENCES