Conference PaperPDF Available

Networking, clustering and brokering keywords in the computer science research - Analysis of the evolution using social network analysis

Authors:
  • Illinois at Singapore Pte Ltd, ADSC

Abstract

Keywords of academic papers are jargons shared within a research domain as well as a summary of the contents. However, the increasing popularity of the interdisciplinary research in academia in recent years opened a possibility that the choice of keywords would be no longer confined by the traditional domain boundaries. In order to investigate the evolutionary trends of keyword-sharing groups in academic publications, we applied three types of Social Network Analysis methods on the papers in the Computer Science domain collected from the Libra database. Particularly, papers published in 5 different time periods from 1970 to 2010 were targeted to show the change of keyword networks over time. The analysis results confirmed the increasing tendency in the keyword network to be absorbed by one big agglomerated cluster over time. As a result, the gatekeeper- or liaison keywords seldom exist in later times but coordinator keywords became majority.
Networking, clustering and brokering keywords in the
computer science research
- Analysis of the Evolution using Social Network Analysis
Jihyoun Park, Tom Z. J. Fu, Dah Ming Chiu
Department of Information Engineering
The Chinese University of Hong Kong, Hong Kong
E-mail: {jhpark, zjfu6, dmchiu}@ie.cuhk.edu.hk
Abstract Keywords of academic papers are jargons shared
within a research domain as well as a summary of the contents.
However, the increasing popularity of the interdisciplinary
research in academia in recent years opened a possibility that the
choice of keywords would be no longer confined by the
traditional domain boundaries. In order to investigate the
evolutionary trends of keyword-sharing groups in academic
publications, we applied three types of Social Network Analysis
methods on the papers in the Computer Science domain collected
from the Libra database. Particularly, papers published in 5
different time periods from 1970 to 2010 were targeted to show
the change of keyword networks over time. The analysis results
confirmed the increasing tendency in the keyword network to be
absorbed by one big agglomerated cluster over time. As a result,
the gatekeeper- or liaison keywords seldom exist in later times
but coordinator keywords became majority.
Keywords-component; social network analysis; academic
keyword analysis; cluster; brokerage
I. INTRODUCTION
The interdisciplinary research gets more popularity and
attention in recent days, because not only the convergence
among different disciplines can provide more innovative tools
to solve a problem by approaching the old problem with brand-
new aspects, but also the global issues that we are now facing
are not confined by a certain domain boundary. Therefore, the
demand for interdisciplinary research is expected to increase
further.
Keywords of an academic paper are the most representative
words to summarize the contents of the paper. They are used to
categorize the paper as well. Therefore, research papers that
belong to the same or related domain area share keywords; and
we can expect that the relationship would be applicable the
other way around, too, if the domains are well classified to be
exclusive and distinctive. However, with the increase of inter-
and multidisciplinary research, we expect that more and more
research papers share keywords beyond the traditional domain
boundaries. Therefore the boundary of traditionally different
research domains would not be as distinguishable as they were.
This paper investigates the keywords network of research
papers in the computer science discipline. We will analyze the
networking characteristics, clustering structure and brokering
nodes between clusters of the network. Those analyses are
conducted for 5 different time periods from 1970 to 2010 with
10 years interval separating them so that the dynamics of the
keywords network over time can be captured. Through the
analysis, we will test how the keyword-sharing groups formed
and changed and which keywords took the role to broker
different keyword-sharing groups for each time period.
Brokerage roles [1, 2] are defined by a vertex’s strategic
location in the network. Contrary to the role of central nodes,
who have power to control information and resource flows in
the network, these network nodes take a role of bridging other
nodes. If we apply this concept to research networks, we can
easily imagine that those broker nodes are the intermediaries
for interdisciplinary research. Especially, representatives,
gatekeepers and liaisons are a type of shared keywords by
different domains. Those keywords with such roles in early
days are likely to become central keywords in later days
because they can take advantage of being related to more
keywords in other domains. However, not all broker keywords
grow up to be central actors. Many keywords in early days
become out-of-date and perish out. Therefore, we are going to
study how they evolved over the course of time. Particularly 10
interesting keywords in 1970 were selected for further
investigation to trace the change of their status and roles from
1970 to 2010.
The remainder of the paper is organized as follows. In
section 2, we will introduce previous literature related to
investigating social networks in scientific research. In section 3,
selected methods of the social network analysis (SNA), which
is the methodology used for our analysis, will be explained.
Section 4 describes the input data of the analysis. After
showing the analysis results in section 5, we conclude this
paper with discussion and future works in Section 6.
II. RELATED WORKS
The past work on academic social networks mainly focused
on the collaboration among countries, organizations, and
individual researchers [3]. They investigate the networking
properties among target entities using various SNA methods in
order to analyze the knowledge flow, the balance and
dependencies among entities, and the trend of change in
research cooperation.
978-1-4673-2430-4/12/$31.00 ©2012 IEEE
110
In academic social network analysis, the academic
influence of a network entity (inside the research communities
represented as a social network) is usually evaluated based on
its co-authorship and citation activities [3-8]. The attention is
on how the research community and academic social networks
evolve over time as a result of authors' collaborations
Represented by co-author and citation links. The common
interests in this field are in researchers' connections or
influence based on their academic achievements reflected by
their networking positions among other researchers.
On the other hand, co-patents [9, 10], websites links [11],
and project partnerships [12, 13] are also widely-used sources
for academic social network research. There are a few works,
which tried to apply SNA methods for analyzing the process of
knowledge creation, exchanges and sharing [14, 15, 16].
However, their emphasis is also mainly on the interactions and
collaborations among individuals or organizations in terms of
the communities in the networks.
Our research is different from previous works in that it uses
the keywords network derived from the papers content-based
relationship to present the overall trends of academic research.
III. METHODS
SNA is a methodology to simply represent the relations
between social actors with nodes and edges (for undirected
graphs) or arcs (for directed graphs) using graph theories. In
this way, researchers can utilize various mathematical methods
to investigate the status, power and interactions among social
actors as well as the overall network structure. In this section,
we introduce the selected social network methods applied for
our analysis.
A. General Network Properties
For the 5 derived keyword networks (one for each time
period), we conduct three types of SNA methodologies which
aim at different aspects of the dedicated social network. The
first type is to look into the network attributes [6, 8, 14, 17]
including distance, density, degree, components and centrality
etc. These metrics that reveal some basic and general
topological network characteristics help us to have some
understanding at the coarse level.
B. p-Clique
The second type is a more detailed level analysis which
focuses on the group of vertices of the network, called the p-
clique analysis. Since cliques are suggested to be structured
around some key vertices (e.g. popular or important keyword in
our study), this helps to find out possible highly cohesive
subgroups or clusters. We apply the following definition of p-
clique in our analysis: p-Clique partitions a network to groups
of close neighbors so that vertices inside the group have at least
proportion p (0<p<1) neighbors.
C. Brokerage role
The third type, the Brokerage Role analysis, is at the most
detailed level. Based on the clustering results derived by the p-
clique method, it is more interesting to investigate the roles that
the vertices at different positions of the network could act as
and the corresponding properties they have. The brokerage role
of a vertex is determined by the relative position of the vertex
in a clustered network. Each vertex can have five brokerage
roles according to given partition, including coordinator,
itinerant broker, representative, gatekeeper and liaison. Figure
1 plots a simple illustration of the 5 kinds of brokerage roles.
(In undirected graphs, a representative is equal to a gatekeeper.)
Figure 1. 5 brokerage roles [18]
IV. DATA
In this section, we explain the data source to retrieve
academic publication information in the computer science
discipline and the pre-processing procedure of the data to make
them suitable for our analysis purpose.
A. Libra Database
We used Microsoft Academic Search [19] (also known as
Libra) as the data source of academic social network for
analysis. Currently, Libra provides publication records for
more than 10 disciplines. In this paper, we pick Computer
Science, one of the most representatives to study. Libra
provides an API for data download service and several
visualization features for users to take a glance at the trend of
academic research, top authors, and the network of authors and
papers for each research domain.
For each paper record obtained from Libra, we have some
basic information such as paper title, publication year, authors,
conference/journal, keywords, etc. Although we only focus on
one domain, it still results in a huge number of total paper
records for SNA, which makes the analysis process complex
and intractable.
Therefore, we further simplified the analysis by adding two
filtering conditions: a) paper record whose publication year is
either one of the five time periods will be selected: 1970, 1980,
1990, 2000, 2010, where the different time periods could help
to reveal the change and evolution of the keywords network in
this research discipline; b) paper record to be selected should
have at least one keyword information. In this study, we
directly make use of the results of how keywords are extracted
from papers already done by Libra (in the format of paper-
keyword mapping). Investigating on how Libra achieves this
and discussing on the accuracy and efficiency of related
algorithms are out of the scope of this paper. We will consider
in the future works.
Table I shows the number of papers used for the analysis
and that of distinctive keywords counted for all the papers. The
number of papers and keywords increased exponentially during
the time. In the case of paper numbers, it increased about 3
111
times every 10 year from 1970 to 2000 and about 2.5 times for
keywords.
TABLE I. DATA STATISTICS
1970
1980
1990
2000
2010
# of papers
1,986
6,123
19,157
58,128
# of keywords
1,342
3,319
7,504
17,319
B. Data transformation
The collected information is in a bipartite mode, i.e. papers
vs. keywords. Since the study focuses on the keyword relations,
we transformed the bipartite graph, or the 2-mode matrix into
the 1-mode (keyword) matrix. For each pair of keywords, there
is an edge connecting them if they belong to the same paper.
V. ANALYSIS RESULTS
We used Pajek [18], which is especially good at handling
large amount of data, to conduct the SNA on the keywords
network of computer science academic papers.
A. Genreal properties
The general properties of the keywords network are shown
in Table II. The major findings are, first, the keywords network
of the computer science discipline shows the typical small
world property. Even though the number of network nodes is
large but average distance from one node to the others in the
network is only few steps. Second, these networks are very
sparse and highly decentralized. The density of the network is
only 0.08%~0.24%. The average degree and network
centralization indicate that the keyword networks are also
decentralized networks. It means that there is no outstanding
node but the power among network nodes is relatively evenly
distributed. Lastly, the number of components, which are self-
connected sub-networks, is not ignorable. However, the
tendency that keywords get connected more and more is
observed over time. After 2000, more than 90% of keywords
connected with each other thus the connectedness index
improved drastically. This trend shows that the entire network
is moving from a highly segmented structure to the mesh
structure, in which keywords link to each other more closely.
TABLE II. GENERAL NETWORK PROPERTIES
1970
1980
1990
2000
2010
# of nodes
(keywords)
1,342
3,319
7,504
17,319
17,448
Avg. distance
7.04
5.28
3.74
2.91
3.12
Density
0.0010
0.0008
0.0012
0.0024
0.0016
# of
Components
728
1,293
1,408
583
1,219
Largest
component
306
(23%)
1,704
(51%)
5,898
(79%)
16,687
(96%)
16,156
(93%)
Connectedness
0.05
0.26
0.62
0.93
0.86
Avg. degree
1.34
2.79
8.76
42.37
28.48
Network
centralization
0.02
0.03
0.05
0.19
0.11
B. Clusters
We then investigated the clustering composition inside the
main component in order to look into how keyword-sharing
groups naturally arose and changed over time. Figures 2 to 6
show the clustering results of keyword-sharing groups for each
time period. The representative keyword of a cluster stands for
the most degree-central keyword. Different size for nodes is
applied to represent the size of a cluster (=number of nodes).
The width of links indicates the frequency of interactions
between nodes of two clusters.
1) 1970: The clustering analysis of 1970s keywords
detected 28 clusters out of 306 vertices. The two largest
clusters are covariation function (34) and approximate solution
(33) followed by computer simulation (25) and difference
equation (22). The number of clusters, of which members are
between 10 and 19, is 9. The clusters which have less than 10
members are 15. The average cluster size is 11. The
connectivity between clusters is quite active. Particularly, it is
more visible between approximate solution and computer
simulation, between covariance function and spectrum,
between computational complexity and natural language, and
between computational complexity and turing machine.
Figure 2. Keyword-sharing groups in the main component of the keywords
network in 1970
2) 1980: The total number of vertices increased to 1704
but the number of clusters decreased a little (26). The
composition of clusters became unbalanced. The largest
cluster is the one represend by language production, of which
size is 557. It is followed by 5 leading clusters (lower bound,
distirubted system, pattern recognition, spanning tree, and
visual inspection), of which the number of memebers is
between 149 and 231. We can also detect 5 small groups
(queueing network, optimal design, search algorithm,
symbolic computation, and matrix multiplication) and 15 very
small groups of which the number of members is below 10.
Strong connectivity are detected between clusters of pattern
recognition and langauge production, language production and
lower bound, lower bound and distirubted system, distribute
system and visual inspection, and visual inspection and
spanning tree.
112
Figure 3. Keyword-sharing groups in the main component of the keywords
network in 1980
3) 1990: Significant changes in the clustering structure
from the previous times are deteced. The number of vertices
are now 5898, but only 8 clusters were formed. The cluster
represented by indexing terms is a giant cluster that contains
94% of keywords. It means that almost all keywords are now
shared among papers regardless of their domain categories,
which have been traditionaly considered distinguishable.
Figure 4. Keyword-sharing groups in the main component of the keywords
network in 1990
4) 2000: 7 clusters out of 16687 vertices are found. The
largest cluster includes 99.68% keywords. Interaction among
keywords internalized and networking outside clusters is even
sparser than 1990s.
Figure 5. Keyword-sharing groups in the main component of the keywords
network in 2000
5) 2010: we can observe 5 clusters out of 16156 vetices.
The cluster composition in 2010 is very much similar with
2010. 99.67% of keywords belong to one cluster. In
conclusion, since almost all keywords belong to a single large
cluster, the clustering approach is no longer very effective in
understanding keyword relationships after 1990.
Figure 6. Keyword-sharing groups in the main component of the keywords
network in 2010
C. Brokerage role
Based on the previous analysis of clustering keywords, this
section diagnoses the role of keywords that they took to
promote activities inside a cluster or between clusters (Table
III). The increase tendency of coordinators tells that research
keywords continuously reinforce internal links inside its own
group. From 1990, the majority of keywords took a role of
coordinator. Gatekeepers and liaisons, which connect different
domains together, increased from 1970 to 1980, but suddenly
decreased after 1990. It is because small keyword-sharing
groups converged into one larger keyword-sharing group
through active sharing of keywords in papers of different
domains.
TABLE III. STATISTICS OF BROKERAGE ROLES
1970
1980
1990
2000
2010
Coordinator
17.32%
28.64%
53.59%
71.13%
68.8
%
Itinerant
broker
4.9
%
8.74%
10.09%
0.02%
0.01%
Gatekeeper
21.9%
27.58%
5.07%
0.3%
0.18%
Liaison
14.38%
16.12%
0.54%
0.04%
0.02%
We assume that the dynamics of keywords over time can
follow four paths. First, they are always popular. They are
fundamental concepts and already established in the early time
so until now the centrality and its roles remain important.
Second, they were once popular but as time passes by, the
research focus moved to other domains. Third, at the beginning
they were not very important but strategically positioned to link
multiple domains so later they become much more influential.
This is a possible course when the popularity of
interdisciplinary research at the current time is taken into
consideration. Fourth, they were bridges among different
113
domains in the past but become obsolete because the kind of
interdisciplinary research was not so successful.
We traced the path of 97 keywords from 1970 to 2010 that
had any brokerage role inside a cluster or between clusters in
1970. We observed the status of those keywords in terms of
degree centrality to see how the keywords evolved. The data
were divided into two groups. The first group is coordinators in
1970 and the other is non-coordinator but those who took a
gatekeeper or liaison role. As depicted in Figure 7, the trends of
the two groups are similar but more members of the
gatekeeper/liaison group remained in the core in 2010. 16 of 53
coordinators ranked on top 5% central keywords in 2010 while
27 out of 44 gatekeeper/liaison keywords ranked on top 5%.
These results indicate that the roles of these keywords which
are aggregating information from various domains could make
them competitive to create new corresponding keywords over
time. Similar results can be derived if we repeatedly investigate
on those root keywords.
Figure 7. Status of brokerage keywords of 1970 in 2010
D. Evolution trace of selected keywords
We selected 10 keywords from 1970 to trace the change of
their roles over time in detail. Those 10 keywords were
selected from our observation of change in research trends in
the computer science field as examples to show dynamics of
research interests in this discipline. Table IV describes the roles
of the selected keywords in the brokerage context. C, I, G, and
L represent coordinator, itinerant broker, gatekeeper and liaison
for each. X means that the keyword was not included in the
main component at the time, thus it was excluded from the
cluster and brokerage role analysis. Many of selected keywords
belonged to this category in 1970 because the keyword network
of 1970 was highly segmented. The value in bracket indicates
the degree centrality of the keyword at the time. Programming
language and computational complexity are coordinators as
well as gatekeepers/liaisons in 1970. That means that they were
already in the central position that links keywords inside the
cluster. We can observe that they kept their importance in the
keywords network until 2010. Speech recognition is an
interesting example. It took a role of liaison in 1970 to connect
different domains. As time passes by, it expanded its influence
and become a highly central keyword in later times. The
centrality rank of those keywords that existed in other
components apart from the main component was relatively low
but once they joined the main component, they got recognition
and their rank in degree centrality moved up.
TABLE IV. BROKERAGE ROLES FOR SELECTED KEYWORDS
Keywords
1970
1980
1990
2000
2010
speech
recognition
L
(11.8%)
C,I,G,L
(3.6%)
C
(0.8%)
C
(0.3%)
C
(1.6%)
Programming
Language
C,G,L
(0.4%)
C,I,G,L
(0.1%)
C
(0.1%)
C
(0.2%)
C
(0.4%)
operating
system
(17.4%)
C,I,G,L
(0.4%)
C
(0.2%)
C
(0.2%)
C
(0.9%)
traffic model
X
(15%)
-
(70.1%)
C
(3.7%)
C
(13.2%)
computer
network
X
(19%)
C,G,L
(6.2%)
C
(1.8%)
C
(1.2%)
C
(2.9%)
Computational
Complexity
C,I,G,L
(0.2%)
I,G,L
(5.4%)
C
(0.6%)
C
(0.2%)
C
(0.1%)
dynamic
program
X
(59.2%)
C,G,L
(7.5%)
C
(3.4%)
C
(0.8%)
C
(0.7%)
hash function
X
(65.7%)
(43.6%)
C
(16.3%)
C
(11.3%)
C
(3%)
optical
communication
X
(76.5%)
(25.6%)
C
(53.5%)
C
(15.9%)
C
(20.8%)
stable marriage
X
(76.3%)
-
-
-
C
(33.4%)
a. C: Coordinator, I: Itinerant Broker, G: Gatekeeper, L: Liaison, X: Not defined.
VI. DISCUSSION
In 1970 and 1980, many research communities form small
and independent clusters that use the same jargons. The size of
clustered keyword-sharing groups was relatively evenly
distributed. However, from 1990 to 2010, we observed the
emergence of one agglomerated big community that shared
keywords together. The number of distinct clusters reduced and
relations among clusters become much simpler. These changes
are attributed to strong tendency towards interdisciplinary
research in recent years.
We can also confirm that the time around 1980 was the
most important or fast-changing period to promote inter-
/multidisciplinary research beyond traditional domain
boundaries, since many keywords actively took brokerage roles
not only inside a cluster but between clusters. The development
of gatekeepers and liaisons is particularly important in this time
due to their potential to become innovators to create new inter-
multidisciplinary research by taking advantage of the ability to
link different domains. After 1990, the keywords-network
already formed a big component and has not shown much
dynamics. Therefore, most of keywords become coordinators.
The increasing tendency towards interdisciplinary research
is calling for a review or renewal on the academic domain
classification. With the emergence of new domains and
methodologies in research areas, the classification system has
also been adapted somehow such that some domains were
created and some perished. However, the academic system
seems not flexible enough to follow the ever-changing trend.
By using the predefined hierarchical classification framework,
114
some new research cannot be fully explained by the category
and some cannot even find the right place to fit in. Papers that
were not properly classified might lose the chance to be
properly exposed for search and citation by other researchers
and impact the rapid development of new research ideas.
For future direction, a comparison between our approaches
to derive new classification based on analyzing keyword-
sharing groups, with the traditional classification system is
required. However, for the solution to bridge the gap between
two, an evolutionary approach seems more appropriate than a
revolutionary one. The evolutionary approach can minimize
abrupt changes, and keeps the existing stable domain structure
as a basic framework and adds minor adjustments to it from
time to time; while the revolutionary approach would have
abandoned the entire regime and creates a new domain
structure from scratch.
Through the analysis, we observed that the gatekeeper or
liaison keywords were potential candidates as intermediaries to
connect different domains together and enable the knowledge
flow among them. However, the question is towards
developing a theory of how keywords can dynamically help us
define research domain for classification purposes. In the future,
we will investigate the development of multidisciplinary
research around the gatekeepers or liaisons. However, due to
the huge volume of data, a more systematic approach to deal
with large data must be adopted to test how this type of
keywords develop over time together with other keywords that
have previous contacts with them.
REFERENCES
[1] P. Bonacish, "Power and centrality", American Journal of Socialogy, vol.
92, no. 1, pp. 1170-1182, 1987.
[2] R. S. Burt, Structureal Holes. Cambridge: Harvard University Press,
1992.
[3] W. Glänzel and A. Schubert, "Analyzing scientific networks through co-
authorship", in Handbook of Quantitative Science and Technology
Research, H. F. Moed, W. Glänzel, and U. Schmoch, Eds. Dordrecht,
London: Kluwer Academic Publishers, 2004, pp. 257-276.
[4] C. S. Wagner and L. Leydesdorff, "Mapping the network of global
science: comparing international co-authorships from 1990 to 2000",
International Journal of Technology and Globalisation, vol. 1, no. 2, pp
185-208, 2005.
[5] D. B. Horn, T. A. Finholt, J. P. Birnholtz, D. Motwani, and S. Jayaraman,
"Six degrees of Jonathan Grudin: a social netowrk analysis of the
evolution and impact of CSCW research". Proceedings of the 2004
ACM conference on Computer Supported Cooperative Work (CSCW
'04), New York: ACM, pp. 582-591, 2004.
[6] M. E. J. Newman, "Coauthorship networks and patterns of scientific
collabaration". Proceedings of the National Academy of Sciences, vol.
101, pp. 5200-5205, 2004.
[7] M. Rosvall and C. T. Bergstrom, "Mapping change in large networks",
PLoS ONE, vol. 5, no. 1, 2010.
[8] A. L. Barabasi, H. Jeong, Z. Neda, E. Ravasz, A. Schubert, and T.
Vicsek, "Evolution of the social network of scientific collaborations",
Physica A, vol. 311, pp. 590-614, 2002.
[9] W. Cantner and H. Gra, "The network of innovators in Jena: An
application of Social Network Analysis", Research Policy, vol. 35, no. 4,
pp. 463-480, 2006.
[10] J. Owen-Smith, M. Riccaboni, F. Pammolli, and W. W. Powell, "A
Comparison of US and European University-Industry relations in the
Life Sciences", Management Science, vol. 48, no. 1, pp 24-43, 2002.
[11] J. L. Ortega, I. Aguillo, V. Cothey, and A. Scharnhorst, "Maps of the
academic web in the European Higher Education Area an exploration
of visual web indicators", Scientometrics, vol. 74, no. 2, pp 295-308,
2008.
[12] D. A. Batallas and A. A. Yassine, "Information Leaders in Product
development Organizational Networks: Social Network Analysis of the
Design Structure Matrix", IEEE Transactions on Engineering
Management, vol. 53, no. 4, pp. 570-582, 2006.
[13] A. Greve and J. W. Salaff, "The development of corporate social capital
in complex innovation processes", Research in the Sociology of
Organizations, vol. 18, pp. 107-134, 2001.
[14] H. Clark and R. Pradeep, "Visual representation of knowledge networks:
A social network analysis of hospitality research domain", International
Journal of Hospitality Management, vol. 27, pp. 302-312, 2008.
[15] K. Carley, "Knowledge acquisition as a social phenomenon",
Instructional Science, vol. 14, no. 3-4, pp. 381-438, 1986.
[16] C. H. Lin, Social capital: A theory of social structure and action.
Cambridge: Cambridge University Press, 2001.
[17] L. C. Freeman, "Centrality in Social Networks: Conceptual
Clarification", Social Networks, vol. 1, pp 215-239, 1979.
[18] V. Batagelj and A. Mrvar, Pajek Program for Large Network Analysis,
http://vlado.fmf.uni-lj.si/pub/networks/pajek/
[19] Microsoft Academic Search, http://academic.research.microsoft.com/
115
... The scope of data collection included news containing the keyword "fashion" and published between January 1, 2016, and December 31, 2020, through web crawling. We referred to the five-year unit to select the data period because it is used in prior studies on domestic and overseas industrial trends and social science surveys [61,62]. The search term "fashion" was chosen for the news articles as it included the comprehensive clothing industry sector; clothes, makeup, hairstyles, and accessories [63]. ...
Article
Full-text available
The growth of digital media usage has accelerated the development of big data technology. According to the agenda-setting theory, news media inform the public regarding major agendas and business cycles. This study investigated 168,786 news documents from 2016 to 2020 related the South Korea fashion business using Python. A total of 19 topics were extracted through latent Dirichlet allocation and then transformed into structured data using a time series approach to analyze significant changes in trends. The results indicate that major fashion industry topics include business management strategies to increase sales, diversification of the retail structure, influence of CEOs, and merchandise marketing activities. Thereafter, statistically significant hot and cold topics were derived to identify the shifts in topic themes. This study expands the fashion business contexts with agenda-setting theory through big data time series analyses and can be referenced for the government agencies to support fashion industry policies.
Article
Text mining techniques have become popular in many disciplines in the era of big data. Author profiling analysis, a method of text mining, is useful for identifying the intellectual structures of research fields and the latest research topics that did not appear from author co-citation analysis. This study analyzed the changing intellectual structure of the Human Resources Development (HRD) field in Korea by investigating researcher groups, research areas, and relationships between researchers based on research topics through author profiling analysis. Using selected authors’ articles published during periods 2010–2014 and 2015–2019, an index was extracted from titles, abstracts, and keywords of a total 1961 articles. 2670 indexes were derived through the index cleaning process structured by expert verification and expert cross-validation. The cosine similarity matrix between authors was calculated by applying weights to the index, followed by a clustering and network analysis using PNNC algorithm through WNET program. The results of this study advance discussion surrounding the existing HRD paradigm, definitions of HRD, areas of HRD, theoretical foundations of HRD, and previous studies that analyzed research trends by providing additional layers of granularity and an expanded framework of thought across each topic. The findings also suggested implications for academic and practical aspects and further research.
Thesis
Full-text available
International Development Projects (IDP) are designed to deliver sustained solutions to human life quality threatens. They aim to improve living conditions in emerging countries through initiatives that should provide long-term sustained results. However, the extreme characteristics of IDP contexts challenge traditional project management methodologies. The dynamic nature of stakeholders’ relationships and influence adds additional pressures to the management teams. Higher levels of uncertainty in IDP are faced with non-flexible strategies that compromise the long-term desired results. Not enough participation of relevant stakeholders and beneficiaries limits the impact of development initiatives. The authors could identify that IDP management is overall an under-investigated field. Specifically, contemporary challenges arisen from unbalanced robust/flexible strategies and low stakeholders participation have undermined the impact success of those projects. Moreover, there is no academic study that constructs on how flexibility could be enhanced in IDP, while maintaining control. Methodologies such as design thinking and agile have elucidated new paths of action for better impact and customer satisfaction in other industries, by enabling flexibility and change management. Despite their huge proven success, these methodologies are still a phenomenon limited to IT and design industries. Therefore, in IDP field there is no framework linking contingency and participatory development theories with flexible methods similar to design thinking and agile methodologies. In order to bridge this gap the authors will embark in a qualitative study to explore literature and gain insights from actors within the IDP field about the problem at hand. A multiple embedded case study will be conducted with ID academics and practitioners at supervisory and implementation levels from across the world. A possible solution will be designed for IDP management from a different angle to that of traditional management, in order to build up flexibility without compromising project structure. The proposed framework will tackle flexibility and participation issues by integrating design thinking and agile methodologies into IDP. The theoretical findings suggest that enabling participatory development strategies in the design phase, and expanding the available project knowledge would enhance IDP flexibility. Likewise, IDP flexibility during the implementation phase is impacted by the quality and relevance of information and methodological tools available, stakeholders’ involvement, as well as the handover process. Therefore, since no previous studies interconnecting those theories to enhanced flexibility could be found in IDP, an expansion of available theoretical knowledge on contingency and participatory development theory in IDP is produced. The authors concluded that design thinking and agile principles may conceptually prove useful to effectively deal with the problems identified, thus project design is more adequate, and collaboration among stakeholders proves effective to deal with uncertainty and complexity. These constructs are explained in the propositions made for the SUITE framework to IDP, which aims to practically contribute to the management field of IDP.
Article
This article aims to explore and discuss research trends in global cruise industry using keyword network analysis. We visualize keyword networks in each of four groups of 1982-1999, 2000-2004, 2005-2009, 2010-2014 based on the top 20 keyword nodes' degree centrality and betweenness centrality which are selected among four centrality measurements, comparing them with frequency order. The article shows that keyword frequency collected from 240 articles published in international journals is subject to Zipf's law and nodes degree distribution also exhibits power law. We try to find out research trends in global cruise industry to change some important keywords diachronically, visualizing several networks focusing on the top two keywords, cruise and tourism, belonging to all the four year groups, with high degree and betweenness centrality values. Interestingly enough, a new node, China, connecting the top most keywords, appears in the most recent period of 2010-2014 when China has emerged as one of the rapid development countries in global cruise industry. Therefore keyword network analysis used in this article will be useful to understand research trends in global cruise industry because of increase and decrease of numbers of network types in different year groups and the visual connection between important nodes in giant components.
Article
Full-text available
Science and technology has been shown to result from dynamic interactions occurring at many levels and across various sectors. This study maps and analyses the network operating at the global level. Using the Science Citation Index (CD-Rom version) for 1990 and 2000, the networks created by international co-authorships are revealed. We analyse the observed linkages at the global level and on regional bases. The architecture of the network is further explored using statistical methods and factor analysis to reveal intense relationships as well as the core members of a global network. Findings show that, in the ten years between 1990 and 2000, the global network has expanded to include more nations, but the structure of this network has become more pronounced. Regional networks show emerging hubs. However, in some cases, the regions have been disconnected from the global level. A core set of advanced industrial nations has expanded from six in 1990 to eight in 2000. Factor analysis suggests that various large countries compete with each other for partners in the global network.
Article
Full-text available
Using the Science Citation Index (CD-Rom version) for 1990 and 2000, this paper analyses international co-authorships. The paper presents the methodology for identifying these co-authorships. Analysis is presented on observed linkages at the global level and on regional bases. The architecture of the network is further explored using statistical methods and factor analysis to reveal intense relationships as well as the core members of a global network. Findings show that, in the 10 years between 1990 and 2000, the global network has expanded to include more nations and it has become more interconnected. Regional networks show emerging hubs. Within the global network, a core set of countries has expanded from six in 1990 to eight in 2000. Factor analysis suggests that large countries compete with each other for partners in the global network. We discuss implications for public policy of the rise of a global network of scientists operating somewhat independently of national interests.
Article
Full-text available
The acquisition of knowledge, the development of the individual's knowledge bases, does not occur in a vacuum. The social context, both social structure (the perceived regularities in the social network) and social knowledge, appears to be related to the individual's acquisition of knowledge. In this article, the relationship between the social world and individual knowledge acquisition is explored. A general theory is postulated and then formulated as a model. The basic theory is predicated on the assumption that social interaction is the driving force behind knowledge acquisition.A selection of results pertaining to social learning will be presented. These include, but are not limited to the following:• Social structure affects knowledge acquisition; e.g., individuals in tightly knit social groups develop similar cognitive structures, but not necessarily similar evaluations of those structures. • Social knowledge evolves slowly and relative to shared experience. • Individual knowledge acquisition is limited; e.g., individuals and groups talk past each other because of the lack of critical paths in their respective knowledge bases.
Article
The intuitive background for measures of structural centrality in social networks is reviewed and existing measures are evaluated in terms of their consistency with intuitions and their interpretability.
Article
Intellectual relationships and collaboration networks are the basis for the development of a knowledge domain. The visual representation of such “knowledge networks” contributes to the overall understanding of intellectual collaborations in a particular knowledge domain. Based on the co-authorship data from recent journal publications over a period of five years, the authors applied social network analysis to explore the network structures and identify their network properties in the hospitality research domain. The analysis revealed the core and peripheral networks where the power law distribution was observed on the pattern of publishing academic papers. The overall network was further examined by nine research streams in both “global” and “contextual” views to understand a broad variety of the collaboration patterns of hospitality researchers.
Article
We apply social network analysis methods to describe the evolution of the innovator network of Jena, Germany in the period from 1995 to 2001. We find this evolution to be directed towards an increasing focus on core competencies of the network. Further we analyse the network resulting from R&D cooperations and explain – by means of network regression techniques – that the job mobility of scientists and the technological overlap between the actors, rather than past cooperations, can best predict the resulting structure.
Article
2In an influential paper, Freeman (1979) identified three aspects of centrality: betweenness, nearness, and degree. Perhaps because they are designed to apply to networks in which relations are binary valued (they exist or they do not), these types of centrality have not been used in interlocking directorate research, which has almost exclusively used formula (2) below to compute centrality. Conceptually, this measure, of which c(ot, 3) is a generalization, is closest to being a nearness measure when 3 is positive. In any case, there is no discrepancy between the measures for the four networks whose analysis forms the heart of this paper. The rank orderings by the