
Visualizing Design Erosion: How Big Balls of Mud are Made

September 2018


DOI:10.1109/VISSOFT.2018.00022

Conference: IEEE Working Conference on Software Visualization (VISSOFT 2018)
At: Madrid, Spain

Authors:

David Baum

University of Leipzig

Craig Anslow

Victoria University of Wellington

Richard Müller

Deloitte

Software systems are not static: they have to undergo frequent changes to stay fit for purpose, and in the process their complexity increases. It has been observed that this process often leads to the erosion of the system's design and architecture and, with it, the decline of many desirable quality attributes, such as maintainability. This process can be captured in terms of antipatterns, that is, atomic violations of widely accepted design principles. We present a visualisation that exposes the design of evolving Java programs, highlighting instances of selected antipatterns, including their emergence and cancerous growth. This visualisation assists software engineers and architects in assessing, tracing and therefore combating design erosion. We evaluated the effectiveness of the visualisation in four case studies with ten participants.



David Baum¹, Jens Dietrich², Craig Anslow³, Richard Müller¹

¹ Leipzig University, Germany

Email: {baum, rmueller}@wifa.uni-leipzig.de

² Massey University, New Zealand

Email: J.B.Dietrich@massey.ac.nz

³ Victoria University of Wellington, New Zealand

Email: craig@ecs.vuw.ac.nz


I. INTRODUCTION

Software systems are not static: they have to evolve to stay fit for purpose, and as they do, their complexity increases [27]. This tends to have detrimental effects on their quality [5], [24], as the ability to adapt a program to changing requirements becomes more and more constrained by its complexity. Consequently, desirable quality attributes suffer. The end stage of this process, which many projects reach all too quickly, has been dubbed spaghetti code or big ball of mud [18], while the process itself is often referred to as system rot.

While measuring the quality of software design is subjective, there is a large body of research that tries to assess it by relating it to properties that can be studied by means of static analysis. The idea is to extract a representative model from a software system, and then to measure and query it. One such approach is the study of antipatterns [25] and smells [19]: patterns consisting of artifacts (such as code packages, classes and functions) and their relationships that violate certain design principles. There are catalogs of widely established principles that facilitate a systematic study of the subject [2], [8], [40].

The evolution of antipatterns is under-researched. Better insight into their origins and growth may help software engineers to maintain large projects, and managers to allocate resources for those tasks. For instance, it is useful to know how a part of the system exhibiting strong coupling between modules came about. Tracing this back to the version of the system in which the respective dependencies between modules first emerged, and to the respective people, commit messages and other documentation (not) revealing their objectives at the time, provides valuable information for making informed decisions about a suitable strategy to respond to those issues. The study of design evolution reveals when bad design becomes rampant and can relate this to events such as product releases under time pressure or team changes, which helps with future planning by assessing the implications of such events more accurately.

We present a visualisation of software design that focuses on evolution in general, and on the evolution of selected antipatterns in particular. The purpose of this visualisation is to help software engineers better understand the emergence of design problems. The usage of the visualisation is demonstrated in a screencast¹. Additionally, a demo including an interactive tutorial is available online². We first review related work, followed by a discussion of the visualisation metaphor used and various implementation issues. The evaluation is presented in Section IV, followed by a discussion and a brief conclusion.

II. RELATED WORK

a) Antipatterns, Design and Evolution: The discussion of elements of poor design can be traced back to the early seminal work on software design. We chose to study circular dependencies and subtype knowledge (STK) as they directly relate to violations of widely accepted principles of object-oriented design, namely the Acyclic Dependencies Principle [30] and the Dependency Inversion Principle [38]. What is more, they have precise definitions that facilitate formalisation and therefore the implementation of tools to detect those patterns, and there are detection algorithms that scale well even for large programs. Circular dependencies were first discussed by Parnas, who suggested keeping dependencies between modules loop-free [36].

Empirical studies on larger corpora of real-world programs started in the early 2000s and revealed that, surprisingly, antipatterns are prevalent [45]. This was first discovered for circular dependencies [31], and later confirmed to apply to other antipatterns as well [14]. Antipatterns can be detected by means of static analysis before a system is deployed. The main issue here is the use of dynamic programming language features that create dependencies that may not be visible when the static analysis models are built. This area is generally under-researched, and we must assume that the models used only under-approximate the behaviour of the actual program. In particular, dependency graphs may not contain all edges showing actual program dependencies.

¹ https://youtu.be/RBgQnE-ozQQ

² https://home.uni-leipzig.de/svis/getaviz-antipattern/demo.html

b) Evolution Visualisation: Assessing software quality and improving refactoring decisions are core tasks of software visualisations. Many visualisations have been developed to support these tasks, e.g. by visualising the system's structure, call graphs, and dependency graphs [4], [21]. Dependency graphs are usually visualised as node-link diagrams, enriched with further information [15], [37]. Since these visualisations do not convey any evolutionary information, they do not provide any insight into the emergence and evolution of the dependencies.

There exist many approaches to visualise software evolution [46]. Most visualisations aim to provide an improved understanding of the development activities by visualising structural changes, e.g. by using added and removed lines as metrics [11], [16], [22], [33], [39], [44], [47] or by providing highly aggregated information [26], [34], [42]. Our use case requires the visualisation of the structural evolution of the system and the antipattern instances at the same time. We are not aware of any evolution visualisation that supports this. There exist evolution visualisations of call graphs [7], [17], [23]. However, they do not provide any structural information.

III. VISUALISATION METAPHOR AND IMPLEMENTATION

a) Dependency Graph Construction: The conceptual model of the visualisation presented here is based on a dependency graph extracted from Java bytecode. The graph extraction is based on an ASM-based bytecode analysis [10]. JDG [1] is used to visit the bytecode instructions extracted by ASM and create a directed labelled graph, using data structures from the JUNG network library [35]. JDG processes bytecode in a two-pass process: first, all types are collected and added as vertices to the dependency graph under construction. In the second pass, JDG tracks all occurrences of types in a particular class file and records them as dependency edges in the graph, classifying them as extends, implements or uses relationships. This corresponds largely to compile-time dependencies, although there are some subtle differences. In particular, the Java compiler inlines constants, which leads to a slight under-reporting of compile-time dependencies in our model.
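The two-pass construction can be illustrated with a minimal sketch. It does not use ASM, JDG or JUNG; instead it assumes a hypothetical, already-parsed input (`CLASSES`) in which each analysed type lists the types it references per relationship kind. Restricting edges to types collected in the first pass is also an assumption of this sketch, not something the text above specifies.

```python
# Hypothetical, simplified input: a real pipeline would obtain these
# references from class files; the names below are purely illustrative.
CLASSES = {
    "app.Shape": {"extends": [], "implements": [], "uses": ["app.Circle"]},
    "app.Circle": {
        "extends": ["app.Shape"],
        "implements": ["app.Drawable"],
        "uses": ["java.lang.Math"],
    },
    "app.Drawable": {"extends": [], "implements": [], "uses": []},
}


def build_dependency_graph(classes):
    """Return (vertices, edges); edges are (source, target, kind) triples."""
    # Pass 1: every analysed type becomes a vertex.
    vertices = set(classes)

    # Pass 2: record each occurrence of a type as a labelled edge.
    edges = set()
    for source, references in classes.items():
        for kind in ("extends", "implements", "uses"):
            for target in references[kind]:
                # Assumption: only edges between analysed types are kept.
                if target in vertices and target != source:
                    edges.add((source, target, kind))
    return vertices, edges


vertices, edges = build_dependency_graph(CLASSES)
for edge in sorted(edges):
    print(edge)
```

In this toy input, `app.Shape` uses its own subtype `app.Circle`, so the resulting graph contains both a cycle and a subtype-knowledge relationship.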

Fig. 1. Example of one antipattern splitting into two independent antipatterns (left) and two independent antipatterns merging into one antipattern (right)

Once a dependency graph has been constructed, it can be queried for antipattern instances. To detect circular dependencies, we use an implementation of Tarjan's algorithm [43]. To detect STK instances, we use the guery motif engine [13]. The extraction pipeline is very similar to the pipeline used by the Massey Architecture Explorer [12].
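As an illustration of the cycle detection step, the following is a small recursive sketch of Tarjan's strongly connected components algorithm on an adjacency-list graph. It is not the implementation used in the tool; treating only components with more than one type as circular dependencies (and ignoring self-loops) is an assumption of this sketch.

```python
def strongly_connected_components(graph):
    """Tarjan's algorithm; graph maps each vertex to its successors."""
    index = {}       # discovery order of each vertex
    lowlink = {}     # smallest index reachable from the vertex's subtree
    stack = []
    on_stack = set()
    components = []
    counter = 0

    def visit(v):
        nonlocal counter
        index[v] = lowlink[v] = counter
        counter += 1
        stack.append(v)
        on_stack.add(v)
        for w in graph.get(v, ()):
            if w not in index:
                visit(w)
                lowlink[v] = min(lowlink[v], lowlink[w])
            elif w in on_stack:
                lowlink[v] = min(lowlink[v], index[w])
        if lowlink[v] == index[v]:
            # v is the root of a component: pop it off the stack.
            component = set()
            while True:
                w = stack.pop()
                on_stack.discard(w)
                component.add(w)
                if w == v:
                    break
            components.append(component)

    for v in graph:
        if v not in index:
            visit(v)
    return components


def circular_dependencies(graph):
    # Assumption: a cycle needs at least two distinct types.
    return [c for c in strongly_connected_components(graph) if len(c) > 1]


example = {"A": ["B"], "B": ["C"], "C": ["A", "D"], "D": []}
print(circular_dependencies(example))
```

The recursive form is adequate for small graphs; for very large dependency graphs an iterative variant would avoid Python's recursion limit.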

Finally, we rank the vertices representing types in the dependency graph according to their severity by assigning a value between 0 (least severe) and 1 (most severe). For STK, 1 is assigned to the supertype, and 0 to the subtype. Intermediate vertices in the dependency chain connecting the supertype with the subtype are assigned a value of 0.66 if they are abstract, and 0.33 otherwise. Intermediate vertices in the subtype chain from the subtype back to the supertype are all assigned a value of 0.0. In the following, we call these values the STK rank. While the actual numerical values used here are arbitrary, they reflect the intention of this antipattern as it relates to violations of the dependency inversion principle [29].
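The ranking rule for a single STK instance can be written down directly. The sketch below assumes a hypothetical representation in which an instance is given as two paths: the dependency chain from supertype to subtype, and the inheritance chain from subtype back to supertype. How a type appearing on both chains should be ranked is not stated above; here the dependency-chain value wins, which is an assumption.

```python
def stk_ranks(dependency_path, inheritance_path, abstract_types):
    """Assign an STK rank to every type of one STK instance.

    dependency_path:  [supertype, ..., subtype] along uses relationships
    inheritance_path: [subtype, ..., supertype] along extends/implements
    abstract_types:   set of type names that are abstract
    """
    supertype, subtype = dependency_path[0], dependency_path[-1]
    ranks = {}

    # Intermediate types on the way back up the inheritance chain.
    for type_name in inheritance_path[1:-1]:
        ranks[type_name] = 0.0

    # Intermediate types on the dependency chain (assumed to take precedence).
    for type_name in dependency_path[1:-1]:
        ranks[type_name] = 0.66 if type_name in abstract_types else 0.33

    ranks[supertype] = 1.0
    ranks[subtype] = 0.0
    return ranks


ranks = stk_ranks(
    dependency_path=["Shape", "AbstractHelper", "Factory", "Circle"],
    inheritance_path=["Circle", "RoundShape", "Shape"],
    abstract_types={"Shape", "AbstractHelper"},
)
print(ranks)
```

All type names in the example are invented for illustration.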

To rank the severity of types in circular dependencies, we use the (min-max-normalised) betweenness centrality [20], computed using Brandes' fast algorithm [9]. The intention here is that it is particularly critical for a class to be in an antipattern if it has more responsibility within the program topology. This is similar to the approach suggested by Martin [28]; however, by using betweenness centrality rather than just assessing the relative out-degree, we consider more than the localised impact of dependencies.
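For readers unfamiliar with the measure, the following sketch computes betweenness centrality with Brandes' algorithm for an unweighted directed graph and then applies min-max normalisation. Whether the tool normalises over the whole graph or per antipattern instance is not stated above; the sketch simply normalises over all vertices it is given, which is an assumption.

```python
from collections import deque


def betweenness_centrality(graph):
    """Brandes' algorithm; graph maps each vertex to its successors."""
    vertices = set(graph) | {w for succ in graph.values() for w in succ}
    centrality = dict.fromkeys(vertices, 0.0)

    for s in vertices:
        # Breadth-first search from s, counting shortest paths (sigma)
        # and remembering the predecessors on those paths.
        order = []
        preds = {v: [] for v in vertices}
        sigma = dict.fromkeys(vertices, 0)
        dist = dict.fromkeys(vertices, -1)
        sigma[s], dist[s] = 1, 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph.get(v, ()):
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)

        # Accumulate pair dependencies, farthest vertices first.
        delta = dict.fromkeys(vertices, 0.0)
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                centrality[w] += delta[w]
    return centrality


def min_max_normalise(values):
    lowest, highest = min(values.values()), max(values.values())
    if highest == lowest:
        return {key: 0.0 for key in values}
    return {k: (v - lowest) / (highest - lowest) for k, v in values.items()}


chain = {"A": ["B"], "B": ["C"], "C": []}
print(min_max_normalise(betweenness_centrality(chain)))
```

In the three-type chain, only `B` lies on a shortest path between two other types, so it receives the highest normalised value.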

To trace the evolution of an antipattern over multiple versions, it has to be determined whether its occurrence in a version is the result of the evolution of an antipattern in the predecessor version, or a new, different antipattern instance. We define an instance to be the continuation of a predecessor instance if the instance in the successor version contains at least 50% of the types of the antipattern instance in the predecessor version, or the other way around. This implies that an antipattern can split into multiple independent antipattern instances as a system evolves. It is also possible that two independent antipattern instances merge and create a joint antipattern (cf. Fig. 1).
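The 50% rule can be expressed as a simple set-overlap test. The sketch below reads the definition as: the shared types make up at least half of the predecessor instance, or at least half of the successor instance. The function names and the handling of empty instances are assumptions made for illustration.

```python
def is_continuation(predecessor, successor, threshold=0.5):
    """True if successor is considered an evolution of predecessor."""
    if not predecessor or not successor:
        return False  # assumption: empty instances never match
    shared = len(predecessor & successor)
    return (shared >= threshold * len(predecessor)
            or shared >= threshold * len(successor))


def link_versions(old_instances, new_instances):
    """Return (old index, new index) pairs of linked antipattern instances."""
    return [
        (i, j)
        for i, old in enumerate(old_instances)
        for j, new in enumerate(new_instances)
        if is_continuation(old, new)
    ]


# One instance splits into two; a third instance is entirely new.
old = [{"A", "B", "C", "D"}]
new = [{"A", "B"}, {"C", "D"}, {"X", "Y"}]
print(link_versions(old, new))
```

Because the test is applied pairwise, one predecessor may link to several successors (a split) and several predecessors may link to one successor (a merge), matching the two situations shown in Fig. 1.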

b) Visualisation Design: Getaviz³ is an open source toolkit for designing and generating software visualisations [6]. Getaviz includes the automatic generation of visualisations for several visualisation metaphors. Further, it comes with a highly configurable browser-based user interface (UI) for viewing and interacting with a visualisation. Currently, the UI only supports X3DOM [3] as rendering platform. Getaviz can be easily extended to support new visualisation metaphors and interaction components. Hence, we used Getaviz as a starting point and customised the application to fit our requirements for antipatterns.

³ https://github.com/softvis-research/Getaviz

Fig. 2. Visualisation of the largest cyclic dependency of antlr, including dependencies between 239 classes: (a) straight edges, (b) force-directed edge bundling

Fig. 3. Screenshot of Getaviz showing MongoDB Java Driver. 1) Antipattern Explorer 2) Version Selector 3) Legend and Configuration

To fit the presented use case, a visualisation with many degrees of freedom is necessary, so that the structural evolution and the antipattern evolution can be visualised at the same time. We chose the two-dimensional Recursive Disk (RD) metaphor for structure visualisation [32] and enriched it with information about evolution and design erosion. Every class and package is represented by a circular disk. Their area is determined by the normalised betweenness centrality. The STK rank is visualised using a colour scale ranging from green (0) to red (1). The disks are nested according to the package hierarchy, following a similar presentation in mainstream development tools like IDEs. Since the developer will work with the visualisation and the IDE at the same time, we believe it is important to align both presentations to make it easier to locate entities of interest.
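To make the two visual mappings concrete, here is a hedged sketch of how a normalised centrality value could be turned into a disk radius and how an STK rank could be turned into a colour between green and red. The linear area interpolation, the `min_area` and `max_area` parameters and the RGB interpolation are all assumptions for illustration; the actual mapping used by Getaviz is not described in that detail here.

```python
import math


def disk_radius(centrality, min_area=1.0, max_area=10.0):
    """Map a normalised centrality in [0, 1] to a radius.

    The area, not the radius, grows linearly with the value, so that
    perceived disk size stays proportional to centrality.
    """
    centrality = max(0.0, min(1.0, centrality))
    area = min_area + centrality * (max_area - min_area)
    return math.sqrt(area / math.pi)


def stk_colour(rank):
    """Map an STK rank in [0, 1] to a hex colour from green to red."""
    rank = max(0.0, min(1.0, rank))
    red = round(255 * rank)
    green = round(255 * (1 - rank))
    return f"#{red:02x}{green:02x}00"


print(stk_colour(0.0), stk_colour(1.0))
print(disk_radius(0.0), disk_radius(1.0))
```

Scaling the area rather than the radius is a common choice for circular glyphs, because scaling the radius linearly would exaggerate large values quadratically.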

The Antipattern Explorer lists all detected antipattern instances in a side bar. When an instance is selected, all dependencies between the corresponding entities are visualised through red connectors. The line thickness reflects the importance of the dependency, so the most critical dependency can be seen at first glance. To increase readability, the user can choose between straight edges and force-directed edge bundling (cf. Fig. 2). All entities that do not belong to the selected antipattern are greyed out, so the developer can focus on the relevant elements.

Multiple versions can be visualised by piling up the two-dimensional disk visualisations along the z-axis, which leads to a three-dimensional visualisation. The x and y coordinates of a disk are stable, so that different versions of the same class are exactly above each other, which makes it easier to visually track classes across different versions. The disks are positioned in a helical layout. This leads to empty space, but reduces occlusion and increases readability. Through the Version Selector the user can hide uninteresting versions, e.g. minor versions.

Alternatively, multiple versions could have been visualised one after the other through animation. However, the user would then need to remember all classes and relations to spot changes. This is error-prone and time-consuming [39].

Small multiples are a good choice for visualisations that can be viewed at a glance. Software visualisations, by contrast, are large and require navigation. Since navigating in multiple visualisations simultaneously can be troublesome, we preferred the three-dimensional representation. Nevertheless, both alternatives are reasonable and have pros and cons.

IV. EVALUATION

We investigated the effectiveness of the visualisation based on four case studies. We configured Getaviz to visualise multiple versions of antlr, JavaMail, MongoDB Java Driver and Undertow. The systems were chosen to cover projects of different sizes, ranging from 300 classes (JavaMail) to 1,500 classes (Undertow), and include standard software and research prototypes.

As expected, we found many “big balls of mud” across all systems. Every system contained circular dependencies with several dozen classes. However, the visualisations did not show how they grew from a small antipattern of only a few classes to an antipattern with over 200 classes. In almost every case, an antipattern appeared out of nowhere in a version and stayed unchanged in newer versions. We have not found antipattern instances that decreased over time or were dissolved completely. If an instance disappeared, it was because the corresponding classes had been removed in the new version. This already demonstrates the validity of the visualisation, since we would not have come to this insight without it, and it indicates that developers are not aware of these antipattern instances or do not know how to resolve them.

We invited ten participants (9 male, 1 female) to explore Getaviz. They were not paid and freely opted to participate in the study. All of them have multi-year experience in software development and assess their skills as at least average. First, they completed an interactive tutorial to get familiar with the visualisation. The evaluation included three comprehension tasks. After each task we asked the participants to rate the effectiveness of different parts of the visualisation (cf. Fig. 4). We used a 5-point Likert scale, where 1 is very ineffective and 5 is very effective. Additionally, the participants were asked which aspects they found (in)effective and whether they had further suggestions to improve the visualisation.

Fig. 4. Overview of effectiveness ratings on a Likert scale

Task 1: Which version reduced the quality of the system the most?

To solve this task, the participants had to compare the design erosion for every version. The visualisation of the structure and the antipattern explorer were rated as slightly effective; the version selector and the representation of multiple versions in parallel were rated more effective. The participants stated that the visualisation can be explored in an intuitive way and that comparing versions is easy. They used the version selector to show only one or two versions at the same time.

Task 2: Which packages are part of the original Circular Dependency Component 1?

To solve this task, the participants had to identify the first appearance of the circular dependency and gather the corresponding package names by hovering over them to see the tooltip. The visualisation of the structure and the version layering were rated as slightly ineffective. The antipattern explorer was rated as slightly effective. The participants especially liked the visualisation of the dependencies between the classes. The version selector was again the most effective part. To solve this task, the participants had to navigate several times within the visualisation. This was the most challenging part of the task. Participants sometimes lost track of the elements of interest or needed several attempts to move the visualisation to the desired position.

Task 3: With which class would you start refactoring?

This task refers to the most recent version only. Hence, the visualisation of multiple versions is superfluous, and it was rated as neither effective nor ineffective. The participants used the version selector and rated it as effective. However, some participants stated that it is time-consuming to hide every uninteresting version individually and that it would be more convenient to switch the displayed version with one click. The structure visualisation and the antipattern explorer were rated as more effective. The participants stated that problematic classes are easy to detect, but that the visualisation is too cluttered on the one hand and lacks further information to answer the question thoroughly on the other.

V. DISCUSSION

Almost all refactoring decisions of the participants are reasonable. We are satisfied with these initial results, although there is significant potential for improvement and some design choices should be reconsidered. In the following, we discuss some problems identified by the study and how they could be addressed in future. Some ideas arose from reviewing the participants' answers; others were suggested by the participants directly.

a) Technical limitations: The participants' complaints about navigation concern the sometimes confusing behaviour of X3DOM and not the actual visualisation. For instance, zooming via the scroll wheel works in the opposite direction to the usual behaviour. The visualisation makes extensive use of transparency, but the transparency support in X3DOM is defective. In some cases transparent elements are displayed as opaque, so that other elements are occluded. The version layering would work much better with better support for transparency. To overcome these restrictions, we will switch from X3DOM to Mozilla's A-Frame, which provides superior transparency handling and navigation capabilities.

b) Colour Mapping: The disk colour depicts only one isolated quality aspect. This is misleading, as users might interpret it as an overall quality indicator even if the legend states otherwise. In Task 3, most participants chose classes represented by red vertices. This is not necessarily wrong, but might indicate that they based their decision mainly on this colour. To overcome this issue, we may prefer a more neutral colour palette [41].

c) Version layering: The version layering was useful for some tasks, but can lead to complicated navigation and cluttered visualisations. As already stated, the situation could be improved by migrating to A-Frame. Nevertheless, the participants perceived an information overload and reduced the visible versions to one or two. Still, tracing antipatterns through different versions was quite effective and was expressly praised by two participants. Therefore, the best solution is probably to support the current version layering as well as small multiples in order to support more tasks effectively.

d) Supported tasks: The visualisation has to cover more tasks and quality measures to be a comprehensive visual analytics tool for assessing the design erosion of large and complex software systems. For example, only one antipattern instance can currently be highlighted at a time. For assessing the overall quality of a version, it would be better to highlight all instances at the same time and use different colours to distinguish them. Further, Getaviz should support more antipatterns. Once the antipatterns are detected by static analysis tools, they can be integrated into the visualisation easily.

e) Scalability: Scalability is a known issue for large software visualisations, especially when multiple versions are depicted. We are able to visualise systems with up to 500,000 LOC and about ten versions at the same time. However, the visualisation can comprise many more versions if they are loaded on demand.

f) Threats to Validity: We conducted only a preliminary study with ten participants. An extensive evaluation is necessary once the biggest issues revealed have been solved. The largest system in the evaluation has about 1,500 classes. The visualisation might become more confusing on larger systems due to more edge crossings and a higher number of involved classes in general. Hence, the effectiveness of the visualisation has to be evaluated for large systems in a controlled manner. The participants rated the effectiveness of the visualisation without a direct comparison to alternative approaches, e.g. doing the tasks directly in an IDE or using conventional visualisations.

VI. CONCLUSION

We demonstrated that Getaviz is an easy-to-adopt framework for the visualisation of program evolution. We were able to visualise the erosion of a system's design and architecture, using two antipatterns, cyclic dependencies and STK, as examples. The validation with end users indicates that the tool has potential to assist software engineers in gaining a better understanding of design erosion, and to use this understanding for corrective refactoring. Findings from our evaluation revealed significant potential for improvement, to be addressed in future work.

REFERENCES

[1] JDG - a dependency graph extractor for Java bytecode. https://bitbucket.org/jensdietrich/jdg/. Accessed: 2018-05-20.

[2] The WikiWikiWeb anti patterns catalog. http://wiki.c2.com/?AntiPatternsCatalog. Accessed: 2018-05-20.

[3] X3DOM. https://www.x3dom.org/. Accessed: 2018-05-22.

[4] C Anslow, S Marshall, J Noble, and R Biddle. SourceVis: Collaborative software visualization for co-located environments. In VISSOFT, 2013.

[5] R D Banker, S M Datar, C F Kemerer, and D Zweig. Software complexity and maintenance costs. Comm. of the ACM, 36(11), 1993.

[6] D Baum, J Schilbach, P Kovacs, U Eisenecker, and R Müller. GETAVIZ: Generating Structural, Behavioral, and Evolutionary Views of Software Systems for Empirical Evaluation. VISSOFT, 2017.

[7] D Beyer and A E Hassan. Evolution Storyboards: Visualization of Software Structure Dynamics. In ICPC. IEEE, 2006.

[8] J S Bradbury and K Jalbert. Defining a catalog of programming anti-patterns for concurrent Java. In SPAQu. Citeseer, 2009.

[9] U Brandes. A faster algorithm for betweenness centrality. J. Math. Sociol., 25(2), 2001.

[10] E Bruneton, R Lenglet, and T Coupaye. ASM: a code manipulation tool to implement adaptable systems. Adaptable and extensible component systems, 30(19), 2002.

[11] M D’Ambros, M Lanza, and H Gall. Fractal Figures: Visualizing Development Effort for CVS Entities. In 3rd Int. Workshop on Visualizing Software for Understanding and Analysis (VISSOFT). IEEE, 2005.

[12] J Dietrich. The Massey Architecture Explorer. http://xplrarc.massey.ac.nz/. Accessed: 2018-05-20.

[13] J Dietrich and C McCartin. Scalable motif detection and aggregation. In ADC. Australian Computer Society, Inc., 2012.

[14] J Dietrich, C McCartin, E Tempero, and S M A Shah. Barriers to modularity - an empirical study to assess the potential for modularisation of Java programs. In QoSA. Springer, 2010.

[15] J Dietrich, V Yakovlev, C McCartin, G Jenson, and M Duchrow. Cluster analysis of Java dependency graphs. Proc. of the 4th ACM Symposium on Software Visualization - SoftVis ’08, (May 2014):91, 2008.

[16] X Dong and M W Godfrey. Identifying Architectural Change Patterns in Object-Oriented Systems. In ICPC. IEEE, 2008.

[17] M Fischer, J Oberleitner, H Gall, and T Gschwind. System evolution tracking through execution trace analysis. In Int. Workshop on Prog. Comp., 2005.

[18] B Foote and J Yoder. Big ball of mud. Pattern languages of program design, 4, 1997.

[19] M Fowler and K Beck. Refactoring: improving the design of existing code. Addison-Wesley Professional, 1999.

[20] L C Freeman. A set of measures of centrality based on betweenness. Sociometry, 1977.

[21] N Hawes, S Marshall, and C Anslow. CodeSurveyor: Mapping large-scale software to aid in code comprehension. VISSOFT, 2015.

[22] A Hindle, Z M Jiang, W Koleilat, M W Godfrey, and R C Holt. YARN: Animating Software Evolution. In 4th Int. Workshop on Visualizing Software for Understanding and Analysis. IEEE, 2007.

[23] P Khaloo, M Maghoumi, and D Bettner. Code Park: A New 3D Code Visualization Tool. In VISSOFT, 2017.

[24] F Khomh, M Di Penta, Y Guéhéneuc, and G Antoniol. An exploratory study of the impact of antipatterns on class change- and fault-proneness. Emp. Software Engineering, 17(3), 2012.

[25] A Koenig. Patterns and antipatterns. J. of Object-Oriented Programming, 8(1), 1995.

[26] M Lanza and S Ducasse. CodeCrawler - an information visualization tool for program comprehension. ICSE, pages 5–9, 2005.

[27] M Lehman. Programs, life cycles, and laws of software evolution. Proc. of the IEEE, 68(9):1060–1076, 1980.

[28] R C Martin. Object oriented design quality metrics: An analysis of dependencies. Report on object analysis and design, 2(3), 1995.

[29] R C Martin. The dependency inversion principle. C++ Report, 1996.

[30] R C Martin. Design principles and design patterns. Object Mentor, 1(34), 2000.

[31] H Melton and E Tempero. An empirical study of cycles among classes in Java. Empirical Software Engineering, 12(4), 2007.

[32] R Müller and D Zeckzer. The Recursive Disk Metaphor – A Glyph-based Approach for Software Visualization. In IVAPP. SciTePress, 2015.

[33] S Neu, M Lanza, L Hattori, and M D’Ambros. Telling stories about GNOME with Complicity. In 6th Int. Workshop on Visualizing Software for Understanding and Analysis (VISSOFT). IEEE, 2011.

[34] R L Novais, C A N Lima, G de F. Carneiro, P R M S Junior, and M Mendonca. An interactive differential and temporal approach to visually analyze software evolution. In 6th Int. Workshop on Visualizing Software for Understanding and Analysis (VISSOFT). IEEE, 2011.

[35] J O'Madadhain, D Fisher, P Smyth, S White, and Y Boey. Analysis and visualization of network data using JUNG. J. Stat. Softw., 10(2), 2005.

[36] D L Parnas. Designing software for ease of extension and contraction. IEEE Transactions on Software Engineering, (2), 1979.

[37] M Pinzger, K Gräfenhain, P Knab, and H C Gall. A tool for visual understanding of source code dependencies. ICPC, 2008.

[38] A J Riel. Object-oriented design heuristics, volume 335. Addison-Wesley Reading, 1996.

[39] J Schilbach. Analyse, Erzeugung und Evaluation animierter Softwarevisualisierungen. Dissertation, Leipzig University, Leipzig, 2018.

[40] C U Smith and L G Williams. Software performance antipatterns. In Workshop on Software and Performance, volume 17, 2000.

[41] M Stone. Field Guide to Digital Color. A. K. Peters, Ltd., Natick, MA, USA, 2002.

[42] E Sultanow, M Tobolla, and G Vladova. Visual Analytics Supporting Knowledge Management:. i-KNOW, 2017.

[43] R Tarjan. Depth-first search and linear graph algorithms. SIAM J. Comput., 1(2), 1972.

[44] A Telea and L Voinea. Interactive Visual Mechanisms for Exploring Source Code Evolution. In 3rd Int. Workshop on Visualizing Software for Understanding and Analysis. IEEE, 2005.

[45] E Tempero, C Anslow, J Dietrich, T Han, J Li, M Lumpe, H Melton, and J Noble. Qualitas corpus: A curated collection of Java code for empirical studies. In APSEC2010, December 2010.

[46] A R Teyseyre and M R Campo. An overview of 3D software visualization. IEEE Transactions on Visualization and Comp. Graphics, 2009.

[47] E F Vernier, P E Rauber, J L D Comba, R Minghim, and A C Telea. Metric Evolution Maps: Multidimensional Attribute-driven Exploration of Software Repositories. In Vision, Modeling & Visualization, 2016.

Understanding Conditional Compilation Through Integrated Representation of Variability and Source Code

Preprint

Full-text available

Aug 2019

The C preprocessor (CPP) is a standard tool for introducing variability into source programs and is often applied either implicitly or explicitly for implementing a Software Product Line (SPL). Despite its practical relevance, CPP has many drawbacks. Because of that it is very difficult to understand the variability implemented using CPP. To facilitate this task we provide an innovative analytics tool which bridges the gap between feature models as more abstract representations of variability and its concrete implementation with the means of CPP. It allows to interactively explore the entities of a source program with respect to the variability realized by conditional compilation. Thus, it simplifies tracing and understanding the effect of enabling or disabling feature flags.

Taxonomy of Architecture Maintainability Smells

Conference Paper

Dec 2023

Understanding Conditional Compilation through Integrated Representation of Variability and Source Code

Conference Paper

Full-text available

Sep 2019

Understanding Software Architecture Erosion: A Systematic Mapping Study

Preprint

Full-text available

Feb 2022

Architecture erosion (AEr) can adversely affect software development and has received significant attention in the last decade. However, there is an absence of a comprehensive understanding of the state of research about the reasons and consequences of AEr, and the countermeasures to address AEr. This work aims at systematically investigating, identifying, and analyzing the reasons, consequences, and ways of detecting and handling AEr. With 73 studies included, the main results are as follows: (1) AEr manifests not only through architectural violations and structural issues but also causing problems in software quality and during software evolution; (2) non-technical reasons that cause AEr should receive the same attention as technical reasons, and practitioners should raise awareness of the grave consequences of AEr, thereby taking actions to tackle AEr-related issues; (3) a spectrum of approaches, tools, and measures has been proposed and employed to detect and tackle AEr; and (4) three categories of difficulties and five categories of lessons learned on tackling AEr were identified. The results can provide researchers a comprehensive understanding of AEr and help practitioners handle AEr and improve the sustainability of their architecture. More empirical studies are required to investigate the practices of detecting and addressing AEr in industrial settings.

Understanding Software Architecture Erosion: A Systematic Mapping Study

Article

Full-text available

Feb 2022

A systematic mapping study on architectural smells detection

Article

Dec 2020
J SYST SOFTWARE

The recognition of the need for high-quality software architecture is evident from the increasing trend in investigating architectural smells. Detection of architectural smells is paramount because they can seep through to design and implementation stages if left unidentified. Many architectural smells detection techniques and tools are proposed in the literature. The diversity in the detection techniques and tools suggests the need for their collective analysis to identify interesting aspects for practice and open research areas. To fulfill this, in this paper, we unify the knowledge about the detection of architectural smells through a systematic mapping study. We report on the existing detection techniques and tools for architectural smells to identify their limitations. We find there has been limited investigation of some architectural smells (e.g., micro-service smells); many architectural smells are not detected by tools yet; and there are limited empirical validations of techniques and tools. Based on our findings, we suggest several open research problems, including the need to (1) investigate undetected architectural smells (e.g., Java package smells), (2) improve the coverage of architecture smell detection across architectural styles (e.g., service-oriented and cloud), and (3) perform empirical validations of techniques and tools in industry across different languages and project domains.

The Recursive Disk Metaphor - A Glyph-based Approach for Software Visualization

Presentation

Full-text available

Mar 2015

In this paper, we present the recursive disk metaphor, a glyph-based approach to software visualization. The metaphor represents all important structural aspects and relations of software using nested circular glyphs. The result is a shape with an inner structural consistency and a completely defined orientation. We compare the recursive disk metaphor to other state-of-the-art 2D approaches that visualize structural aspects and relations of software. Further, a case study shows the feasibility and scalability of the approach by visualizing an open source software system in a browser.

Code Park: A New 3D Code Visualization Tool

Conference Paper

Full-text available

Sep 2017

GETAVIZ: Generating Structural, Behavioral, and Evolutionary Views of Software Systems for Empirical Evaluation

Conference Paper

Full-text available

Sep 2017

Software visualizations are used to support stakeholders in software engineering activities like development, project management, and maintenance. The respective tasks determine which aspects of software, i.e., structural, behavioral and/or evolutionary information, need to be visualized. To promote the usage of software visualizations, they have to optimally support the needs of the respective stakeholder for the specific task at hand. Therefore, we see the necessity to create innovative visualizations and to optimize existing ones. In order to achieve this, it is necessary to empirically evaluate the different visualizations and their variants. In this paper, we present GETAVIZ as a toolset to support these processes, i.e., designing visualizations, generating task- and role-specific visualizations, and conducting empirical evaluations. The toolset implements the concept of generative and model-driven software visualization and makes it possible to generate different visualizations for all three aspects of software. Its strength lies in its adaptability, so that new visualizations and variations of existing ones can be implemented easily. In addition to the generator, this toolset contains several extractors for different programming languages, a browser-based user interface for viewing and interacting with visualizations, and an evaluation server to facilitate the execution of local and remote experiments. The paper illustrates the capabilities of GETAVIZ and discusses plans for its further development.

Code Park: A New 3D Code Visualization Tool

Article

Full-text available

Aug 2017

We introduce Code Park, a novel tool for visualizing codebases in a 3D game-like environment. Code Park aims to improve a programmer's understanding of an existing codebase in a manner that is both engaging and intuitive, appealing to novice users such as students. It achieves these goals by laying out the codebase in a 3D park-like environment. Each class in the codebase is represented as a 3D room-like structure. Constituent parts of the class (variables, member functions, etc.) are laid out on the walls, resembling a syntax-aware "wallpaper". Users can interact with the codebase using an overview and a first-person viewer mode. We conducted two user studies to evaluate Code Park's usability and suitability for organizing an existing project. Our results indicate that Code Park is easy to get familiar with and significantly helps in code understanding compared to a traditional IDE. Further, the users unanimously believed that Code Park was a fun tool to work with.

The Recursive Disk Metaphor: A Glyph-based Approach for Software Visualization

Conference Paper

Full-text available

Mar 2015

Sourcevis: Collaborative software visualization for co-located environments

Article

Jan 2013

CodeSurveyor: Mapping large-scale software to aid in code comprehension

Conference Paper

Sep 2015

The dependency inversion principle

Article

Jan 1996

R.C. Martin

Refactoring: Improving the Design of Existing Code

Book

Jan 1999

Scalable motif detection and aggregation

Conference Paper

Jan 2012

Motif search in graphs has become a popular field of research in recent years, mainly motivated by applications in bioinformatics. Existing work has focused on simple motifs: small sets of vertices directly connected by edges. However, there are applications that require a more general concept of motif, where vertices are only indirectly connected by paths. The size of the solution space is a major limiting factor when dealing with this kind of motif. We try to address this challenge through motif instance aggregation. It turns out that effective, parallel algorithms can be found to compute instances of generalised motifs in large graphs. To expedite the process, we have developed GUERY, a tool that can be used to define motifs and find motif instances in graphs represented using the popular JUNG graph library [10]. GUERY consists of two parts: a simple domain-specific language that can be used to define motifs, and a solver. The main strengths of GUERY are (1) support for motif instance aggregation, (2) generation of query result streams, as opposed to (very large) static sets of matching instances, and (3) support for effective parallelisation in the evaluation of queries. The examples used for validation originate from problems encountered when analysing the dependency graphs of object-oriented programs for instances of architectural antipatterns.

Recommended publications

On the existence of high-impact refactoring opportunities in programs

CorpusVis – Visualizing Software Metrics at Scale