ArticlePDF Available

A methodology of engineering ontology development for information retrieval

January 2007

January 2007

Authors:

Zhanjun li

Pennsylvania State University

Victor Raskin

Purdue University

Karthik Ramani

Purdue University

When engineering content is created and applied during the product lifecycle, it is often stored and forgotten. Since search remains text-based, engineers do not have the means to harness and reuse past designs, experiences, and mistakes. On the other hand, current information retrieval approaches based on statistical methods and keyword matching are not directly applicable to the engineering domain. The long term goal of this research is to develop an engineering ontology based computational framework in order to (1) structure unstructured engineering documents; and (2) achieve more effective information retrieval. This paper focuses on the method and process that acquire and evaluate the engineering ontology. We propose a principled, systematic, and semi-automatic ontology development methodology that is based on ontological semantics and is integrated with Protégé, one of the most widely used ontology engineering tools. The methodology is applied in acquiring the established engineering knowledge from various resources. A preliminary test based on engineering catalogs, CAD drawings, and project reports has been conducted. The results validate the effectiveness of the engineering ontology as well as the methodology.

Modules and process of concept tagging

…

Figures - uploaded by Karthik Ramani

Content may be subject to copyright.

Content uploaded by Karthik Ramani

Content may be subject to copyright.

ICED’07/471 1

INTERNATIONAL CONFERENCE ON ENGINEERING DESIGN, ICED’07

28 - 31 AUGUST 2007, CITE DES SCIENCES ET DE L'INDUSTRIE, PARIS, France

A METHODOLOGY OF ENGINEERING ONTOLOGY

DEVELOPMENT FOR INFORMATION RETRIEVAL

Zhanjun Li1, 2, Victor Raskin1, 3 and Karthik Ramani1, 2, 4

1Purdue Research and Education Center for Information Systems in Engineering (PRECISE)

2School of Mechanical Engineering

3Department of English and Linguistics

4School of Electrical and Computer Engineering (by courtesy)

Purdue University, West Lafayette IN, USA

ABSTRACT

When engineering content is created and applied during the product lifecycle, it is often stored and

forgotten. Since search remains text-based, engineers do not have the means to harness and reuse past

designs, experiences, and mistakes. On the other hand, current information retrieval approaches based

on statistical methods and keyword matching are not directly applicable to the engineering domain.

The long term goal of this research is to develop an engineering ontology based computational

framework in order to (1) structure unstructured engineering documents; and (2) achieve more

effective information retrieval. This paper focuses on the method and process that acquire and

evaluate the engineering ontology. We propose a principled, systematic, and semi-automatic ontology

development methodology that is based on ontological semantics and is integrated with Protégé, one

of the most widely used ontology engineering tools. The methodology is applied in acquiring the

established engineering knowledge from various resources. A preliminary test based on engineering

catalogs, CAD drawings, and project reports has been conducted. The results validate the

effectiveness of the engineering ontology as well as the methodology.

Keywords: Engineering ontology, Knowledge acquisition, Engineering information retrieval, Reuse

1 INTRODUCTION

Engineers are dependent on accessing documents in order to fulfill various design and engineering

tasks. In fact, today’s engineers simply do not make an effort to find engineering content beyond mere

keyword searches [1]. However, current information retrieval (IR) approaches either retrieve too

much or irrelevant results for engineering. In industry sectors, it was reported that design engineers

spent 20% to 30% of their time retrieving and communicating information [2]. “Delivering the right

information to the right people at the right time” plays an important role in supporting engineers’

memory extension, knowledge sharing, design concept exploration, design reuse, and the learning

process particularly of novice engineers [3, 4]. However, current engineering practices ignore reuse of

previous knowledge because appropriate engineering information retrieval tools have not been

developed. As a result, a large amount of time is spent reinventing what is already known in the

company or is available in outside resources [5, 6]. It is, therefore, imperative to minimize such

overhead by developing the science base for contextual retrieval and then using this knowledge to

create effective computer-aided tools.

Statistics-based methods and keyword-based input have been prevalent in IR research such as

vector space model [7], latent semantic analysis [8], language modeling [9], and probabilistic model

[10]. They can be viewed as sophisticated stochastic techniques for matching terms from queries with

terms in documents under the assumption of term independence. They try to derive the meaning of the

text from the observable syntactic and statistical behavior of its units without any attempt to represent

the meaning directly. However, words alone cannot capture the semantics or meanings of the

document and query intent. To put it differently, the search results should satisfy the users, who are

looking for something that matches their understanding of pertinent text—an understanding that

includes, among other things, the relations among the terms and the ability to disambiguate and to

infer. This is where the statistical keyword-based techniques fail the users and defeat their purposes.

ICED’07/471 2

In the engineering domain, there has been very limited research aimed at analyzing unstructured

engineering documents for retrieval purposes. Most of it has been based on IR approaches. Dong and

Agogino [11] proposed to use vector space model and belief networks to represent design manuals.

Ahmed et al. [12] developed taxonomies in order to index corporate documents. The vector space

model was also used to classify the documents against the terms in the taxonomies. Yang et al. [13]

attempted to automate the population of a thesaurus from notebooks by using the latent semantic

analysis. McMahon et al. [4] employed predefined taxonomy to classify documents by rule-based

matching.

Current approaches (1) do not attempt to provide the semantics-based representation of

engineering documents or provide for engineers’ information needs; (2) do not reflect and utilize

engineering knowledge in the organization of the search; and (3) are unable to handle complex queries

that have qualitative as well as quantitative engineering specifications.

The long term goal of this research is to develop a content-oriented, knowledge and meaning based

computational framework to form the ontological basis of the search, browsing, and learning tasks in

the engineering domain. This paper focuses on investigating the method and process required to

develop such ontological basis. The proposed methodology

1. Specializes the ontological semantics methodology proposed by Nirenburg and Raskin [15] for

machine translation and natural language understanding;

2. Represents a structured process in developing the engineering ontology (EO) and its associated

engineering lexicon (EL);

3. Formalizes the cumulative domain knowledge such as the classification of mechanical elements,

their function, design, and manufacturing knowledge and formulates in a single standard format;

4. Integrates with an ontology engineering tool;

5. Incorporates semi-automatic tools into the practical acquisition process; and

6. Evaluates the acquired EO by using principled and empirical methods.

Section 2 first provides definitions of an ontology and its distinct features compared to other

representation schemas. Current ontology acquisition methods are summarized. An overview of our

knowledge-based computational framework is described in section 3. Section 4 discusses in detail the

proposed engineering ontology development methodology and the acquired EO. The evaluation

method and experiment are introduced in section 5. Section 6 concludes the paper.

2 RELATED WORK

2.1 Ontology Definition

An ontology is a constructed model of reality, a theory of a domain. In more practical terms, it is a

highly structured system of concepts covering the processes, objects, and attributes of a domain as

well as all their pertinent complex relations. The grain sizes of the concepts are determined by

considerations such as the need for an application or for computational complexity.

From one aspect, an ontology can be viewed as a decomposition of a domain: it is a tangled

hierarchy of conceptual nodes, each of which can be represented as:

property-slot (CONCEPT-NAME, PROPERTY-VALUE/FILLER-CONCEPT-NAME) +

Every concept but the root of the ontology has the property-slot is-a, and the value of this property is

the parent of this concept. A concept may have multiple parents and multiple inheritances.

From the other aspect, an ontology reflects the correlations among concepts across sub-domains:

the PROPERTY-VALUE of a concept refers to its filler concept, i.e., these two concepts are

connected by the specific property-slots, i.e., (binary) relationships.

Ontologies share the inheritance feature with the object-oriented (OO) programming languages,

which are indeed suitable for implementing ontological procedures. However, in OO programming,

the focus is on designing the operational properties, i.e., the methods of a class, whereas ontology

development is based on the structural properties, i.e., relationships of a class. More importantly, the

OO approach lacks the conceptual content of ontologies, and it is not sufficient for addressing the rich

knowledge modeling needs discussed here. The distinction between form and content is crucial for

understanding the proposed ontology model. It is the content of ontologies that makes them useful for

this application, independent of the choice of form, i.e., format or language. Currently, there is also

ICED’07/471 3

confusion between taxonomy and ontology based applications. One of the major differences between

taxonomies and ontologies is that an ontology represents much richer domain contexts than a

taxonomy or a list of taxonomies. A taxonomy is a hierarchical classification of concepts in a sub-

domain. These concepts are connected only by domain-independent (or taxonomic) relationships such

as is-a. An ontology, however, consists of several taxonomies, along with multiple domain-specific

(or non-taxonomic) relationships to connect concepts across taxonomies. See [14] for comparisons

between ontologies and database schema, as well as those between ontologies and knowledge

representation; [15] for an extended view of what a full-fledged ontology must be and how to bring

that about; and [16] for an extensive survey on existing ontological systems from manufacturing and

knowledge sharing perspectives.

2.2 Methods for Ontology Development

While ontologies have found many applications in the fields where semantics-based

communications among people and systems are crucial [14], only a few methods for developing

ontologies have been reported.

The method used to build the Cyc [17] ontology consists of general steps and codification of

articles and pieces of knowledge. Manual process is used to extract the common sense knowledge that

is implicit in different sources. Latterly proposed methods all start from the identification of the scope

and the need for the ontology: The work by Gruber [18] represents the first attempt to consolidate

experience gained in developing ontologies. It can be summarized as five ontology design criteria:

clarity, coherence, extensibility, minimal ontological commitment, and minimal encoding bias.

Uschold and King [19] developed Enterprise Ontology for enterprise modeling processes. Their

development method includes four activities: 1) identification purpose, 2) build ontology, 3)

evaluation, and 4) documentation. They also proposed three strategies for identifying the concepts in

the ontology: top-down, bottom-up, and middle-out. Grüninger and Fox [20] proposed an ontology

design and evaluation method while developing the TOVE (Toronto Virtual Enterprise) project

ontology for business processes and activities modeling. It uses a set of natural language questions,

called competency questions to determine the scope of the ontology and to extract the main concepts

of the ontology as well. However, the major focus is on in building the first order logical model

representation of the ontology. A similar method was introduced by Noy and McGuinness [21] with

an example of wine ontology acquisition using Frame-based representation. Fernandez et al. [22]

presented a more structured method and life cycle definition for developing ontologies from scratch,

called METHONTOLOGY. However, the evaluation is purely subjective. Notice that the

implementation step (manually editing and coding ontologies in a specific language) specified in most

of these methods is not necessary any more because of the maturity of the ontology engineering tools

nowadays [23].

Among the recently proposed ontology acquisition methods in engineering, Eris et al. [24]

presented an initial ontology framework in modeling product development projects in small teams.

Nanda et al. [25] applied the formal concept analysis to form the product family ontology of one-time-

use cameras. Ahmed and Wallace [12] intended to design an ontology development process which can

be customized for a particular manufacturing company. However, their acquisitions did not explicitly

explore the domain-specific relationships among concepts and therefore, the acquisition result is a list

of independent taxonomies, not an ontology.

In summary, very little effort has been made to systemize the established knowledge in the

engineering domain by using formal ontology representation for the purpose of more effective

information retrieval. Most of the current ontology development methods still require tremendous

effort and subjective judgments from the ontology developers to acquire and maintain the ontology.

To our limited knowledge, no attempt has been made to evaluate the resulting ontologies both from

the principle perspective and the application perspective. It is critical to investigate a principled,

systematic, and more structured acquisition method combined with an evaluation process for

developing such an engineering ontology in order to support knowledge and meaning based

information retrieval. Our method also is different because it uses EL to formalize the lexicon

knowledge in order to bridge the concept-based representation of the ontology and the word-based

representation of documents and user queries.

ICED’07/471 4

3 OVERVIEW OF EO-SEARCH

Figure 1 shows the overall architecture of

interactions between the ontological basis,

i.e., the EO and EL, with other functional

modules applied to the knowledge-based

engineering information retrieval framework

(EO-Search). The framework comprises six

portions: pre-processing, ontology basis,

ontology acquisition and maintenance,

concept tagging, concept indexing, and query

processing. Note that concept tagging also

refers to the empirical process of evaluating

the EO and EL in section 5.

1. Pre-processing: The task of pre-

processing is to convert engineering

documents into a unified format such as

.txt files, which can then be processed by

the system. The inputs may include

catalog descriptions, CAD drawings,

technical reports, and engineers’

notebooks.

2. Ontology basis: This consists of domain

knowledge and lexical knowledge, i.e.,

the EO and its associated EL,

respectively. They are used to assist in

recognizing technical terms (in documents

and queries) at the concept level.

3. Ontology acquisition and maintenance:

Protégé 3.1 (http://protege.stanford.edu) is used to build and update the EO and EL. The output

scripts from Protégé record the content of the EO and EL. These Frame-based XML scripts are

then read into the system to generate the EO and EL in the memory.

4. Concept tagging: The documents in the unified format are tagged by using the concepts in the EO

and are then transformed into an XML-based representation. Using EO and EL makes the tagging

process less dependent on NLP techniques in understanding the texts. Metadata, such as names of

the original documents, are also stored.

5. Concept indexing: An inverted index is generated to index the XML documents. The filenames

and the locations where the concept (tag) appears are listed along with the concept. This index is

accessed when the system ranks the documents in query processing.

6. Query processing: EO plays an important role in interpreting the user’s queries accurately, and

therefore improves retrieval performance. Queries with qualitative or quantitative property-value

pairs are also handled. Ontology-based query processing algorithms are developed to fulfill these

tasks.

Please refer to [28, 29] for more details of concept indexing and ontology-based query processing.

4 DEVELOPING EO AND EL

The process of developing the EO and its associated EL includes six steps. These are 1)

Specification: determining the scope and granularity of the EO; 2) Conceptualization: acquiring the

EO (according to the scope and granularity) and EL from various knowledge resources; 3)

Formalization: the acquired knowledge is put into structured formats; 4) Population: the formalized

knowledge is converted into Protégé’s Frame-based representation, manually or automatically; 5)

Evaluation: using principled and empirical methods to validate the quality of the EO and EL; and 6)

Maintenance: revising the EO and EL. Figure 2 illustrates the development process and the supporting

activities in each step. Note that the de facto development of EO and EL is an iterative process.

Indeed, the specifications of an ontology may change throughout its life cycle as the definitions are

initialized, modified, and deleted.

Figure 1. System architecture and functional modules

1. Pre-processing: Consolidating heterogeneous documents

2. Ontology basis: Engineering Ontology & Engineering Lexicon

3. Ontology acquisition and maintenance

4. Concept tagging

5. Concept indexing

6. Query processing

PartTexts

(unstructured)

Tagging

Query

processing 6

Inverted

concept ind ex 5

Doc ument collections

Technical reports

CAD drawings

Engineers' notebooks

Online catalo gs

......

Pre-processing

Engineering

lexicon

Engineering

ontolo gy

Indexing

PartXMLs

(structure d)

ICED’07/471 5

4.1 Specification

The first step is to

identify the scope or

themes of the EO for

information retrieval

purpose. These themes are

determined based on the

discoveries by cognitive

studies in the engineering

domain, such as [12, 30-

33]. The prior studies have

categorized the domain-

specific issues being

documented during the

product development

process as well as the

information needs of

engineers. Currently, in

designing the themes of

the EO, this research

considers issues such as

the products or

components being

designed (i.e., devices),

their functions, properties,

material selections, shape

features, various processes

(e.g., manufacturing) in product development, the environmental objects with which the product or

component interact, and the standards that certain design or manufacturing entities comply with. The

overall schema of the EO is shown in Figure 3. Each taxonomy represents an issue or a sub-domain of

the EO. Recall that a taxonomy consists of concepts organized in a hierarchy. However, the EO is

differentiated from simply a set of loosely connected taxonomies (at their root level) by having other

domain-specific inter-relationships among concepts across these taxonomies. Therefore, what types of

inter-relationships exist among concepts and should be acquired must also be determined.

Now the question becomes what level of granularity of knowledge should be taken into account in

the EO. Since the goal is to build a search mechanism that is more effective than keyword-based

search while less dependent on using NLP techniques to understand documents or queries, the EO

must include more specific concepts (lower-level concepts), such as spur gears, as well as more

general concept categories (upper-level concepts), such as mechanical components. This is because

specific concepts are usually used in documentation while both general and specific concepts may be

the interest of users’ queries. Note that 1) different brand names of the product or components are not

treated as separate concepts; and 2) the instances of the concepts will appear only in the documents by

concept tagging rather than as part of the EO.

4.2 Acquisition

Most of the ontology development methods conduct the ontology acquisition in a subjective

manner. They generate concepts either by brainstorming (i.e., randomly enumerating a list of terms

and then figuring out how they are related to each other), or by interviewing with experts. The first

approach may be effective in creating ontologies for simple domains with shallow knowledge.

Figure 3. The schema of the ontology basis

Engineering ontology

Property

taxonomy

Measurement unit

taxonomy

Device

taxonomy

Function

taxonomy

Environment

taxonomy

Shape feature

taxonomy

Standard

taxonomy

Engineering

lexicon

Material

taxonomy

Value type

taxonomy

Process

taxonomy

Methods / Steps Supporting activities

1. Specification

1.1 Scope determination

1.2 Granularity selection

2. Acquisition

2.1 Concept acquisition

2.2 Relationship acquisition

2.3 Lexical term acquisition

3. Formalization

3.1 Taxonomy formalization

3.2 Relationship formalization

3.3 Lexical term formalization

4. Population

4.1 Manual population

4.2 Automatic population

5. Evaluation

5.1 Principled evaluation

5.2 Empirical evaluation

6. Maintenance

6.1 EO maintenance

6.2 EL maintenance

Protégé modeling

OntoClean [41]

Concept tagging

Protégé modeling

Automatic parsing

Knowledge worksheets

Knowledge resource collection

Knowledge resource reuse

Knowledge extraction

Generalization & Specification

Literature studies

Design considerations

Figure 2. Methodology and process of developing the EO and EL

ICED’07/471 6

However, it is not feasible in developing the EO, which includes broader as well as complex domain

knowledge. The second approach may be appropriate if the ontology is built based upon the

knowledge in a small domain, such as a company. However, the content of the ontology may be

skewed and limited.

The knowledge acquisition task is conducted mainly by utilizing the established engineering

knowledge resource (EKR) and analyzing the content of these resources based on our domain

knowledge background. Examples of the EKR are engineering handbooks, textbooks, online catalogs,

literature, and bill of materials (BOMS). The last one is analyzed in order to acquire the desired

knowledge from a specific company. The ontology acquisition consists of three tasks: concept

acquisition or taxonomy acquisition, relationship acquisition, and lexicon acquisition. In practice, the

first two tasks are done simultaneously. First, the EKRs corresponding to a specific taxonomy, or part

of the taxonomy, are collected, for example, material selection handbooks for the material taxonomy.

Second, the sentences and phrases which describe the concepts of this taxonomy as well as their

relationships with other concepts are extracted, and then documented in free texts. Certain EKRs, such

as Function Basis from [34] and motor and pump ontologies from [35], where concepts are already

organized, are reused directly. The references from the investigated EKR are recorded. The generated

descriptions/documentations are informal representations of the domain knowledge. The lexicon

acquisition is integrated with the lexical term formalization in the next step.

4.3 Formalization

When most of the knowledge has been acquired, it is unstructured and needs to be organized and

structured by using representations that both computers and humans can understand. Such

representations are named “knowledge worksheets.” They are formatted templates and independent of

the ontology engineering tools or implementation languages used. The worksheets 1) are used as

formal documentations of the EO and EL development; 2) direct the acquisition of the EO and EL;

and 3) improve the efficiency of the ontology development process by enabling automatic upload of

the acquired knowledge into Protégé. They have been used extensively by the undergraduate students

who fulfill the acquisition and formalization tasks in our group. In the process of full deployment, the

ontological semantic toolbox [15, 36] will be used.

Basically, there are two types of worksheets: taxonomy worksheets and relationship worksheets.

Each taxonomy corresponds to a taxonomy worksheet while each concept, in general, has a

relationship worksheet. The taxonomy worksheet is used in organizing the unstructured results from

the concept acquisition into a hierarchical structure. In our experience, this is the most challenging

step of the overall development process. For example, different EKRs may classify the same

taxonomy or concept from different perspectives and therefore have to be merged carefully. For

Taxonomies

Num. o

concepts Examples of concepts Acquisition resources

Examples of acquisition

resources

Engineering

component 451

D-LOCK-WASHER,

D-LINEAR-SLIDE

Engineering texts, Handbooks,

Online catalogs

[35, 37],

www.globalspec.com

Device Proprietary

product 190 D-BASE-COVER BOMs (or Product dissection)

BOMs of the base cover

assembly

Function 246 F-SUPPORT, F-LOCK Existing taxonomies [34, 38]

Material 1017

M-STAINLESS-STEEL,

M-2008-T4 AL

Engineering texts, Handbooks,

Online catalogs [39], www.matweb.com

Process 252

MF-CASTING,

MF-WELDING Engineering texts, Handbooks [39, 40]

Property 378

P-SHAFT-DIAMETER, P-

DUCTILITY Same as Device taxonomy Same as Device taxonomy

Measurement unit 64

MU-INCH,

MU-FT-LB/SECOND Online resources

www.ex.ac.uk/cimt/dictunit/

dictunit.htm

Shape feature 47

SF-LINEAR-SLOT,

SF-TOOTH Existing taxonomies

STEP AP224, vocabularies

of major CAD packages

Environment object 135

E-HEAT,

E-AXIAL-LOAD

Engineering texts, linguistic

resources [32], WordNet2.1

Standard 31 S-MIL-STD-130 Standard libraries www.nssn.org

Value-type 8

V-FLOAT (Numerical),

V-HIGH (Symbolic)

Engineering common sense;

Online catalogs N/A

Table 1. The EO contents and acquisition resources

ICED’07/471 7

instance, the manufacturing process can be classified from either the functional aspect or the material

removal/addition aspect. And some EKRs, especially the online catalogs, may have contradictory

classifications (e.g. a child concept becomes an ancestor of its parent). In this case, re-classification is

needed. Note that concept naming conventions are applied in order to 1) make the EO more readable;

and 2) make each concept unambiguous even at its (term) representation. Otherwise for example,

‘cylinder’ can refer to both a device concept and a shape feature concept and therefore, can cause

ambiguities in the EO, which is not allowed. The naming conventions require that each concept

consist of a prefix representing the taxonomy the concept belongs to, in upper case, and that its terms

be connected by “-.” Therefore, the two concepts in the previous example are written as D-

CYLINDER and SF-CYLINDER, respectively. Table 1 lists more details of the EO concepts and the

acquisition resources.

In general, concepts in the EO are connected with their relevant concepts through relationships.

For instance, a property concept (e.g., P-PITCH-DIAMETER) is related with some measurement unit

concepts (e.g., MU-MILLIMETER) and value type concepts (e.g. V-FLOAT). Exceptions include

value type concepts and standard concepts, which are self-contained. Note that these relationships are

one-way connections. Definitions of the relationships are given in Table 2. The relationship worksheet

describes the relevant knowledge (concepts + relationships) of a concept. Table 2 illustrates that the

most significant relationship worksheet is the one for device concepts because a device concept has

correlations with most of the other types of concept (see Figure 4). Note that some relationship

descriptions may be empty either because such knowledge has not been acquired or because of the

characteristics of the concept being described. For example, D-LOCK-WASH does not have has-part

relationships with a device concept since it is a part type of component.

Note that the device taxonomy includes classifications of engineering catalog components and

proprietary products. The latter one needs to be customized for each specific company including

product line classifications, subassembly classifications, and part inventory classifications. The

properties of the device concepts are conceptualized in the property taxonomy and connected with the

device concepts through the has-property relationship. This is also true for the properties of the

material concepts and shape feature concepts.

The lexical terms are the natural language phrases of the corresponding concept. They are used to

match with word(s) in documents or queries. Therefore, morphology forms, abbreviations, acronyms,

Relationship (Concept*, Filler

concept)

Definitions of the relationship Examples

is-a Child Parent Describes the generalization from a child concept to its

parent concepts or the specification from a parent concept

to its child concepts

is-a (D-ELECTRICAL-MOTOR, D-MOTOR)

has-part DC DC Represents the part-whole between an DC and the other

has-part (D-LINEAR-SLIDE, D-BALL-

BEARING)

has-function DC FC Refers to the connection between a DC and one of its FCs has-function (D-LOCK-WASHER, F-LOCK)

interface-with

Interact-with

Complements the has-function relationship when there is

an ‘object’ in the function description of ‘subject + verb [+

objects]’. Together, they represent the interactions

between an DC and the other DC or EC

interface-with (D-LOCK-WASHER, D-

FASTENER);

interact-with (D-LOCK-WASHER, E-

FRICTION)

has-material DC MC Describes the type of materials used in making the DC has-material (D-WASHER, M-METAL)

has-process DC MFC Describes the type of manufacturing process used to

make/fabricate the DC

has-process (D-GEAR, MF-HOBBING)

use-material MFC MC Describes the type of possible raw materials that certain

manufacturing processes act on

user-material (MF-COATING, M-

NONFERROUS-METAL)

has-property DC/MC/SFC PC Each DC has several PCs characterizing its attributes such

as various physical attributes and geometry attributes; each

MC may also have several PCs specifying its

characteristics such as physical and mechanical attributes

has-property (D-PLAIN-WASHER, P-INSIDE-

DIAMETER);

has-property (M-METAL, P-HARDNESS)

has-measurement PC MUC Most of the PCs have one or several MUCs has-measurement (P-LENGTH, MU-METER)

has-value PC/MUC VC Each PC may have numerical VC or symbolic VC while

MUC only has numerical VC

has-value (P-DIAMETER, V-NUMERICAL)

has-feature DC SFC Describes the significant shape features a device may have has-feature (D-SCREW, SF-THREAD)

has-standard DC/MC/MFC SC Specifies the standard a DC/MC/MFC may comply with has-standard (D-WASHER, S-ASME B18.13)

Table 2 Definitions of the relationships

ICED’07/471 8

D-LOCK-WASHER

Definition

A washer designed to prevent undesired loosening of a

nut after it has been tightened

Lexical terms

lock washer, lock washers

Sub-part

Function descriptions

F-LOCK D-FASTENER, F-DISTRIBUTE E-FORCE

Properties

P-INSIDE-DIAMETER, P-OUTSIDE-DIAMETER, P-

THICKNESS

Material

M-FERROUS-METAL, M-THERMOPLASTICS

Manufacturing process

MF-COATING

Shape feature

SF-HOLE

SF-TOOTH

Figure 4. Relationship worksheet for

‘lock washer’

and synonyms of the word/phrase are also lexical terms

and share the same concept with the original lexical term.

For example, move and moving are lexical terms of the

functional concept F-MOVE.

Currently, there are 10 taxonomies, 2,819 concepts

and 13 types of relationships in the EO, and more than

10,000 lexical terms in the EL. The EO represents the

general domain knowledge as well as

the company-specific or proprietary product knowledge.

We have investigated the design of a commercialized

surgery robot as an example of the proprietary products.

The general domain knowledge refers to the knowledge

about the more-standardized catalog components, such as

gears, and more-customized catalog components, such as

linear slides.

4.4 Population

The population step refers to modeling the EO and EL

by using the ontology engineering tool, i.e., Protégé as

well as the generated knowledge worksheets. Two

options are provided: manual population and automatic

population. The modular structure of the EO and EL lend themselves easily to an expansion, such as

the addition of a new relationship or new concept. In Protégé, concepts are modeled as classes while

relationships are slots. An attribute (unary relationship) slot named lexical-terms is assigned to each

class. This attribute contains all the lexical terms of the pertinent concept. In automatic population, a

Protégé plug-in is developed by using Protégé APIs that can read in the knowledge worksheets and

generate the EO and EL model. The manual approach is more appropriate for maintenance purposes

where limited number of concepts, relationships, or lexical terms need to be changed sometimes. It is

more efficient to use the automatic population especially when building the EO and EL from scratch.

In automatic population, taxonomy worksheets are loaded prior to the relationship worksheets.

However, it is possible that certain concepts which are part of the descriptions in the relationship

worksheets may not be defined in the EO yet. Therefore, some human interventions are expected.

5 EVALUATION

The resultant EO is organized in a directed graph or lattice: each node represents a concept and

each arc represents a relationship. A portion of the EO is shown in Figure 5.

Now the questions are: How is the EO to be validated? How much does this ontology cover? And

how accurate are the concept definitions? The OntoClean [41] is applied in order to validate the

choice of the concepts. This method is based on general ontological notions drawn from philosophy,

such as essence, rigidity, identity, and unity, which are used to characterize relevant aspects of the

intended meaning of the ontology concepts and relationships. Regarding the completeness and

accuracy of the EO, because lower-level concepts are taken from various EKRs and more upper-level

concepts are added

in the EO, we

believe the EO

covers the domain

within the scope of

the components and

proprietary products

modeled.

In order to

estimate the

coverage and

accuracy, an

experiment is

Figure 5. A Portion of the EO

D-WASHER

FUNCTION

......

F-LOCK

MF-COATING

(protective finish...)

MANUFACTURING

DEVICE

MF-METAL-

COATING

MATERIAL

......

ROOT

......

...... M-NON-

FERROUS-METAL

MF-ORGANIC-

COATING

......

PROPERTY

......

NUMERIC-VALUE

has-value

has-

manufacturing

use-material

M-ZINC

P-SURFACE-FINISH

D-PLAIN-WASHER

D-LOCK-WASHER

D-BELLIVILLE-

WASHER

D-TOOTH-LOCK-WASHER

......

has-function

has-material

M-FERROUS-

METAL

......

F-ROTATE

is-a

DC: device concept; FC: function concept; EC: environment concept; MC: material concept;

MFC: manufacturing process concept; SFC: shape feature concept; SC: standard concept;

PC: property concept; MUC: measurement unit concept; VC: value type concept

ICED’07/471 9

established by using the concept tagging. Recall that concept tagging is also part of the framework for

EO-based information retrieval. However, in this experiment, our goal is to select concepts from the

tagged document, which includes 1,000 catalog descriptions downloaded from 62 manufacturers and

205 CAD drawings and 11 technical reports from a design company. It is observed that 92.7% of the

test documents are associated with the concept of the EO, while 7.3% of the documents failed to

associate with any concept of the EO due to incompleteness of the EO or EL. However, after the EO

and EL get updated accordingly, tagging errors can be neglected. This observation indicates that the

completeness and maintenance issues will be the life cycle issues in using EO for information

retrieval.

In order to tag engineering documents by using concepts in the EO, documents from various

resources are first converted into .txt files, i.e., PartTexts, during pre-processing as in Figure 1. XPDF

(www.foolabs.com/xpdf/) is incorporated into our framework in order to convert the PDF documents

into the unstructured PartTexts. Certain symbols such as Ø are replaced by their textual descriptions

(e.g., diameter). Texts in CAD drawings are extracted using I-migrate, a software program that

employs various CAD application program interfaces (APIs) and was donated by our industry partner.

Our method makes use of the EO and EL to recognize concepts contained in the documents. By

doing so, the PartTexts are converted into an XML and concept based representation, i.e., PartXMLs,

where each recognized word/phrase is tagged by the corresponding concept. Figure 6 shows the

modules and process of concept tagging.

1. Tokenization: The input character streams are parsed into tokens and punctuation marks.

2. Segmentation: Sentences are formed by using punctuation marks and symbols such as “\n.”

3. Concept recognition:

3.1 Cardinal number recognition: Cardinal numbers such as 3.2,

1:20, and 200 are identified.

3.2 Concept matching: Assigning each word/phrase the concepts

it refers to. This process takes two iterations. The first iteration is full

matching, where lexical terms are retrieved in an orderly manner and

matched against words in each sentence sequentially. Word(s) that

fully match with a lexical term will be assigned the pertinent

ontology concept. Note that multiple concepts may be assigned to a

single word or a series of words (i.e., a phrase) because different

concepts may have the same lexical term. The next iteration is

partial matching, where each unrecognized word is matched against

lexical terms sequentially. The concept will be assigned if the word

matches part of its lexical term.

3.3 Numerical value recognition: The system recognizes

numerical values such as 3.2 inch, HRC 55, and 32°F - 212°F. First,

it recognizes a single numerical value by converting the cardinal

number recognized in step 3.1 to a single numerical value if the

number is adjacent to a measurement unit concept such as ‘mm’ (MU-millimeter), a property concept

such as ‘diameter’ (P-DIAMETER), or certain symbols, such as ‘+/-.’ Next, range values are

recognized. Currently, the system recognizes five types of numerical values: integer, float (e.g., 3.2

and 1/2), percentage (e.g., 20%), ratio (e.g., 1:4), and tolerance (e.g., +/- 0.001).

4. Concept disambiguation: A word or phrase which matches multiple concepts causes ambiguities.

There are two major types of ambiguities:

• Polysemy: for example, the word cylinder may refer to a shape feature concept, SF-CYLINDER,

or a device concept, D-CYLINDER, because both concepts have the same lexical term cylinder.

• Ellipsis: for instance, the word finish may (partially) match the lexical term surface finish, which

is associated with the property concept P-SURFACE-FINISH, and protective finish, which is

associated with the manufacturing process concept, MF-COATING.

Ambiguities are resolved by referring to the contexts of the word/phrase that is ambiguous. The

contexts of a word refer to the concepts its adjacent words/phrases are tagged. For example, if the

untagged word finish is followed by a phrase tagged material concept, e.g., M-ZINC, then the word

finish must be tagged MF-COATING. If the word is followed by a numerical value concept such as

+/-0.001, it must be tagged P-SURFACE-FINISH, because this property concept is related to

Figure 6. Modules and

rocess of conce

t ta

Tokenization

Sentence

segmentation

Concept

recognition

Concept

disambiguation

Engineering

lexicon

Engineering

ontology

PartXMLs

PartTexts

ICED’07/471 10

numerical value concepts as defined in the EO. See [29] for details about the concept disambiguation

method.

5. PartXML generation: The processed partText is converted to PartXML, where each word/phrase is

enclosed with its concept as tags. Figure 7 presents examples of a 2D drawing (notes) and component

catalog descriptions before and after the tagging process. Note that the tag <TEXT> serves as a

containment of words not semantically tagged. These tags are used for a repeated updating of the EO

and EL because the words can easily be pulled out and analyzed.

6 CONCLUSION

A systematic, principled, and semi-automatic ontology development methodology has been

described. It has several distinctive characteristics: 1) The acquisition method is based upon the

ontological semantics theory which has been justified in various semantics-based applications; 2) The

elicitations of the EO take into account both general EKRs independent of a particular company,

EKRs specific to a company, and the information needs of engineers; 3) The knowledge worksheets

further structure the acquisition process and enable an automatic ontology population; 4) The

evaluations of the EO using the principled method and the empirical experiment; and 5) The overall

development process is integrated with Protégé in order to utilize its wide range of plug-ins (e.g.

ontology visualization and reasoning) and language formats such as XML and OWL (Web Ontology

Language). In addition, though the EO is developed for information retrieval purpose in particular, it

has a broader array of potential applications, such as semantics-based system interoperability by

extending the EL, and knowledge reuse.

PART: SUPPORTING BLOCK

APPLICABLE STANDARDS OR SPECIFICATIONS ASME Y14.5M-1994

FINISH 3.2 microinches

ALL FILLETS AND EDGES SHALL BE 0.025+-0.13 ULESS OTHERWISE SPECIFIED

MATERIAL: WROUGHT MATERIAL STEEL BARS

PROTECTIVE FINISH: FINISH IAW 5.3.1.3 OR 5.3.2.3 OF ML-STD-171

MARK IAW WITH MIL-STD-130 METHOD RUBBER STAMP

......

......

<F-SUPPORT>SUPPORTING</F-SUPPORT>

<D-BLOCK>BLOCK</D-BLOCK>

<TEXT>APPLICABLE </TEXT>

<TEXT>STANDARDS</TEXT>

<TEXT>SPECIFICATIONS</TEXT>

<S-ASME Y14.5M-1994>ASME Y14.5M-1994</S-ASME Y14.5M-1994>

<P-SURFACE-FINISH>FINISH</P-SURFACE-FINI SH>

<V-FLOAT>3.2</ V-FLOAT>

<MU-INCH>microinches</MU-INCH>

......

<TEXT>MATERIAL</TEXT>

<MF-WROUGHT>WROUGHT</MF-WROUGHT>

<TEXT>MATERIAL</TEXT>

<M-STEEL>STEEL</M-STEEL>

<D-BAR>BARS</D-BAR>

......

</PartXML>

<FILENAME>washer1_3.pdf</FILENAME>

……

<D-EXTERNAL-TOOTH-LOCK-WASHER>External Tooth Lock Washers</D-EXTERNAL-

TOOTH-LOCK-WASHER>

<TEXT>maximum</TEXT>

<F-HOLD>holding</F-HOLD >

<P-POWER>power</P-POWER>

<D-SCREW>screws</D-SCREW>

<SF-HEAD>heads</SF-HEAD>

<P-ROUND>round</P-ROUND>

<P-PAN>pan</P-PAN>

<SF-TEETH>teeth</SF-TEETH>

<S-ASME B18.21.1>ASME B18.21.1</S-ASME B18.21.1>

<M-18-8-STAINLESS-STEEL>18-8 stainless steel</M-18-8-STAINLESS-STEEL>

<P-CORROSION-RESISTANC>corrosion resistan ce</P-CORROSION-

RESISTANCE>

<P-MAGNETIC>magnetic</P-MAGNETIC>

<M-TYPE-410-STAINLESS-STEEL>410 stainless steel</M-TYPE-410-STAINLESS-

STEEL>

<P-CORROSION-RESIS TANCE>corrosion resistance</P-CORROSION-

RESISTANCE>

<P-ROCKWELL-HARDNESS>Rockwell hardness</P-ROCKWELL-

HARDNESS>

<MU-HARDNESS>C</MU-HARDNES S>

<V-INT>34</V-INT>

……

</PartXML>

Figure 7. Examples of the document tagging results

ote that letters in bold are the words from the original document or PartText.

For the sake of clarity, 1) the title block in the drawing is not shown; 2) only parts of the tagged documents

are illustrated; and 3) the PartText is ignored in b.

a. Example of a drawing notes and its tagging results b. Example of a catalog description and its tagging results

ICED’07/471 11

ACKNOWLEDGEMENTS

Our special thanks go to Professor Bill Peine, School of Mechanical Engineering, Purdue

University, for his help in knowledge acquisition about the surgery robot design, and to Professor

David C. Anderson, School of Mechanical Engineering, Purdue University, for his insightful

suggestions. We would also like to thank Imaginestics LLC for their help in preparing the test data

and for donating I-migrate. Chris Bence, In Chul Chang, and Kaushik Mantri helped in acquiring the

engineering ontology. This research is supported by the 21st Century R&T funds, the National

Science Foundation Partnership for Innovation Award, and University Faculty Scholar Award for

Professor Ramani. We are grateful to the Purdue Center for Education and Research in Information

Assurance and Security (CERIAS) for making its ontological semantics resources available to us and

to its internal pilot grants, funded initially by an Eli Lilly Foundation grant, as well as to its external

NSH-ITR and NSF CyberTrust grants on the extension of the resources to new domains.

REFERENCES

[1] McMahon C.A., Lowe, A., Culley, S.J., Corderoy, M., Crossland, R., Shah, T., and Stewart, D.

Waypoint: An Integrated Search and Retrieval System for Engineering Documents. ASME J.

Computer and Information Science in Engineering (JCISE), 2004, 4(4), 329-338.

[2] Court, A.W., Ullman, D.G., and Culley, S.J. A Comparison between the Provision of

Information to Engineering Designers in the UK and the USA. Int. J. Information Management,

1998, 18(6), 409-425.

[3] Ullman, D.G. The Mechanical Design Process. 2001 (New York: McGraw-Hill).

[4] Ahmed, S. and Wallace K.M. Identifying and Supporting the Knowledge Needs of Novice

Designers within the Aerospace Industry. J. of Engineering Design, 2004, 15(5), 475-492.

[5] Sivaloganathan, S. Engineering Design Conference 98: Design Reuse, 1998 (ASME 98).

[6] Hertzum, M. and Pejtersen, A.M. The Information-seeking Practices of Engineers: Searching

for Document as well as for People. J. Information Processing and Management, 2000, 36(5),

761-778.

[7] Salton, G. Automatic Text Processing. 1989 (Wokingham: Addison Wesley).

[8] Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., and Harshman, R. Indexing by

Latent Semantic Analysis. J. American Society for Information Science, 1990, 41(6), 391– 407.

[9] Ponte, J. and Croft, W. A Language Modeling Approach to Information Retrieval. ACM SIGIR,

1998.

[10] Robertson, S.E., Walker, S., and Hancock-Beaulieu, M. Okapi at TREC-7: Automatic Ad hoc

Filtering, VLC and Interactive. TREC-7, 1999.

[11] Dong, A. and Agogino, A.M. Text Analysis for Constructing Design Representations. J.

Artificial Intelligence in Engineering, 1996, 11, 65-75.

[12] Ahmed, S., Kim, S., and Wallace, K.M. A Methodology for Creating Ontologies for

Engineering Design. Proc. ASME / IDET&CIE Conf., Long Beach, CA, 2005.

[13] Yang, M.C., Wood, W.H. and Cutkosky, M.R. Design Information Retrieval: A Thesauri-based

Approach for Reuse of Informal Design Information. J. Engineering with Computers, 2005,

21(2), 177-192.

[14] Uschold, M. and Grüninger, M. Ontologies and Semantics for Seamless Connectivity.

SIGMOD Record, 2004, 33(4), 58-64.

[15] Nirenburg, S. and Raskin, V. Ontological Semantics. 2004 (Cambridge, MA: MIT Press).

[16] Schlenoff, C. Denno, E., Ivester, R., Libes, D., and Szykman, S, An Analysis and Approach to

Using Existing Ontological Systems for Applications in Manufacturing. J. Artificial

Intelligence for Engineering Design, Analysis and Manufacturing (AIEDAM), 2000, 14, 257-

270.

[17] Lenat, D.B. and Guha, R.V., Building Large Knowledge-based Systems: Representation and

Inference in the Cyc Project. 1990 (Addison-Wesley, Boston).

[18] Gruber, T. Towards Principles for the Design of Ontologies Used for Knowledge Sharing. Intl.

J. of Human-Computer Studies, 1995, 43(5/6), pp. 907-928.

[19] Uschold, M. and King, M. Towards a Methodology for Building Ontologies. IJCAI95

Workshop on Basic Ontological Issues in Knowledge Sharing, 1996, Montreal.

ICED’07/471 12

[20] Grüninger, M. and Fox, M.S. Methodology for the Design and Evaluation of Ontologies. Proc.

Int’l Joint Conf. AI Workshop on Basic Ontological Issues in Knowledge Sharing, 1995.

[21] Noy, N.F. and McGuinness, D.L. Ontology Development 101: A Guide to Creating Your First

Ontology. Technical Report, KSL-01-05 and SMI-2001-0800, 2001, Stanford, Knowledge

Systems Laboratory and Stanford Medical Informatics.

[22] Fernández-López, M., Gómez-Pérez, A., and Sierra, J.P. Building a Chemical Ontology Using

Methonology and the Ontology Design Environment. IEEE Intelligent Systems, 1999, 14(1).

[23] Gómez-Pérez, A., Angele, J., Fernández-López, M., Christophides, V. Sure, Y. (ed.) A Survey

on Ontology Tools. OntoWeb Deliverable, Universidad Politecnia de Mardrid, 2002.

[24] Eris, O., Hansen, P.H.K., Mabogunge, A., and Leifer, L. Toward a Pragmatic Ontology for

Product Development Projects in Small Teams. In Proc. Int’l Conf. on Engineering Design

(ICED’99), 1999, Munich.

[25] Nanda, J. Simpson, T.W., Kumara, S.R.T., and Shooter, S.B. A Methodology for Product

Family Ontology Development Using Formal Concept Analysis and Web Ontology Language.

JCISE, 2006, 6, 1-11.

[26] Hwang, C. H. Incompletely and Imprecisely Speaking: Using Dynamic Ontologies for

Representing and Retrieving Information. Technical Report, 1999, Austin: Microelectronics

and Computer Technology Corp., TX.

[27] Shamsfard, M. and Barforoush, A.A. Learning Ontologies from Natural Language Texts. Int.

J.Human-Computer Studies, 2004, 60, 17-63.

[28] Li, Z., Raskin, V., and Ramani, K. Developing Ontologies for Engineering Information

Retrieval. Proc. ASME 2007 IDETC/CIE Conf., Las Vegas, NE, September, 4-7 (accepted).

[29] Li, Z. and Ramani, K. Ontology-based design information extraction and retrieval. AIEDAM,

21(2), 2007.

[30] Kuffner, T.A. and Ullman, D.G. The Information Request of Mechanical Design Engineers.

Design Studies, 1991, 12, 42-50.

[31] Baya, V, Gevins, J, Baudin, C, Mabogunje, A, Leifer, L., and Toye, G. An Experimental Study

of Design Information Reuse. Proc. 4th ASME/DTM Conf., Scottsdale, AZ, 1992, 42, 141-147.

[32] Pugh, S. Total Design: Integrated Methods for Successful Product Engineering. 1997

(Wokingham: Addison-Wesley).

[33] Lowe, A. McMahon, C. Shah, T., and Culley, S. An Analysis of the Content of Technical

Information Used by Engineering Designers. Proc. of ASME/DET Conf., 2000, Baltimore.

[34] Hirtz, J., Stone, R.B., McAdams, D.A., Szykman, S., and Wood, K.L. A Functional Basis for

Engineering Design: Reconciling and Evolving Previous Efforts. Research in Engineering

Design, 2002, 13(2), 65–82.

[35] Kim, J., Will, P., Ling, S.R., & Neches, B. Knowledge-rich Catalog Services for Engineering

Design. AIEDAM, 2003, 17 (4), 349-366.

[36] Triezenberg, K.E. Ontology of Emotion. PhD thesis, 2006, Purdue University.

[37] Rothbart, H.A. Mechanical Design Handbook, 1996 (McGraw-Hill, NY).

[38] Collins, J.A., Hagan, B.T., and Bratt, H.M. The Failure-Experience Matrix – A Useful Design

Tool. J. of Engineering for Industry, 1976, August, 1074-1079.

[39] Kutz, M. Handbook of Materials Selection. 2002 (John Wiley & Sons, NY).

[40] Kutz, M. Mechanical Engineers' Handbook, Manufacturing and Management. 2005 (John

Wiley & Sons, NY).

[41] Guarino, N. and Welty, C. Towards a Methodology for Ontology-based Model Engineering. In

Proc. of the ECOOP-2000 Workshop on Model Engineering, 2000.

Contact info:

Zhanjun Li Professor Karthik Ramani

Purdue University Purdue University,

School of Mechanical Engineering School of Mechanical Engineering

West Lafayette, IN 47907, USA West Lafayette, IN 47907, USA

Phone: 01-765-4945653 Phone: 01-765-4945725

Fax: 01-765-4940539 Fax: 01-765-4940539

liz@purdue.edu ramani@purdue.edu

https://engineering.purdue.edu/precise http://widget.ecn.purdue.edu/~ramani

Towards Ontology-Based Design Science Research for Knowledge Accumulation and Evolution

Conference Paper

Full-text available

Jan 2019

Introducing a multipliable BOM-based automatic definition of information retrieval in plant engineering

Article

Full-text available

May 2024

The complexity of process plants and the growing demand for digitalization require efficient and accurate information retrieval throughout the lifecycle phases of a process plant. This paper discusses the concept of instantiation and introduces a method for identifying and multiplying required information in plant engineering using scalable so-called Instantiation Blocks linked to the Bill of Material. Core functionality, an ontology graph and a user interface based on Python and React are developed to demonstrate the implementation of the framework and validate its effectiveness in practice.

Knowledge-Based Product-Service Ecosystems

Chapter

Feb 2023

Due to the ongoing digital transformation, businesses are exposed to work with a dynamic business environment. Traditional 1-to-1 relationships between manufacturers and end customers are transformed into a network of relationships between manufacturers, end customers, software developers, suppliers, and third parties. The network of partners evolves into an ecosystem. Using a smart product-service system as the central anchor of value creation for an ecosystem enables hybrid value creation for producers and consumers simultaneously. In this paper we develop a knowledge-based model of smart product-service systems in their corresponding ecosystems, that assists designers and developers in decision making during the product development process.KeywordsSmart product-service systemsProduct-service ecosystemsKnowledge-based engineeringSmart product engineering

Ontology‐Based System for Patient Monitoring

Chapter

Nov 2021

In the current scenario, web sources are flooded with information and the user can search for needed information from it. The output of the search keyword may be relevant or irrelevant despite the search keyword. Since the vast majority of the web sources are finding their results syntactically than semantically. So, the machine must understand the meaning of the search keyword to give relevant search results. The responsibility of researchers is to find the solution for Patient Monitoring System, which must give an accurate result for the search keyword. There is a need to construct the Patient Monitoring System with the support of ontology. Every detail of the patient and their health information must be stored in the Ontology. This research work focuses on the importance of ontology based system for the patient monitoring system. The domain ontology has been constructed to preserve the details of the patient health issues. This system consists of four phases includes Domain ontology construction, Pre-processing, Rule construction and Reasoning of Query. After the validity of the query, the most relevant result must be produced automatically. The correctness of the query must be validated with the support of reasoner.

Extraction of Formal Manufacturing Rules from Unstructured English Text

Article

Jan 2021
COMPUT AIDED DESIGN

Semantics-based approaches—founded on the idea of explicitly encoding meaning separately from the data or the application code—are being applied to manufacturing, for example, to enable early manufacturability feedback. These approaches rely on formal, i.e., computer-interpretable, knowledge and rules along with the context or semantics, which facilitates the reuse and sharing of the knowledge via semantic web technologies. On the other hand, manufacturing knowledge has been maintained primarily in the form of unstructured English text. It is considered impractical for engineers to author accurate, formal, and structured manufacturing rules. However, previous efforts on extracting semantics from unstructured text in manufacturing have mainly focused on basic concept names and hierarchies for ontology creation, rather than extracting complex manufacturing rules. In this context, this paper focuses on the development of a semantics-based framework for acquiring formal manufacturing rules from English text, such as those written in manufacturing handbooks, by guiding standard Natural Language Processing (NLP) techniques with formal manufacturing knowledge (i.e., controlled vocabulary and domain ontology). Specifically, this paper studies the problem of rule extraction in the manufacturing domain, proposes the formal rule extraction framework, and demonstrates its feasibility. From the dataset of 133 sentences with a manufacturing rule, the proposed framework was able to extract correct rules from approx. 57% of the sentences. This paper also demonstrates the extensibility of the framework. Specifically, the framework was initially developed using the three sections of a manufacturing handbook, including milling, metal stamping, and die-casting sections, and could be successfully applied to the rest of the book after just updating the formal manufacturing knowledge to cover the other sections. This paper provides meaningful results in terms of formalization, thus will contribute to the development, sharing, and reuse of formal manufacturing knowledge that includes complex manufacturing rules.

Organizational Learning for Value Creation

Chapter

Jan 2020

Technology and industry growth urge organizations to enhance their knowledge level and creation of new knowledge is highly valued in learning organizations (LO). It is inevitable that knowledge is a critical part in the OL context. This chapter discusses the importance of knowledge creation in an LO and the relevant issues of organizational knowledge, such as knowledge process, learning policy and learning technologies.

Automated feedback generation for formal manufacturing rule extraction

Article

Mar 2019

Manufacturing knowledge is maintained primarily in the unstructured text in industry. To facilitate the reuse of the knowledge, previous efforts have utilized Natural Language Processing (NLP) to classify manufacturing documents or to extract structured knowledge (e.g. ontology) from manufacturing text. On the other hand, extracting more complex knowledge, such as manufacturing rule, has not been feasible in a practical scenario, as standard NLP techniques cannot address the input text that needs validation. Specifically, if the input text contains the information irrelevant to the rule-definition or semantically invalid expression, standard NLP techniques cannot selectively derive precise information for the extraction of the desired formal manufacturing rule. To address the gap, we developed the feedback generation method based on Constraint-based Modeling (CBM) coupled with NLP and domain ontology, designed to support formal manufacturing rule extraction. Specifically, the developed method identifies the necessity of input text validation based on the predefined constraints and provides the relevant feedback to help the user modify the input text, so that the desired rule can be extracted. We proved the feasibility of the method by extending the previously implemented formal rule extraction framework. The effectiveness of the method is demonstrated by enabling the extraction of correct manufacturing rules from all the cases that need input text validation, about 30% of the dataset, after modifying the input text based on the feedback. We expect the feedback generation method will contribute to the adoption of semantics-based technology in the manufacturing field, by facilitating precise knowledge acquisition from manufacturing-related documents in a practical scenario.

An abstract model for e-content search using ontology

Conference Paper

Jun 2017

A Novel Approach to Mapping for KBQA System Using Ontology

Conference Paper

Jul 2018

Knowledge Based Question Answering System (KBQA) aims to provide suitable answers for the queries posted by users using ontology. Ontology is supposed to refine the search with semantics and confer the best result to the user. Mapping of Question and Answers are mandatory in every Question Answering System. This research work proposed the framework for KBQA Mapping system, which helps in retrieving precise information from a large collection of documents. The KBQA system consists of three phases such as Question Preprocessing, Answer Evaluation and Concept Mapping of question with their answer. The question posted by the user will be evaluated to get the answer and the answer will be taken from the pool of KBQA. This research paper explores the novel approach to the mapping of Question and most relevant Answers in KBQA System.

An evolutionary computation based method for creative design inspiration generation

Article

Full-text available

Apr 2019
J INTELL MANUF

Product design is an important part of the manufacturing system from a macro point of view. In the design process, creative design is one of the important factors to survive the fierce competition. During creative design process, designers are the most critical component. However, design fixation often negatively influences the design outcomes. Many researches from cognitive science and design science reveal that the presentation of outer information can alleviate design fixation effectively. Among different kinds of outer information, language terms are proved to be effective. This work presents a method of automatically generating language terms as design inspirations based on 500,000 granted patents. This method adopts evolutionary computation as the fundamental algorithm to retrieve language terms from a vocabulary base. To implement the algorithm, the vocabulary is first encoded by high dimensional vectors through word embedding, which is a three layers neural network. Further, two metrics for measuring the inspiration potential of language terms are defined in a computable manner. This work also conducts experiments to validate the method, and the experimental results show that (1) the algorithm is efficient and has the potential to be extended to larger vocabulary; (2) the generated design inspirations have a positive influence on the design outcomes.

The information requests of mechanical design engineers

Article

Full-text available

Jan 1991
DESIGN STUD

Design documentation does not typically include all of the information sought by mechanical design engineers. This paper reports on a study of practicing engineers making modifications to existing designs. Particular attention is paid to the design information required to answer questions about the design and to verify and refute conjectures about the design. A taxonomy of the questions asked by the designers in this study and the conjectures they formed is presented. It is proposed that an intelligent CAD system be developed to capture, structure, and re-play this information.

Ontology Development 101: A Guide to Creating Your First Ontology

Article

Full-text available

Jan 2001

Identifying and supporting the knowledge needs of novice designers within the aerospace industry

Article

Full-text available

Oct 2004

A long-term research project is being undertaken to understand how to support engineering designers through the provision of appropriate knowledge during the design process. Empirical research already carried out within the aerospace industry found that novice designers were aware of their knowledge needs in only 35% of their queries. In addition, observations of experienced and novice designers identified design strategies employed by experienced designers. The aim of this research was to develop and evaluate a method of support for designers based upon these findings. This paper describes the development of the method and its preliminary evaluation within the aerospace industry. The evaluation indicated that the method was able to reduce the number of inappropriate questions asked by novice designers.

An Analysis of the Content of Technical Information Used by Engineering Designers

Conference Paper

Sep 2000

This paper reports on an investigation that aims to characterise the technical content of documentary information used by engineers within two aerospace companies. The work has been carried out in order to gain an understanding of the technical requirements for future information systems to support designers. A rigorous analysis of the textual content of exemplar information objects provided by engineers is initially described. From this analysis, Object Content Profiles for three distinct types of text-based objects are identified: (i) correspondence, (ii) technical (report) objects and (iii) manuals, catalogues and standards. This work is complemented by a survey of designers which identifies the attributes that engineers consider the most helpful as search criteria when trying to locate information objects. These results suggest that whilst drawings and CAD models can be represented by a small number of core attributes, for text-based objects a wider range of descriptive attributes is important. Finally, an outline framework to allow the development of hierarchical taxonomies is presented. This is based on ontology development guidelines but is modified to promote the reuse of existing hierarchically structured sources of indexing terms.

Indexing by Latent Semantic Analysis

Article

Jan 1990

The Failure-Experience Matrix—A Useful Design Tool

Article

Aug 1976
J Eng Ind Trans ASME

A three-dimensional failure-experience cell matrix is proposed for the purpose of organizing and analyzing existing failure experience data. In the proposed matrix the three axes represent failure modes, elemental mechanical functions, and corrective actions. The usefulness of the failure-experience matrix is demonstrated by investigating over 500 individual failed parts from U. S. Army helicopters. It is proposed that data from all industry should be gathered and inserted into a central failure-experience matrix and made accessible to all designers.

A Methodology for Creating Ontologies for Engineering Design

Conference Paper

Jan 2005

This paper describes a methodology for developing ontologies for engineering design. The methodology combines a number of methods from social science and computer science, together with taxonomies developed in the field of engineering design. A case study is used throughout the paper focusing upon the use of an ontology for searching, indexing and retrieving of engineering knowledge. An ontology for indexing design knowledge can assist the users to formulate their queries when searching for engineering design knowledge. The root concepts of the ontology were elicited from engineering designers during an empirical research study. These formed individual taxonomies within the ontology and were validated through indexing a set of ninety-two documents. Relationships between concepts are extracted as the ontology is populated with instances. The identified root concepts were found to be complete and sufficient for the purpose of indexing. A thesaurus and an automatic classification are being developed as a result of this evaluation. The methodology employed during the test case is presented in this paper. There are six separate stages, which are presented together with the research methods employed for each stage and the evaluation of each stage. The main contribution of this research is the development of a methodology to allow researchers and industry to create ontologies for their particular purpose and to develop a thesaurus for the terms within the ontology. The methodology is based upon empirical research and hence, focuses upon understanding a user’s domain models as opposed to extracting an ontology from documentation.

Toward a Pragmatic Ontology for Product Development Projects in Small Teams

Conference Paper

Jan 1999

A Comparison Between the Provision of Information to Engineering Designers in the UK and the USA

Article

Dec 1998
INT J INFORM MANAGE

Understanding the information accessing activities of engineering designers has been a topic of considerable interest both to industrialists and researchers alike in forming the basic requirements and specification for document management systems, engineering database management systems and information delivery systems. As part of this effort studies have been undertaken to establish how the designers themselves access and distribute this information. However, such studies have been specific to the host country in which the research was originated and comparisons with the wider design community across the world have not possible to establish or quantify. This paper will present the salient findings and observations of a global comparison between engineering designers working within the UK and USA. It is based on two questionnaire surveys, one undertaken in each of the two countries, where over 300 designers were questioned. Thus, it is anticipated that this paper would serve to enhance the management of information and the associated information delivery systems on a world-wide basis.

Total Design: Integrated Methods for Successful Product Engineering

Article

Jan 1991

Stuart Pugh

A methodology of engineering ontology development for information retrieval

Abstract and Figures

Recommended publications

Locking Cap Designs Improve Fatigue Properties of Polyaxial Screws in Upper Extremity Applications

Adding experimental arms to platform clinical trials: Randomization procedures and interim analyses

A Semantic Vector Retrieval Model for Desktop Documents

Verifying Text Similarity Measures for Two Layered Retrieval