Transcript and Presenter’s Notes
Title: Dissertation Defense
1
Dissertation Defense
- Turning Information into Action: Assessing and
Reporting GIS Metadata Integrity Using Integrated
Computing Technologies
Timothy Mulrooney
2
Outline
- GIS Metadata Background
- The Issue
- Research Questions
- Methodology
- Testing and Results
- Conclusion
- Discussion
3
GIS Data Details
- Most data possess a spatial element
- The most costly element of GIS is data
development
4
What Is Metadata?
Drunk driving in Winston-Salem
How Did You Create These Data?
What Exactly Do These Data Represent?
How Do I Contact the Person Who
Manages the Data?
When Were These Data Published?
5
The Issue
- Within a Single Metadata File
- More than 400 Elements
- 7 FGDC (according to CSDGM) Required Elements
- 15 FGDC Recommended Elements
- 21 Other Interesting Elements
- These work
- 120 Databases
- 50-110 layers per database
6
Research Questions
- How can mathematical methods be applied to GIS
metadata to aid the decision-making process?
7
Methodology
- Idea came about from mass population and extraction of
metadata values
- Used ArcObjects/VBA
- Previous metadata extraction restricted to
- software
- operating system
- Why done regularly?
8
Metadata Assessment and Reporting Tool (MART)
9
Data Preparation
- Perl (Practical Extraction and Report
Language)
- Perl extracts elements from multiple XML metadata
files and writes them into a single CSV file
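Illustrative sketch (not the original Perl): the extraction step above can be pictured as looking up a fixed list of element paths in each XML metadata file and writing one CSV row per file. Python is used here only for illustration. The `SEARCH_LIST` mapping, the column names and the sample document are assumptions, and `NOT_FOUND` is borrowed from the sample rules shown later in the deck.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical mapping of CSV column names to CSDGM element paths.
SEARCH_LIST = {
    "Data_Set_Title": "idinfo/citation/citeinfo/title",
    "Publication_Date": "idinfo/citation/citeinfo/pubdate",
}


def extract_row(xml_text, name):
    """Return one CSV row (a dict) for a single metadata document."""
    root = ET.fromstring(xml_text)
    row = {"fileName": name}
    for column, path in SEARCH_LIST.items():
        node = root.find(path)
        # Missing or empty elements are recorded as NOT_FOUND.
        found = node is not None and node.text and node.text.strip()
        row[column] = node.text.strip() if found else "NOT_FOUND"
    return row


def to_csv(rows):
    """Serialize all rows into one CSV document with a header line."""
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=["fileName", *SEARCH_LIST], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


# Made-up sample document with a title but no publication date.
SAMPLE = (
    "<metadata><idinfo><citation><citeinfo>"
    "<title>Streets</title>"
    "</citeinfo></citation></idinfo></metadata>"
)

rows = [extract_row(SAMPLE, "streets.xml")]
csv_text = to_csv(rows)
```

Recording an explicit `NOT_FOUND` value, rather than leaving the cell blank, keeps missing elements visible to the later compliance and rule-mining steps.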
10
FGDC Compliancy
- Test FGDC Compliancy
- Used Perl to check whether the required metadata
elements were populated
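Illustrative sketch (an assumption, not the MART code): a compliance test of this kind reduces to checking that every required element has a usable value. The element names in `REQUIRED` are placeholders; the actual FGDC required elements are not listed in this deck.

```python
# Hypothetical column names standing in for the required elements.
REQUIRED = ["Data_Set_Title", "Publication_Date", "Abstract"]


def is_populated(value):
    """True when the value holds something other than blank or NOT_FOUND."""
    return value is not None and value.strip() not in ("", "NOT_FOUND")


def compliance(row, required=REQUIRED):
    """Report whether one layer's metadata row fills every required element."""
    missing = [name for name in required if not is_populated(row.get(name))]
    return {"compliant": not missing, "missing": missing}


# Made-up layer: title present, date not found, abstract blank.
layer = {"Data_Set_Title": "Streets", "Publication_Date": "NOT_FOUND", "Abstract": " "}
report = compliance(layer)
```

Returning the list of missing elements, and not just a pass/fail flag, is what lets a report point staff at the specific fields to fix.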
11
Data Analysis
- What descriptive metrics can best be applied to
GIS metadata?
- Temporal Mean
- Temporal Median
- Converted big-endian date format (ISO 8601) to a ratio
number, performed calculations, and then converted back to a
date
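Illustrative sketch, assuming dates are stored as YYYYMMDD strings: averaging such strings as plain integers would give meaningless results, because the digits are not on a ratio scale (20011231 and 20020101 are one day apart, not 8,870 units). Converting each date to a day count, computing the statistic, and converting back avoids this. The function names are hypothetical.

```python
from datetime import date, datetime
from statistics import mean, median


def to_ordinal(stamp):
    """Convert a YYYYMMDD string to a day count (a ratio-scale number)."""
    return datetime.strptime(stamp, "%Y%m%d").date().toordinal()


def from_ordinal(value):
    """Convert a (possibly fractional) day count back to YYYYMMDD."""
    return date.fromordinal(round(value)).strftime("%Y%m%d")


def temporal_mean(stamps):
    return from_ordinal(mean(to_ordinal(s) for s in stamps))


def temporal_median(stamps):
    return from_ordinal(median(to_ordinal(s) for s in stamps))


# Made-up publication dates: day offsets 0, 2 and 10 from 1 January 2002.
dates = ["20020101", "20020103", "20020111"]
mean_date = temporal_mean(dates)      # offsets average to day 4
median_date = temporal_median(dates)  # middle value is day 2
```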
12
Data Analysis
- Use Perl and R programming language to
dynamically assess quantitative and qualitative
metadata fields
- R is a language and environment for statistical
computing and graphics
13
Results of This Analysis
Average Horizontal Accuracy: 104.3 meters, Temporal Mean: 20020402
Average Horizontal Accuracy: 31.7 meters, Temporal Mean: 20071002
14
Supervised Techniques
- Custom application used to query a database
that contains metadata information
- PHP language and MySQL database to query
information about all data layers using a web
interface
- Helps to guide the decision-making process
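Illustrative sketch (an assumption, not the PHP/MySQL application): a supervised query of this kind can be pictured as turning user-selected filters into a parameterized SQL statement. Python's built-in `sqlite3` stands in for MySQL, and the table layout, column names and rows are all made up for the example.

```python
import sqlite3

# In-memory database standing in for the MySQL metadata store.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE layers (title TEXT, data_theme TEXT, pub_date TEXT, horiz_acc REAL)"
)
conn.executemany(
    "INSERT INTO layers VALUES (?, ?, ?, ?)",
    [
        ("Streets", "Transportation", "20080101", 5.0),
        ("Wetlands", "Environment", "20010101", 100.0),
        ("Hydrants", "Public_Safety", "20050615", 12.0),
    ],
)

ALLOWED_COLUMNS = {"data_theme", "pub_date", "horiz_acc"}
ALLOWED_OPERATORS = {"=", "<", ">", "<=", ">="}


def query_layers(conn, filters):
    """filters: list of (column, operator, value) tuples, e.g. from a web form."""
    clauses, params = [], []
    for column, op, value in filters:
        # Only whitelisted names reach the SQL text; values are bound as parameters.
        if column not in ALLOWED_COLUMNS or op not in ALLOWED_OPERATORS:
            raise ValueError(f"unsupported filter: {column} {op}")
        clauses.append(f"{column} {op} ?")
        params.append(value)
    sql = "SELECT title FROM layers"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    return [row[0] for row in conn.execute(sql + " ORDER BY title", params)]


# Layers published before 2006 with horizontal accuracy worse than 50 meters.
old_and_coarse = query_layers(conn, [("pub_date", "<", "20060101"), ("horiz_acc", ">", 50)])
```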
15
Unsupervised Data Mining Techniques
- Process of sorting through large amounts of data
in order to pick out relevant information
- Rule Induction (Association Rule Mining)
- Discover interesting relationships between
variables
- Beer and Diapers example
- Integrate existing Perl ARM module with custom
application to create the transaction table from
which rules are derived.
16
Transaction Table Creation
- Decomposition of a 2-dimensional array into
1 dimension. For 890 layers and 43 attributes,
there will be 38,270 transactions
- How to express quantitative values in a nominal or
ordinal environment (low, medium, high, location)
- How to express categorical data within the
transaction table
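Illustrative sketch, under stated assumptions: the decomposition can be pictured as flattening a layer-by-attribute table into one `(layer, "Attribute=Value")` row per cell, with numeric values first binned into ordinal labels. The bin thresholds and all names below are hypothetical. Only the 890 x 43 = 38,270 count comes from the slide.

```python
def bin_value(value, low, high):
    """Map a number onto an ordinal label; thresholds are caller-supplied."""
    if value is None:
        return "Unknown"
    if value < low:
        return "Low"
    if value <= high:
        return "Medium"
    return "High"


def to_transactions(table):
    """Flatten {layer: {attribute: value}} into (layer, 'Attribute=Value') rows."""
    return [
        (layer_id, f"{attribute}={value}")
        for layer_id, attributes in table.items()
        for attribute, value in attributes.items()
    ]


# Placeholder table with the dimensions quoted on the slide.
table = {
    f"layer{i}": {f"attr{j}": "x" for j in range(43)}
    for i in range(890)
}
transactions = to_transactions(table)

# A quantitative field expressed ordinally (hypothetical 10 m and 50 m cut-offs).
accuracy_item = "Horizontal_Accuracy=" + bin_value(31.7, low=10, high=50)
```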
17
MARTO-XML
- XML standard used to describe output from
analysis (Metadata Assessment and Reporting Tool
Output)
18
Data Rendering
- Results are published in a web page
- Modules tied together using batch files
- New data and graphs are created on a schedule
- Old data are archived
- New links established
- Saved and referenced from legacy XML files
19
Testing
- Human testing of 40 respondents using real GIS
data on MART
- GIS professionals navigate results of analysis
in a web environment
- Technology Acceptance Model (TAM) used to assess
the effectiveness of this technology
20
Testing Environment
- GIS database of 890 individual data layers
- Ran and published output of all aforementioned
modules
- Surveyed 40 respondents for their opinions on the
application's ease of use, usefulness, potential
utility and the intention to use the software
21
Testing FGDC Compliancy
22
Temporal and Horizontal Accuracy
23
Supervised Techniques
- Database from 890 data layers was queried using a
web interface created using PHP and dynamically
generated HTML form elements
- Output was published in a tabular HTML table with
records satisfying the data query being displayed
24
Unsupervised Techniques
- Look for patterns in a large transaction
table within a given support, confidence and strength
- Used a support level of 2 (with a support level
of 4, there would be more than 526,00 rules)
- For support level 2 and confidence of .7 (1
antecedent and 1 consequent), 6204 rules were
generated
- Results are published in a .txt file
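Illustrative sketch (not the Perl ARM module used in MART): rules with one antecedent and one consequent can be found by counting how often pairs of items occur together. Two assumptions are made here: "support" is read as the number of layers containing both items, and confidence as that count divided by the number of layers containing the antecedent. The sample layers are made up.

```python
from collections import Counter
from itertools import permutations


def mine_rules(transactions, min_support=2, min_confidence=0.7):
    """Return (support, confidence, antecedent, consequent) tuples."""
    item_counts = Counter()
    pair_counts = Counter()
    for items in transactions:
        unique = set(items)
        item_counts.update(unique)
        # Every ordered pair is a candidate rule: antecedent => consequent.
        pair_counts.update(permutations(sorted(unique), 2))
    rules = []
    for (antecedent, consequent), support in pair_counts.items():
        confidence = support / item_counts[antecedent]
        if support >= min_support and confidence >= min_confidence:
            rules.append((support, round(confidence, 3), antecedent, consequent))
    return sorted(rules, reverse=True)


# Made-up layers, each described by its items.
layers = [
    ["Data_Theme=Wetlands", "Publication_Date=Unknown"],
    ["Data_Theme=Wetlands", "Publication_Date=Unknown"],
    ["Data_Theme=Wetlands", "Publication_Date=Old"],
    ["Data_Theme=Roads", "Publication_Date=Old"],
]
rules = mine_rules(layers)
```

In this toy data, "Wetlands => Unknown" has support 2 but confidence 2/3, so it falls below the .7 cut-off, while the reverse rule holds in every layer where the date is unknown and is kept.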
25
Sample Rules Produced
- 67 1.000 Place_Key=North_Carolina =>
Geoid=NOT_FOUND
- 508 .992 Place_Key=Forsyth_County =>
Publication_Date=Medium
- 32 .800 Location=Northwest => Publication_Date=Old
- 353 1.000 Data_Theme=Public_Safety =>
Ellipsoid=NOT_FOUND
- 14 .824 Data_Theme=Wetlands =>
Publication_Date=Unknown
- These rules combined with supervised techniques
can dictate allocation of resources and decisions
in the future
26
Testing
- Technology Acceptance Model
- Uses survey-based questions to determine the
relationship between a technology's
- Perceived Ease of Use (PEOU)
- Perceived Usefulness (PU)
- Attitude Towards Using (ATTITUDE)
- Intention to Use (ITU)
27
TAM Model Used
- H1: Perceived Ease of Use for MART has a
significant effect on the Perceived Usefulness of
MART.
- H2: Perceived Ease of Use for MART has a
significant effect on the Attitude Towards Using
MART.
- H3: Perceived Usefulness of MART has a
significant effect on the Attitude Towards Using
MART.
- H4: Perceived Usefulness of MART has a
significant effect on Intention to Use MART.
- H5: Attitude Towards Using MART has a
significant effect on Intention to Use MART.
28
TAM Question
29
Precursory Analysis of Those Taking the Test
30
Results from Respondents
- People responded most positively to the ease of
use
- Least positively to the intention to use
31
Hypothesis Testing
- After performing Cronbach's alpha to determine
reliability and PCA to explain variance,
hypotheses were tested from groups of questions
used in a survey.
32
Hypothesis Testing
33
Hypothesis Testing
34
Explanation of Hypothesis Testing
- H3 not accepted: Strong correlation between
Perceived Ease of Use and Attitude Towards Using.
Perceived Ease of Use is more of a contributing
factor to the Attitude Towards Using than
Perceived Usefulness.
- H5 not accepted: Respondents' impression about the
open source environment in place to implement
MART; however, H5 was accepted at approximately 70% CI.
35
Conclusions
- Growing schism between data creation and its
assessment
- Metadata reinforces quality control and quality
assurance procedures used by an organization
- We need a method so everyone can assess various
dimensions of metadata in a timely manner
- MART serves as a way to evaluate GIS metadata
- MART provides a forum so users can interact with
GIS metadata with an end goal of supporting
business decisions which ultimately save time and
money
36
Conclusions
- Quantitative metadata elements such as FGDC
compliancy, date and horizontal accuracy can be
assessed using programming languages such as R
and Perl
- Users can search GIS metadata using supervised
techniques via a web interface
- Association Rule Mining can be applied to GIS
metadata
- If given a choice, users prefer to query GIS
metadata rather than receiving results from
unsupervised techniques
- Using TAM, 3 of 5 research hypotheses
supported at 95% CI
- Based on user feedback, the implementation or
necessity of MART and an open source environment within
their IT was the greatest hindrance to some users'
intention to use MART
37
Discussion
- Integration of MART with other types of
geo-referenced data
- Remotely sensed data and ortho-imagery
(Laboratory for Advanced Information Technology
and Standards)
- TINs
- Topologies
- Relationship Classes
- Stand-alone tables
- Metadata and proprietary format
- Usability with current GIS software
- VBA / ArcObjects to convert metadata in BLOB
format to XML for ESRI software
- Various accuracies within MART
- Temporal and Horizontal
- Attribute
- Logical Consistency
- Semantic
38
Discussion
- Interestingness problem
- 6,204 rules at support level 2 and confidence of
.7
- Cardinality of data
- 43 attributes: 6,204 rules
- 6 attributes: 73 rules
- Attributes selected were Data Theme, Location,
Horizontal Accuracy, Publication Date,
Responsible Party, Metadata POC
- TAM methodology: different models and hypotheses
could be proposed
- Presentation of unsupervised techniques in a text
file. A web environment might be more useful
- Knowledge of open source environment
39
- Questions or Comments?
40
(No Transcript)
41
(No Transcript)
42
Decomposition of XML File Using Perl
- $metadata{"r01_Data_Set_Title"} =>
"idinfo_citation_citeinfo_title"
- Using the following command
- # traverse all files
- foreach $filename (@files) {
- $filename =~ s/\//\\/g; # change all forward
slashes to back-slashes to match proper
navigation
- print "\n\n. Decomposing ",
basename($filename), ". \n";
- # Create structure to traverse XML schema.
Before going to the next value, however,
- # we have to reset the hash value.
- $tree = XMLin($filename);
- $metadata{"rMissing"} = "";
- $metadata{"fileName"} = $filename;
- $metadata{"sMissing"} = "";
- foreach $key (sort keys %SearchList) {
- print "$key $SearchList{$key}\n";
- $Item = FindItem($SearchList{$key});
- print "Item $Item\n\n";
- }
- }
43
National Map Accuracy Standards
For scales larger than 1:20,000: .033 inches,
measured on the map
For scales of 1:20,000 or smaller: .02
inches, measured on the map
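Illustrative worked example, under stated assumptions: a tolerance expressed in inches on the map becomes a ground distance when multiplied by the scale denominator. The sketch below uses the two tolerances quoted on this slide and assumes that 1:20,000 itself falls in the .02-inch class. The function names are hypothetical.

```python
INCHES_PER_FOOT = 12
METERS_PER_INCH = 0.0254


def map_tolerance_inches(scale_denominator):
    """Tolerance on the map, using the two values quoted on the slide."""
    # Assumption: scales larger than 1:20,000 (smaller denominator) get .033.
    return 0.033 if scale_denominator < 20000 else 0.02


def ground_tolerance(scale_denominator):
    """Return the tolerance on the ground as (feet, meters)."""
    inches = map_tolerance_inches(scale_denominator) * scale_denominator
    return inches / INCHES_PER_FOOT, inches * METERS_PER_INCH


# 1:24,000 map: 0.02 in x 24,000 = 480 in on the ground, i.e. 40 ft.
feet_24k, meters_24k = ground_tolerance(24000)
# 1:12,000 map: 0.033 in x 12,000 = 396 in on the ground, i.e. 33 ft.
feet_12k, meters_12k = ground_tolerance(12000)
```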
44
Sample Output from Supervised Techniques
45
Cronbach's Alpha
Cronbach's alpha is computed using the number of
items in the set, the variance of the data
and the mean of the covariance between all members of
the set. While there is no universal threshold
to determine data consistency, Hair et al.
(1998) suggested a minimum threshold between .6
and .7. Based on Table 10, only one of these values
(Perceived Ease of Use) is between .6 and .7
while two values (Perceived Usefulness and
Attitude Towards Using) are between .7 and .8.
The Cronbach's alpha coefficient for the Intention
to Use component is .807, which is considered
excellent (Nunnally 1978). Given these values,
it can be surmised that the questions posed to
the respondent ser
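Illustrative sketch of the calculation described above, assuming the common covariance form of Cronbach's alpha: alpha = k * c / (v + (k - 1) * c), where k is the number of survey questions (items) in a construct, v is the mean item variance and c is the mean covariance between pairs of items. The scores are made up and are not survey data from this study.

```python
from itertools import combinations
from statistics import mean, variance


def covariance(x, y):
    """Sample covariance of two equally long score lists."""
    mean_x, mean_y = mean(x), mean(y)
    return sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (len(x) - 1)


def cronbach_alpha(items):
    """items: k lists, each holding one question's scores across respondents."""
    k = len(items)
    mean_variance = mean(variance(scores) for scores in items)
    mean_covariance = mean(covariance(a, b) for a, b in combinations(items, 2))
    return (k * mean_covariance) / (mean_variance + (k - 1) * mean_covariance)


# Made-up Likert responses: three questions answered by five respondents.
question_scores = [
    [4, 5, 3, 4, 2],
    [4, 4, 3, 5, 2],
    [5, 5, 2, 4, 3],
]
alpha = cronbach_alpha(question_scores)

# Items that move in lockstep are perfectly consistent, so alpha is 1.
identical = cronbach_alpha([[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]])
```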
46
Principal Components
To help understand the individual factors that
contribute to the potential inconsistency,
principal component analysis was performed on
each of the individual questions to help
determine their potential contribution to the
variability of the observed results. Four
factors were calculated, based on the different
components of the research hypotheses to be
tested. After rotation, the Perceived Ease of
Use accounted for 56.33% of the variance. The
Perceived Usefulness component accounted for
11.93%, Attitude Towards Using accounted for
9.24%, while the Intention to Use factor accounted
for 7.04%. Table 11 shows the items and factor
loadings for the individual factors. Finally,
other basic correlations were run between
potentially dependent factors such as sex and age
to help determine their potential contribution to
the results. However, no significant correlation
was found between participants' age, gender and
even self-described GIS experience versus
dependent variables such as Perceived Ease of
Use, Perceived Usefulness, Attitude and Intention
to Use that will be used in the TAM analysis.
47
Potential RS Attributes