The Effectiveness of Test Coverage Criteria for Relational Database Schema Integrity Constraints

McMinn, Phil; Wright, Chris J.; Kapfhammer, Gregory M.

doi:10.1145/2818639

Persistent URL

http://hdl.handle.net/10456/40668

Author(s)

McMinn, Phil

Wright, Chris J.

Kapfhammer, Gregory M.

Date Issued

December 2, 2015

Abstract

Despite industry advice to the contrary, there has been little work that has sought to test that a relational database's schema has correctly specified integrity constraints. These critically important constraints ensure the coherence of data in a database, defending it from manipulations that could violate requirements such as “usernames must be unique” or “the host name cannot be missing or unknown.” This article is the first to propose coverage criteria, derived from logic coverage criteria, that establish different levels of testing for the formulation of integrity constraints in a database schema. These range from simple criteria that mandate the testing of successful and unsuccessful INSERT statements into tables to more advanced criteria that test the formulation of complex integrity constraints such as multi-column PRIMARY KEYs and arbitrary CHECK constraints. Due to different vendor interpretations of the structured query language (SQL) specification with regard to how integrity constraints should actually function in practice, our criteria crucially account for the underlying semantics of the database management system (DBMS). After formally defining these coverage criteria and relating them in a subsumption hierarchy, we present two approaches for automatically generating tests that satisfy the criteria. We then describe the results of an empirical study that uses mutation analysis to investigate the fault-finding capability of data generated when our coverage criteria are applied to a wide variety of relational schemas hosted by three well-known and representative DBMSs—HyperSQL, PostgreSQL, and SQLite. In addition to revealing the complementary fault-finding capabilities of the presented criteria, the results show that mutation scores range from as low as just 12% of mutants being killed with the simplest of criteria to 96% with the most advanced.
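As a rough illustration of the ideas in the abstract, the sketch below runs the same INSERT statements against a schema and a mutated copy of it, then checks whether the accept/reject outcomes differ. It is a minimal example under stated assumptions, not the article's criteria or test generators: the `accounts` table, its columns, the test rows, and the helper `run_inserts` are all hypothetical, and only SQLite (through Python's standard `sqlite3` module) is exercised.

```python
import sqlite3

# Hypothetical schema encoding the abstract's two example requirements:
# usernames must be unique, and the host name cannot be missing.
ORIGINAL_SCHEMA = """
CREATE TABLE accounts (
    username TEXT UNIQUE,
    host     TEXT NOT NULL
)
"""

# Hypothetical mutant: the UNIQUE constraint on username has been removed.
MUTANT_SCHEMA = """
CREATE TABLE accounts (
    username TEXT,
    host     TEXT NOT NULL
)
"""

# Test rows chosen so that each constraint is exercised by an INSERT that
# should succeed and an INSERT that should fail.
TEST_INSERTS = [
    ("alice", "example.org"),  # valid row
    ("alice", "example.net"),  # duplicate username
    ("bob", None),             # missing host
]


def run_inserts(schema, rows):
    """Create the table in a fresh in-memory database and record, for each
    row, whether the DBMS accepted (True) or rejected (False) the INSERT."""
    conn = sqlite3.connect(":memory:")
    conn.execute(schema)
    outcomes = []
    for row in rows:
        try:
            conn.execute(
                "INSERT INTO accounts (username, host) VALUES (?, ?)", row
            )
            outcomes.append(True)
        except sqlite3.IntegrityError:
            outcomes.append(False)
    conn.close()
    return outcomes


original_outcomes = run_inserts(ORIGINAL_SCHEMA, TEST_INSERTS)
mutant_outcomes = run_inserts(MUTANT_SCHEMA, TEST_INSERTS)

# A mutant counts as killed when at least one INSERT is treated differently
# by the mutated schema than by the original schema.
mutant_killed = original_outcomes != mutant_outcomes
print(original_outcomes, mutant_outcomes, mutant_killed)
```

In this sketch the duplicate-username row is what separates the two schemas. A test suite containing only the valid row would leave the mutant alive, which is the kind of gap that stronger coverage criteria are meant to close.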

Journal

ACM Transactions on Software Engineering and Methodology

Department

Computer Science

Citation

Phil McMinn, Chris J. Wright, and Gregory M. Kapfhammer. 2015. The effectiveness of test coverage criteria for relational database schema integrity constraints. ACM Trans. Softw. Eng. Methodol. 25, 1, Article 8 (November 2015), 49 pages. DOI: http://dx.doi.org/10.1145/2818639

Publisher

Association for Computing Machinery

Version of Article

Published article

DOI

10.1145/2818639

ISSN

1049-331X

1557-7392

Subjects

Software testing

coverage criteria

relational database s...

schema testing

integrity constraints...

automatic test data g...

mutation analysis

search-based software...

File(s)

Name

Kapfhammer 2015 ACM.pdf

Description

Published article

Size

2.11 MB

Format

Adobe PDF

Checksum (MD5)

92b4dfdf13deaf75a217825c2e6facdf