Today

Another approach to “coverage”
• Cover “everything” within a well-defined, feasible limit
• Bounded Exhaustive Testing

Coverage Revisited

With some kinds of coverage, we expect to be able to reach 100% coverage

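    if (x < y) {
        y = 0;
        x = x + 1;
    } else {
        x = y;
    }

[Figure: control flow graph of the code above. The edge labeled x < y leads to y = 0 and then x = x + 1; the edge labeled x >= y leads to x = y.]

Statement coverage: cover every reachable statement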

Coverage Revisited

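    if (x < y) {
        y = 0;
        x = x + 1;
    }

[Figure: control flow graph of the code above. The edge labeled x < y leads to y = 0 and x = x + 1; the edge labeled x >= y skips them.]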

Branch coverage
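The difference from statement coverage shows up on the `if` without an `else`. The following is a minimal sketch, assuming a hand-instrumented Python port of the slide's code; the names `run` and `coverage` and the string labels are made up for illustration.

```python
ALL_STATEMENTS = {"y = 0", "x = x + 1"}
ALL_BRANCHES = {"x < y", "x >= y"}

def run(x, y, statements, branches):
    """Instrumented port of: if (x < y) { y = 0; x = x + 1; }"""
    if x < y:
        branches.add("x < y")
        statements.add("y = 0")
        y = 0
        statements.add("x = x + 1")
        x = x + 1
    else:
        # No statement lives on this branch, only the branch itself.
        branches.add("x >= y")
    return x, y

def coverage(tests):
    """Return (all statements covered?, all branches covered?) for a test suite."""
    statements, branches = set(), set()
    for x, y in tests:
        run(x, y, statements, branches)
    return statements == ALL_STATEMENTS, branches == ALL_BRANCHES

print(coverage([(1, 2)]))          # one test with x < y
print(coverage([(1, 2), (2, 1)]))  # adds a test with x >= y
```

A single test with x < y executes every statement yet leaves the x >= y branch untaken, so only the second suite reaches full branch coverage.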

Coverage Revisited

Input space partitioning

Coverage Revisited

With other kinds of coverage, we know that reaching 100% is very difficult, perhaps completely infeasible

    if (x < y) {
        y = 0;
        x = x + 1;
    } else {
        x = y;
    }

    if (x < y) {
        y = 0;
        x = x + 1;
    }

[Figure: control flow graphs of the two conditionals above, with edges labeled x < y and x >= y.]

Path coverage: exponential in the number of conditional branches!
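A rough illustration of the blow-up, assuming a program made of n two-way conditionals in sequence whose outcomes are independent of each other. In the slide's example some combinations may be infeasible, so this is an upper bound. The function name `enumerate_paths` is made up for this sketch.

```python
from itertools import product

def enumerate_paths(num_conditionals):
    """List every path through a sequence of independent if/else statements.

    Each path is a tuple of branch outcomes, one per conditional.
    """
    return list(product(("then", "else"), repeat=num_conditionals))

for n in (1, 2, 10, 20):
    # Branch coverage has 2 * n branch outcomes to hit,
    # while path coverage has 2 ** n combinations to hit.
    print(n, "conditionals:", 2 * n, "branches,", len(enumerate_paths(n)), "paths")
```

The branch count grows linearly while the path count doubles with every added conditional.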

Coverage Revisited

State space coverage

Coverage Revisited

Complete input coverage

4 operations × {0, 1, 2, 3, 4, …, 2^32} × {0, 1, 2, 3, 4, …, 2^32} = TOO MUCH
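To put a number on “too much”, here is a back-of-the-envelope calculation. It assumes each of the two arguments ranges over 2^32 values and that a test harness could run 10^9 tests per second; both figures are assumptions made for this sketch.

```python
OPERATIONS = 4
VALUES_PER_ARGUMENT = 2 ** 32  # assumption: each argument is a 32-bit integer

total_inputs = OPERATIONS * VALUES_PER_ARGUMENT * VALUES_PER_ARGUMENT

TESTS_PER_SECOND = 10 ** 9            # assumed, optimistic execution rate
SECONDS_PER_YEAR = 365 * 24 * 60 * 60
years = total_inputs / (TESTS_PER_SECOND * SECONDS_PER_YEAR)

print(total_inputs == 2 ** 66)
print(f"{total_inputs:.2e} inputs, about {years:.0f} years at 1e9 tests per second")
```

Even at that rate, a single pass over the input space would take on the order of thousands of years.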

Coverage Revisited

What if we aim for exhaustive coverage, but arbitrarily limit the size somehow?
• Daniel Jackson: “small scope hypothesis”

“Our approach is simply to truncate the state space artificially, checking only within some finite bounds.” - Jackson and Damon, “Elements of Style: Analyzing a Software Design Feature with a Counterexample Detector”

Small Scope Hypothesis

Remember “downward scalability”?

Idea related (in a hand-waving fashion) to small model property in logics:
• For some logical formulas, though in principle the variables may have infinite domains, it can be shown that you only have to consider a finite set of individuals when checking for satisfiability
  • Bounded by length of the formula

The idea in a nutshell:
• Most faults can be exposed by some “short” failing trace
• Short may mean in # of operations
• Short may mean in complexity of input structures
  • How many nodes in the red-black tree?
• Short may mean something else entirely
  • Number of voluntary thread context switches
  • Small flash device
  • Small # of different pathname components
  • Bounded pathname length

Obvious dangerous exception: resource bound violations
• Can be handled by “shrinking” the resource bound
• May require “shrinking” types in a program
  • E.g., converting some ints to chars
• May be difficult, depending on program and language

“All” Binary Trees of Size 3

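[Figure: binary trees drawn as nodes such as N0 and N2, connected by left and right links.]

Do we care about the actual elements?

[Figure: the same kind of node diagram, with nodes N0 and N2 and their left and right links.]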

What if these were red-black trees?

In general: exploit isomorphisms
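One way to exploit the isomorphism is to enumerate tree shapes only and ignore which node object sits where. The following is a minimal sketch, assuming a shape is either `None` or a `(left, right)` pair; the function name `tree_shapes` is made up for illustration.

```python
def tree_shapes(size):
    """Yield every binary tree shape with `size` nodes.

    A shape is None (empty) or a pair (left, right); node identities and
    element values are deliberately left out.
    """
    if size == 0:
        yield None
        return
    for left_size in range(size):
        for left in tree_shapes(left_size):
            for right in tree_shapes(size - 1 - left_size):
                yield (left, right)

shapes = list(tree_shapes(3))
print(len(shapes))  # 5
for shape in shapes:
    print(shape)
```

There are only 5 shapes of size 3. Distinguishing the three node objects would give 3! × 5 = 30 trees, which fall into those 5 groups of isomorphic trees.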

Enumerating “All” Inputs

For more complex data structures, enumerating all valid inputs may be a complex problem
• E.g., all “programs” (random ASCII sequences) of length 100 vs.
• All parseable C programs of length 100
  • May require enumeration by a constraint solver
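The gap between “all strings” and “all valid inputs” can be seen on a toy grammar. The sketch below uses balanced parentheses as a stand-in for parseable programs and a naive generate-and-filter loop as a stand-in for a constraint solver; both substitutions are assumptions made to keep the example small.

```python
from itertools import product

def is_well_formed(text):
    """Validity check: parentheses are balanced and never close too early."""
    depth = 0
    for ch in text:
        depth += 1 if ch == "(" else -1
        if depth < 0:
            return False
    return depth == 0

def all_strings(length, alphabet="()"):
    """Every string of the given length over the alphabet."""
    return ("".join(chars) for chars in product(alphabet, repeat=length))

def all_valid_inputs(length):
    """Naive enumeration: generate everything, keep what passes the check."""
    return [s for s in all_strings(length) if is_well_formed(s)]

print(len(list(all_strings(6))), "strings,", len(all_valid_inputs(6)), "valid")
```

Generate-and-filter wastes almost all of its work as the length grows, which is why a solver that prunes invalid prefixes early may be needed.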

Enumerating “All” Inputs

Enumeration may require staging
• Constraint solver generates a set of abstract inputs, satisfying the constraints
• A postprocessor then concretizes each abstract input into a (large) set of concrete inputs
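The two stages can be sketched on a stack API. Here a simple filter stands in for the constraint solver, the constraint is “never pop from an empty stack”, and the value pool is tiny; all names and choices are hypothetical.

```python
from itertools import product

def abstract_inputs(length):
    """Stage 1: operation sequences that never pop from an empty stack."""
    for ops in product(("push", "pop"), repeat=length):
        depth = 0
        valid = True
        for op in ops:
            depth += 1 if op == "push" else -1
            if depth < 0:
                valid = False
                break
        if valid:
            yield ops

def concretize(ops, values=(0, 1)):
    """Stage 2: fill every push with each possible value from a small pool."""
    push_count = ops.count("push")
    for chosen in product(values, repeat=push_count):
        remaining = iter(chosen)
        yield [(op, next(remaining)) if op == "push" else (op, None) for op in ops]

for ops in abstract_inputs(2):
    print(ops, "->", list(concretize(ops)))
```

Each abstract sequence fans out into one concrete test per choice of pushed values, so the concrete set is larger than the abstract one.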

Bounded (Depth) Model Checking

All states reachable with a path of no more than k transitions

Or in which no loop executes more than k times…

Bounded (Depth) Model Checking

Can’t quite do the naïve thing:

pan -m1000

Won’t guarantee reaching every state reachable in 1,000 steps

Why not?
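One likely answer, assuming the search is depth-first and remembers visited states: a state first reached along a long path sits at the depth limit, is marked visited, and is never expanded again when a shorter path reaches it later. The four-state graph and function names below are made up to show the effect; this is a sketch of the general phenomenon, not of the model checker's actual implementation.

```python
from collections import deque

# State 2 is reachable in one step (0 -> 2) or in two steps (0 -> 1 -> 2).
GRAPH = {0: [1, 2], 1: [2], 2: [3], 3: []}

def dfs_bounded(start, bound):
    """Depth-first search with a visited set and a depth limit."""
    visited = set()

    def visit(state, depth):
        visited.add(state)
        if depth == bound:
            return  # truncated: successors of this state are not explored
        for nxt in GRAPH[state]:
            if nxt not in visited:
                visit(nxt, depth + 1)

    visit(start, 0)
    return visited

def bfs_bounded(start, bound):
    """Breadth-first search: every state is first seen at its minimal depth."""
    depth = {start: 0}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        if depth[state] == bound:
            continue
        for nxt in GRAPH[state]:
            if nxt not in depth:
                depth[nxt] = depth[state] + 1
                queue.append(nxt)
    return set(depth)

print(sorted(dfs_bounded(0, 2)))  # [0, 1, 2]
print(sorted(bfs_bounded(0, 2)))  # [0, 1, 2, 3]
```

State 3 is two transitions from the start (0 → 2 → 3), yet the depth-first search misses it because it first met state 2 at the depth limit.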

Iterative Bounding

Typical approach is iterative
• Start with a small bound
• If you find a failing trace, fix the software or the specification and repeat
• If you can show no faults with bound k
  • Increase the bound and repeat until you can’t exhaustively test for the given bound
• Traditional search technique, iterative deepening
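The loop above might look like the following sketch. The function under test (`buggy_max`, with a seeded bug), the value pool, and the use of `max_bound` as a stand-in for “until you can’t exhaustively test” are all assumptions made for illustration.

```python
from itertools import product

def buggy_max(xs):
    """Seeded bug: the last element is never examined."""
    best = xs[0]
    for x in xs[1:-1]:
        if x > best:
            best = x
    return best

def failing_input(bound, values=(0, 1, 2)):
    """Exhaustively test every non-empty tuple of at most `bound` elements."""
    for length in range(1, bound + 1):
        for xs in product(values, repeat=length):
            if buggy_max(xs) != max(xs):
                return xs
    return None

def iterative_bounding(max_bound):
    """Raise the bound step by step; stop at the first failing input."""
    for bound in range(1, max_bound + 1):
        xs = failing_input(bound)
        if xs is not None:
            return bound, xs
    return max_bound, None

print(iterative_bounding(5))  # (2, (0, 1))
```

Bound 1 passes every test, and bound 2 already yields a failing input, which would then be fixed before the bound is raised again.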

Limitations to Scope/Bound

Stop increasing scope for one of two reasons:
• Difficulty of generating all inputs
  • Clever approaches can often deal with this
• Difficulty of executing all the test cases!
  • This one is more fundamental
  • With large enough k, exhaustive coverage becomes “exhausting”

Evaluation

How does bounded exhaustive testing stack up against random testing?
• Hard to say in general
• Marinov, Andoni, Daniliuc, Khurshid, and Rinard at MIT tried to look at this question for some Java programs
  • “An Evaluation of Exhaustive Testing for Data Structures”

Evaluation

Marinov et al.: let’s use mutation testing and kill rates to compare
• Generated all tests for limit k
• Generated all tests for limit k-1
• Compared the mutation kill rate for the complete k-1 tests to the rate for a random subset of the k tests
  • Subset same size as complete k-1 tests
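The shape of that comparison can be sketched on a toy example. Everything here is hypothetical: the function under test is “maximum of a tuple”, the four mutants are written by hand (one is deliberately equivalent to the original), and the numbers it prints say nothing about the results in the paper.

```python
import random
from itertools import product

def m_skip_last(xs):
    return max(xs[:-1]) if len(xs) > 1 else xs[0]

def m_skip_first(xs):
    return max(xs[1:]) if len(xs) > 1 else xs[0]

def m_min(xs):
    return min(xs)

def m_equivalent(xs):
    """Uses >= instead of >; behaves exactly like the original."""
    best = xs[0]
    for x in xs[1:]:
        if x >= best:
            best = x
    return best

MUTANTS = [m_skip_last, m_skip_first, m_min, m_equivalent]

def all_tests(k, values=(0, 1, 2)):
    """Every non-empty tuple of at most k elements."""
    return [xs for n in range(1, k + 1) for xs in product(values, repeat=n)]

def kill_rate(tests):
    """Fraction of mutants whose output differs from max() on some test."""
    killed = sum(any(m(xs) != max(xs) for xs in tests) for m in MUTANTS)
    return killed / len(MUTANTS)

def compare(k, seed=0):
    """Kill rate of all k-1 tests vs. an equally sized random subset of the k tests."""
    smaller = all_tests(k - 1)
    subset = random.Random(seed).sample(all_tests(k), len(smaller))
    return kill_rate(smaller), kill_rate(subset)

exhaustive_rate, sampled_rate = compare(3)
print(exhaustive_rate, sampled_rate)
```

Fixing the subset size to that of the complete k-1 suite keeps the two suites comparable in cost, so any difference in kill rate comes from how the tests were chosen.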

Evaluation

[Figure: the complete set of k tests, the complete set of k-1 tests, and a random subset of the k tests.]

Evaluation

Benchmark      Scope  Kill-rate  Scope-1
SearchTree     7      99.26%     =
DisjSet        5      95.06%     =
HeapArray      7      95.99%     <
BinomialHeap   7      95.10%     <
FibonacciHeap  5      86.87%     >
LinkedList     7      99.59%     =
SortedList     7      96.40%     <
TreeMap        7      89.08%     <
HashSet        7      91.39%     <
AVTree         5      93.17%     >

Cases where bounded exhaustive testing beat random testing

Evaluation

More interesting facts:
• Scope correlated highly with statement coverage
• Even after statement coverage was complete, increasing scope increased the mutant kill rate

Now we’ll look at one of the most interesting bounded exhaustive testing approaches
• Context bounding
• Idea: limit number of pre-emptive context switches by threads
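A minimal sketch of the idea, assuming two threads that each perform a non-atomic increment (a read step, then a write step). A switch away from a thread that still has steps left counts as a preemption; a switch after a thread finishes is free. The scheduler model and all names are made up for illustration.

```python
def schedules(remaining, current, budget):
    """Yield thread schedules that use at most `budget` preemptions.

    remaining[t] is the number of steps thread t still has to run.
    """
    if not any(remaining):
        yield ()
        return
    for t, steps_left in enumerate(remaining):
        if steps_left == 0:
            continue
        preempts = current is not None and t != current and remaining[current] > 0
        if preempts and budget == 0:
            continue
        after = tuple(n - 1 if i == t else n for i, n in enumerate(remaining))
        for rest in schedules(after, t, budget - int(preempts)):
            yield (t,) + rest

def run(schedule):
    """Execute the two-step increment of each thread in the given order."""
    counter, local, pc = 0, [0, 0], [0, 0]
    for t in schedule:
        if pc[t] == 0:
            local[t] = counter      # step 1: read the shared counter
        else:
            counter = local[t] + 1  # step 2: write back the incremented value
        pc[t] += 1
    return counter

def outcomes(budget):
    """All final counter values seen within the preemption budget."""
    return {run(s) for s in schedules((2, 2), None, budget)}

print(outcomes(0))  # {2}
print(outcomes(1))  # {1, 2}
```

With no preemptions only the two serial schedules run and the counter always ends at 2; allowing a single preemption is enough to expose the lost update.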

Review

Before I hand over to Klaus for good…

Black box (Finite State Machine) testing

Design for testability

Coverage measures

Random testing

Constraint-based testing

Debugging and test case minimization

Using model checkers for testing

Coverage revisited (“small model property”)

Topics in Testing We’ve Covered

Black box (Finite State Machine) testing

• There “are no Turing machines”

• Vasilevskii and Chow algorithm for conformance testing based on spanning trees and distinguishing sets

• Exhaustive testing that cannot miss bugs is often computationally intractable

Design for testability

• Controllability and observability

• Simulation and stubbing, assertions, downward scalability, etc.

Coverage measures

• Not necessarily correlated with fault detection!
• Still useful!

• Graph coverage: node and edge (statement and branch coverage)

• Logic coverage
• Input space partitioning
• Syntax-based coverage

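[Figure: control flow graph for the x < y example (edges x < y and x >= y; nodes x = y, y = 0, x = x + 1), and the example predicate ((a <= b) && !G) || (x >= y).]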

Random testing

• Generate inputs at random
• Explore very large numbers of executions
• Relies on a good automatic test oracle
• Feedback to bias choices away from redundant and irrelevant inputs is useful

• Good baseline for evaluating other methods, and often very effective

Constraint-based testing

• Addresses weaknesses of random testing
  • E.g., finding needles in haystacks, such as where hash(x) = y

• Combines concrete and symbolic execution to generate inputs

• Concrete execution helps where symbolic solvers choke

Debugging and test case minimization

• Automatic minimization of test cases is very valuable for debugging and reducing regression suite size

• Debugging can be considered as an application of the scientific method

• Various techniques exist for using test cases to localize faults

Using model checkers for testing

• Testing based on states, rather than on executions or paths

• Use abstractions to reduce state space

• Use automatic instrumentation to handle the engineering difficulties

Any Questions???
