1 Exploiting Random Walk Strategies in Reasoning Wei.
-
Upload
darren-wheeler -
Category
Documents
-
view
220 -
download
1
Transcript of 1 Exploiting Random Walk Strategies in Reasoning Wei.
1
Exploiting Random Walk Strategies in
Reasoning
Wei Wei
2
Universal inference engine One of the oldest AI dreams
General Problem Solver (Newell & Simon, 1961)Use logic reasoning (“Program with Common Sense”, McCart
hy 1968) Largely unsuccessful due to the tradeoff between co
mputational complexity and representation power. => many successful systems are domain-specific (e.g. Dendral, 1968-85, organic chemistry)
Recent improvement in computer’s abilities to perform large-scale search => Revisit the original idea
3
Computational Power - SAT Hardware power Average-case
complexity New algorithmic tools:
randomization, learning
Increasing demand: verification
Research input: annual conference, competition
year #variable
1962 50
1992 300
1996 1,000
2002 1,000,000
4
Boolean Satisfiability Problem Boolean Satisfiability Problem (SAT) asks if a Boolean
expression can be made true by assigning Boolean values to its variables.
The problem is well-studied in AI community with direct application in reasoning, planning, CSP, VLSI microprocessor verification etc.
Does statement s hold in world A (represented by a set of clauses)?
A s (¬s) ^ A unsatisfiable
5
SAT SAT (even 3SAT) is NP-complete. Best theoreti
cal bound so far is (2-2/k)N randomized (Schoening 1999) or
(2-2/(k+1))N deterministic (Dantsin et al 2002) for k-SAT
In practice, there are two different kinds of solvers DPLL (Davis, Logemann and Loveland 1962) Local Search (Selman et al 1992)
6
DPLL (x1 x2 x3) (x1 x2 x3) (x1 x2) DPLL was first proposed as a basic depth-
first tree search.
x1
x2
FT
T
null
F
solution
x2
Potential problem: early commitment
7
DPLL Recently (since late 90’s), many
improvements: Randomization restarts out-of-order backtracking clause learning
8
Local Search(x1 x2 x3) (x1 x2 x3) (x1 x2 ) The idea: Start with a random assignment.
And make local changes to the assignment until a solution is reached (010 011 001)
Pro: often efficient in practice. Sometimes the only feasible way for some problems
Con: Cannot prove nonexistence of solutions. Difficult to analyze theoretically.
Example GSAT (Selman et al. 1992)
9
Local Search Schemes local search schemes used:
Simulated annealing Tabu search Genetic algorithms Random Walk and its variants the most successf
ul so far
10
Unbiased (Pure) Random Walk for SATProcedure Random-Walk (RW)Start with a random truth assignmentRepeat
c:= an unsatisfied clause chosen at randomx:= a variable in c chosen at randomflip the truth value of x
Until a satisfying assignment is found
11
Unbiased RW on any satisfiable 2SAT Formula
Given a satisfiable 2SAT formula with n variables, a satisfying assignment will be reached by Unbiased RW in O(n2) steps with high probability. (Papadimitriou, 1991)
12
We have an unbiased random walk with a reflecting barrier at distance N from T (max Hamming distance) and an absorbing barrier (satisfying assignment) at distance 0.
We start at a Hamming distance of approx. ½ N.
Property of unbiased random walks: after N^2 flips, with high probability, we will hit the origin (the satisfying assignment). (Drunkards walk)
So, O(N^2) randomized algorithm (worst-case!) for 2-SAT.
T A0T A0
reflecting
13
Unbiased RW on 3SAT Formulas
Random walk takes exponential number of steps to reach 0.
T A0
reflecting
14
Comments on RW
1) Random Walk is highly “myopic” does not take into account any gradient of the objective function (= number of unsatisfied clauses)!
Purely “local” fixes.
2) Can we make RW practical for SAT? Yes --- inject greedy bias into walk biased Random Walk.
15
Biased Random Walk
Procedure WalkSatRepeat
c:= an unsatisfied clause chosen at random
if there exist a variable x in c with break value = 0 // greedy biasflip the value of x (“freebie move”)
else with probability p // pure walk
x:= a variable in c chosen at random flip the value of x
with probability (1-p)x:= a variable in c with smallest break value // more greedy biasflip the value of x
Until a satisfying assignment is found
Note: tune parameter p.
16
Random Walk algorithms Random walk algorithm (e.g. Walksat) offer sig
nificant improvement on performance over hill-climbing algorithms.
ID Vars clauses GSAT+w WalksatSsa7552-038 1501 3575 129 2.3
Ssa7552-158 1363 3034 90 2
Ssa7552-159 1363 3032 14 0.8
Ssa7552-160 1391 3126 18 1.5
17
First, bringing out the worst in random walks…
(2-SAT)
X1 X2
X2 X3
X3 X4
Xn X4
Xn X1
Note: Only 2 satisfying assignments, all False and all True.
18
Binary Chains
Walk is exactly balanced.
19
Results: Speeding up Random Walks on Binary Chains
*: empirical results**: proof available
Pure binary chain
Chain withredundancies
RW (n2)** (n2)**
RWF (n2)** (n1.2)*
WalkSat (n2)** (n1.1)*
Becomesalmost likeunit prop.
20
Ternary Chains
In general, even a small bias in the wrong direction leads to exponential time to reach 0.
reflecting
21
What about 3-SAT? Again, consider “chain” formulas.
X1 & X2 X3
X2 & X3 X4
X_(n-2) & X_(n-1) X_n
X1 & X2
X1 & X2
X_floor(n/2) & X_(n-1) X_n
22
Theoretical Results on 3-SAT Chains
Function low(i) Expected run time of pure RW
i-2 (highly local) ~ Fib(n) (i.e., exp.)
i/2 (interm. reach) ~ nlog n (i.e., quasi-poly)
log i (interm. reach) ~ n2 . (log n)2 (i.e., poly)
1 (full back reach) ~ n2
low(i) captures how far back the clauses reach.
23
Recurrence RelationsOur formula structure gives us:
E(f(zi)) = (E(f(zlow(i)) + E(f(zi) + 1) * 1/3
+ (E(f(zi-1) + E(f(zi) + 1) * 1/3
+ 1 * 1/3
E(f(zi)) = E(f(zlow(i)) + E(f(zi-1) + 3
24
Decompose: multiple “0”s
110101010101
110001010001
100001
111001110001
101001111001
111111111101
110101010101
110001100001
111001110001
110101
010001
101001111001111101111111
110111010111
110111100111
111111110111
111101
111001
101001111001111101111111
110111 111111
111101 111111
Start
Sat assign.
25
Recap Chain Formula Results Adding implied constraints capturing long-range
dependencies speeds random walk on 2-Chain to near linear time.
Certain long-range dependencies in 3-SAT lead to poly-time convergence of random walks.
Can we take advantage of these results on practical problem instances?
Yes! (next)Our methodology: Identify, analyze, and “exploit” special tractable structure in large practical reasoning problems.
26
Empirical Results
SSS-SAT-1.0 instances (Velev 1999). 100 total. level of redundancy added (20% near optimal)
Formulas(redun. level)
<40 sec <400 sec
<4000 sec
= 0.0 15 26 42
= 0.2 85 98 100
= 1.0 13 33 64
28
Probabilistic Reasoning Previously, we asked “does statement s hold
in world A (represented by a set of clauses)?”
A s (¬s) ^ A unsatisfiable Now, what is the probability that statement s
holds in world A?
Pr = #( s ^ A ) / #A
29
Probabilistic Reasoning Close connection between counting and samp
ling (Jerrum et al, 1986). Bayesian Net Queries can be encoded as #SAT.
(Kautz, 2004) How to Sample:
By repeated counting using DPLL algorithms Monte Carlo Markov Chain Method Use state-of-the-art local search methods Note: Random Walk not uniform
30
Characteristics of Solution space: Solution Clustering Visualization with multi-dimensional scaling (MDS)
Solutions to specific 75 variable, 325 clause 3-SAT instances
75 dimensional solution projected to two dimensions Distance between points approximates hamming distance
31-10 -5 0 5 10 15 20 25-25
-20
-15
-10
-5
0
5
10
15
20SA on SAT-75 - Multi-dimensional Scaling Using Hamming Distance Between Solutions
25 Most Often Found Solutions25 Next Most Often Found SolutionsMiddle Solutions25 Next Least Often Found 25 Least Often Found Solutions
Empirically determined each solution’s probability (uf75-01 - 75 variable, 325 clause 3-SAT instance)
WalkSat finds every solution, but with very large range of probabilities (1:104)
Probability Clusters
Solution Probability Using WalkSat Algorithm
32
Probability Ranges in Different Domains
Instance Runs Hits Rarest
Hits Common
Common-to -Rare
Ratio
Random 50*106 53 9*105 1.7*104
Logistics 1* 106 84 4*103 50
33
Improving the Uniformity of Sampling - mixing sampling strategy
To reduce the range of probabilities, we propose a hybrid local search algorithm: With probability p, the algorithm makes a biased
random walk move With probability 1-p, the algorithm makes a SA (si
mulated annealing) move In our experiments, we used
50% WalkSat + 50% SA at a fixed temperature
34
Results of the Hybrid Approach
Our key figure.
35
Solution Clusters
Results on a random 3-SAT instance (70 vars, 301 clauses, 2531 solutions).
36
Summary
1) WalkSAT does sample all solutions.2) But, sampling can be highly biased.3) Using a new hybrid strategy, we can obtain effective near-uniform sampling.
Lesson: Hybrid of SA and biased walk, is a promising alternative to MCMC methods for sampling.
Idea: Use SAT solvers to sample solutions from a combinatorial space.Findings:
37
Research Directions
Can we exploit techniques from DPLL solvers, such as learned clauses (inferred structure) in local search?
Can we identify other classes of structure in real-world problem that can further accelerate random walk style SAT solvers?
Can we design better SAT encodings based on our insights about structure?
38
Research Directions, cont. Solution sampling
Main Challenge: moving between clusters evaluation tools.
Randomize DPLL to sample solution space Early indicator of solution counts.
Compare with Bayesian/probabilistic state-of-the-art inference methods. Can we outperform them?