An Engineering Problem 1. A first step is to articulate a clear statement of the problem 2.
2006, Carla Ellis Vague idea 1. Understand the problem, frame the questions, articulate the goals. A...
-
Upload
avis-fields -
Category
Documents
-
view
213 -
download
0
description
Transcript of 2006, Carla Ellis Vague idea 1. Understand the problem, frame the questions, articulate the goals. A...
© 2006, Carla Ellis
Vague idea
1. Understand the problem,frame the questions, articulate the goals.A problem well-stated is half-solved.Why, not just what
“groping around” experiences
Hypothesis
Model
Initialobservations
Experiment
Data, analysis, interpretation
Results & finalPresentation
Experimental Lifecycle
© 2006, Carla Ellis
What can go wrong at this stage?
• Never understanding the problem well enough to crisply articulate the goals / questions / hypothesis.
• Getting invested in some solution before making sure a real problem exists. Getting invested in any desired result. Not being unbiased enough to follow proper methodology.– Any biases should be working against yourself.
• Fishing expeditions (groping around forever).• Having no goals but building apparatus for it 1st.
– Swiss Army knife of simulators?
© 2006, Carla Ellis
Strong InferenceJ. Pratt
• Progress in science advances by excluding among alternate hypotheses.
• Experiments should be designed to disprove a hypothesis.– A hypothesis which is not subject to being
falsified doesn’t lead anywhere meaningful– Any conclusion which is not an exclusion is
insecure
© 2006, Carla Ellis
Steps
1. Devise alternative hypotheses2. Devising experiments with alternative
outcomes which will exclude hypothesis3. Carrying our experiment to get clean result4. Repeat with subhypotheses
© 2006, Carla Ellis
Steps0. Identify problem, observed phenomenon1. Devise alternative hypotheses2. Devising experiments with alternative
outcomes which will exclude hypothesis3. Carrying our experiment to get clean result4. Repeat with subhypotheses
© 2006, Carla Ellis
Steps0. Identify problem, observed phenomenon1. Devise alternative hypotheses2. Devising experiments with alternative
outcomes which will exclude hypothesis3. Carrying our experiment to get clean result4. Repeat with subhypotheses
Intellectual Challenge – to do this efficiently
© 2006, Carla Ellis
Logical Tree• Our conclusion X
might be invalid if alternative hypothesis 1, alternative hypothesis 2, … alternative hypothesis n
• We describe experiments to eliminate alternatives.
• We proceed along the branches not eliminated.
Problem
Alt 1 Alt n…
Alt1a Alt1b
© 2006, Carla Ellis
Multiple Hypotheses• One can become emotionally “attached” to a
single hypothesis– Temptation to demonstrate it is right, make facts fit
the theory.• Multiple working hypotheses turns research
into a competition among ideas rather than among personal agendas– Gets at the issue of bias
© 2006, Carla Ellis
“Support Activities” in Science
• Surveys and taxonomy• Experimental infrastructure development• Measurements and tables
(e.g. file system usage studies)• Theoretical/abstract modelsUseful, provided they contribute to chain of
discovery but not as ends in themselves.
© 2006, Carla Ellis
The QuestionApply to one’s own thinking (but useful in
someone else’s talk)• What experiment could disprove your
hypothesis?or• What hypothesis does your experiment
disprove?
© 2006, Carla Ellis
Applying Strong Inference to Computer Systems Research
This has not been our culture– “Mine is better than theirs” and experiments that
show this affirmatively (not honestly attempted to show otherwise)
– Non-hypotheses – statements that really can’t be shown to be false.“This system does what it was designed to do” (by definition).
– Negative results are hard-sells to publishIssue is scientific effectiveness.
© 2006, Carla Ellis
A Good ExampleWolman et al, On the scale and
performance of cooperative web proxy caching, SOSP 99
Question: Should multiple proxies cooperate in order to increase client populations, improve hit ratios, and reduce latency?
© 2006, Carla Ellis
Logical treeCoop web caching works
Increasehit ratio,ideal case
Decreaseobjectlatency, ideal case
…
Increasehit ratio,real case
© 2006, Carla Ellis
Experiments• Web traces at UW and Microsoft• Simulation:
– Infinite cache size (no capacity misses)– Single proxy (sees all information, no overhead)– 2 cases
• Ideal caching – all documents in spite of “cachability”• Respecting cacheability
• Upper bound on performance
Beyond the knee, no significant improvement
Singleproxyenoughhere
Little impact on latency beyond small populations
© 2006, Carla Ellis
Discussion• What do you think computer scientists
are doing wrong?• Why doesn’t this approach seem natural
to us?• How can we improve?• Will system research look significantly
different if strong inference can be applied regularly?
© 2006, Carla Ellis
Discussion Next Time:Exercise in Strong Inference
• Pick one paper that seems like an important scientific advance and recast its experimental evaluation in terms of hypotheses and experiments to exclude (as a logical tree).
© 2006, Carla Ellis
Vague idea
1. Understand the problem,frame the questions, articulate the goals.A problem well-stated is half-solved.Why, not just what
“groping around” experiences
Hypothesis
Model
Initialobservations
Experiment
Data, analysis, interpretation
Results & finalPresentation
Experimental Lifecycle
© 2006, Carla Ellis
Back of the Envelope(SEESAW)
What information do we need to know?
Sending sWReceiving rWListening iWSleeping zW
© 2006, Carla Ellis
Hypothesis(SEESAW)
• Asymmetric MAC protocol can extend network lifetime by balancing energy consumption (battery depletion)– An asymmetric protocol does not waste energy
• in control overhead,• in message loss and retransmission.
– An asymmetric protocol can be automatically tuned.• can be hand-tuned.• can be tuned off-line algorithmically
– An asymmetric protocol has acceptable performance • message latency• Message throughput
– There is opportunity in balancing.