Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to...

34
Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International Software Benchmarking Standards Group

Transcript of Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to...

Page 1: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Beyond the Statistical Average How trying to keep it too simple can lead

to difficulty

John Ogilvie

Chief Executive Office International Software Benchmarking

Standards Group

Page 2: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Introduction

• This presentation is based on the speaker’s long experience in negotiating and managing software development contracts containing financial commitments to meeting productivity targets using Function Pont based measures

• However the lessons learned can be equally applied to software development departments within organisations

Page 3: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Agenda

• What is the International Software Benchmarking Standards Group (ISBSG)?

• A brief overview of Function Points • Case study in developing a baseline from which

to measure productivity changes

• Lessons implementing a successful Productivity Management Program

Page 4: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

ISBSG Mission

ISBSG is a not-for-profit organisation with the mission: “To improve the management of IT resources by both business and government

Through the provision and exploitation of public repositories of software engineering knowledge that are standardised, verified, recent and representative of current technologies.” We have many international partners including in China:

Page 5: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

To deliver the mission

The ISBSG has established and now grows, maintains and exploits two international repositories of IT history data: - Software Development & Enhancement - Over 7,500 projects - Maintenance & Support - Over 1,000 applications

Page 6: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

The ISBSG is Unique The only international data repositories accessible to all for a modest fee. All ISBSG data is… - Validated and rated in accordance with its quality guidelines - Current, independent and trusted - Captured from a range of organisation sizes and industries - Industry leaders around the world contribute to the ISBSG’s development, offering the highest metrics expertise worldwide Explore ISBSG Offerings at www.isbsg.org

Page 7: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Productivity Rationale • Across all industries, businesses are continually trying to

improve productivity

• In manufacturing, a common measure of productivity is labour expended and/or cost incurred in producing a single unit of output ( eg. a car, a loaf of bread, a light globe)

• The IT industry is no different, with companies requiring

increasing returns on their investments in technology

• As a result, senior managers now usually require their IT organisations to demonstrate continuous productivity improvement

Page 8: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Measuring Output in Software Development

• Without a measure of the output of a software development organisation, management is hampered

• Measurement of inputs alone, such as development effort, reveals little about the performance of an organisation

• The most widely used output measures are:

Lines of Code -Relatively easy to count but are highly dependant on language, programming style and physical structure of the application.

Function Points - More difficult to count but offer several significant advantages:

– Based on user view of the software’s functionality and hence can be understood by the user

– Can be applied at any time in the development lifecycle

– Offer a standard, repeatable process

– Derived without reference to the physical or technical implementation

Page 9: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

International Function Point Users Group

– Pervasive World Wide Membership - 1450 members of IFPUG

– Defined by International Standards:

• IFPUG Counting Practices Manual (CPM);

• (ISO 20926:2003) (Fundamental concepts of Functional Size Measurement –FSM);

– Certification Program: CFPS, Certified Function Point Specialist

– Many International and National Conferences and Special Interest Committees:

Page 10: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Are Function Points Convertible to Effort or Cost?

• Function Point is a measure of size and cannot be converted to effort or cost without knowledge of other attributes of the application

– A useful analogy is in building a house:

• A 400 square metre house is half the size of an 800 square metre house

• But

– The cost of building the house (cost per square metre) will depend on things such as nature of land, materials used, number of bathrooms / bedrooms, quality of fittings etc..

– It is quite possible that the smaller house may cost more to build

• Similarly

– An application of size 800FP delivers twice the business functionality as one of 400FP but the relative cost per FP to develop will also depend on things such as technology platform, complexity, size etc..

Page 11: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Case Study

• ABC company is outsourcing their application development and maintenance.

• They wish to establish a set of targets for annual improvements in productivity based on what they were achieving internally prior to outsourcing.

• The contract with the vendor specified shared risk/reward

• bonus/penalty payments for over/under achievement against targets.

Page 12: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Case Study • At the end of each quarter, the average actual

performance in each segment was calculated and the bonus/penalty rules applied

• No minimum number of data points for the calculation was specified

• In many cases only 1 or 2 actual projects in each category

–After 12 months there was considerable conflict between ABC and vendor with ABC threatening legal action and vendor threatening to exit contract .

• Both over and under achievements were challenged

Page 13: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

What Questions arise in Case Study What are the characteristics of the data we have?

• Shape of distribution

• Handling of Outliers

Baseline:

• How much data is required

• Do performance segments make sense

Measurement:

• How do we determine how productivity has changed

• How much measurement data is required

Page 14: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Data Used in This Presentation For the purposes of this presentation, data was extracted from the ISBSG Development and Enhancement Repository.

• Data Quality Rating: A or B

• Development Type: Enhancement

• Count Method: IFPUG

• Application Group: Business Application

• Development Platform: Mainframe/Midrange/Multi

•Analyses and tables where produced using Minitab Statistical Software

Page 15: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

ISBSG Relative Size • Categorises the Functional Size into groups, similar to

clothing sizes, as follows:

Relative Size Functional Size

1. XXS Extra-extra-small => 0 and <10

2. XS Extra-small => 10 and <30

3. S Small => 30 and <100

4. M1 Medium1 => 100 and <300

5. M2 Medium2 => 300 and <1000

6. L Large => 1,000 and < 3,000

7. XL Extra-large => 3,000 and < 9,000

8. XXL Extra-extra-large => 9,000 and < 18,000

9. XXXL Extra-extra-extra-large => 18,000

Page 16: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Examine the Data: Descriptive Statistics Relative Total

Size Count Mean TrMean StDev Minimum Median Maximum

1. XXS 43 27.01 19.84 45.27 1.40 7.90 236.30

2. XS 157 19.06 14.25 29.83 0.90 10.90 271.60

3. S 424 16.34 12.23 30.27 0.40 10.00 424.90

4. M1 470 13.19 11.47 12.67 0.90 9.60 97.90

5. M2 187 14.52 13.10 12.68 0.80 11.10 80.70

6. L 31 11.69 10.25 11.63 1.00 9.10 42.90

7. XL 4 1.50 * 1.34 0.10 1.30 3.30

8. XXL 2 0.35 * 0.21 0.20 0.35 0.50

Page 17: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Examine the Data

420360300240180120600

PDR (afp)

Dotplot of PDR (afp)

Each symbol represents up to 17 observations.

Page 18: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Examine the Data

420360300240180120600

1. XXS

2. XS

3. S

4. M1

5. M26. L

7. XL8. XXL

PDR (afp)

Rela

tive S

ize

Dotplot of PDR (afp)

Each symbol represents up to 16 observations.

Page 19: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Handling Outliers An outlier is an unusually large or small observation. Outliers can have a disproportionate influence on statistical results, such as the mean, which can result in misleading interpretations

A variety of techniques can be used

• Trim the data by removing the top and bottom 5% - simple to do

• Remove data more than 2 standard deviations from the mean ( simple to do but assumes data has normal distribution)

• Statistical test that all values in the sample are from the same, normally distributed population.

• Graphically using a Boxplot

Page 20: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International
Page 21: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Boxplot

• “Box” shows values in from Quartile 1(Q1) to

Quartile 3(Q3)

• Inter Quartile Range (IQR) is from Q1 to Q3. • Value is Q3 – Q1

• Mean and Median are shown

• “Whiskers” go to 1.5*IQR above and below the box

• An outlier is taken to be any value beyond the Whiskers

• Applying this to each of the size groups and removing sizes 7&8 reduced the number of data points by 106 from 1,319 to 1,231

Page 22: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Descriptive Statistics after Outliers Removed

Relative Total

Size Count Mean TrMean StDev Minimum Median Maximum

1. XXS 38 13.44 11.84 14.92 1.40 6.00 53.80

2. XS 147 13.08 12.31 10.12 0.90 10.30 44.80

3. S 388 10.62 10.13 7.24 0.40 8.60 32.70

4. M1 433 10.20 9.79 6.38 0.90 8.80 30.30

5. M2 178 12.50 11.89 8.78 0.80 10.15 38.90

6. L 28 8.64 8.20 7.04 1.00 6.30 27.70

Page 23: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Data Distribution after Outliers Removed

484032241680

PDR (afp)

Dotplot of PDR (afp)

Each symbol represents up to 3 observations.

Page 24: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

How much data is required for Baseline & Performance Measurement?

• The need in a baseline is to have sufficient data points (n) such that their average will closely estimate the population average .

• One approach , based on the Central Limit Theorem in statistical theory indicates:

• In general try to have n>30

• If data is highly skewed, ideally more data points

• If data is symmetric , less than 30 may suffice

• 5 data points was insufficient for establishing a baseline in the case study

Page 25: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

How much data required for Baseline & Performance Measurement

• In the Case Study, in addition to setting a baseline, ABC wanted to determine if target productivity was being met

• Statistically, the 95% Confidence Interval for the true average of our performance is expressed as:

• “ We are 95% certain that the true mean is contained in the interval: CI=S±1.96σ/√n “

• where:

• S=sample mean, σ=sample standard deviation, n=sample size

Page 26: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Required Sample Sizes

• For example, if we have 15 data points for size M1 projects, with average S, we can be confident at a level of 95% that the true average of M1 projects would be in the range of S±3

• Therefore it is this range, not just the value of S which needs to be considered

If the productivity target is in the range then it has been achieved.

Standard

Deviation

Confidence

Interval

Size M1=6.38 95% 90%

± 1.0 159 112

± 1.5 72 51

± 2.0 42 30

± 2.5 28 20

± 3.0 20 15

± 3.5 16 11

± 4.0 13 9

Sample Size @

Confidence Level

Page 27: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Baseline – Do segments make sense

In deciding what segmentation should be used in establishing the baseline and subsequent performance management, the question is whether there is sufficient evidence that performance is different in each segment.

Too much segmentation reduces the number of data points in each segment which impacts the Confidence Interval of the measurement, as described earlier

Page 28: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

How to Determine if Segments Differ

Page 29: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

How to Determine if Segments Differ

• The Interval Plot indicates that S & M1 could be combined

• The fact that XS and M2 are similar is unexpected

• Possibly need to add further attributes to data selection criteria

• The CI for XXS and L is too large to be useful due to small numbers of data points ( 38 and 28 respectively)

Page 30: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Baseline Recommendations

• Beware of basing conclusions on small numbers of data points

• Check data for outliers • Try and determine reason for outlier and do not

remove if likely to occur in your own data.

• Do not segment data unless you are confident there is a real difference between segments.

• Your own data is always best. Industry data is a valuable benchmark reference and can provide data until you build up your own repository.

• Measure what you do and improve

Page 31: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Considerations for a successful Productivity Management Program

Senior Management commitment is vital. Productivity measurements will not be popular with project teams and the Business Managers may question the value.

Ensure productivity performance is a key measure at the project level

Sufficient resource must be allowed for records management.

It is necessary to implement a repository data base to assist managing productivity data

To achieve required consistency and accuracy of FP counts a review and validation process is required

Page 32: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Considerations for a successful Productivity Management Program

Discipline in time recording is vital. While the first priority is to ensure that every billable hour is recorded, also need to ensure FP and non-FP effort can be separated.

Strong change management disciplines need to be in place as early as possible. Even time and materials work needs fixed price level control to ensure FP credit is obtained for change and rework.

Productivity data should not be used to measure individual staff performance

Always use trained and certified Function Point counters

Page 33: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

Considerations for a successful Productivity Management Program

Determine if commitments should be against a single average productivity figure across the entire contract or at a more granular level, by type of work for example

Determine how improvements will be measured:

Against a baseline. This can become somewhat meaningless as time passes as the type of work being performed can differ markedly from that in the baseline period. Need to track work profile changing over time so does not mask real productivity improvements

Year on Year. Helps address the above issue but can cause problems if over achieve in one period thus increasing the bar for the next.

It is critical to ensure adequate resources are available to enable implementation of the necessary process, training and systems to manage the measurement regime.

Page 34: Beyond the Statistical Average - bscea.org€¦ · Beyond the Statistical Average How trying to keep it too simple can lead to difficulty John Ogilvie Chief Executive Office International

In Conclusion

• In establishing a baseline or measuring productivity changes, more sophisticated statistical analysis than just calculating averages is necessary.

• Measuring output is necessary to manage productivity

• Please contact me at [email protected] if you have any questions

W. Edwards Deming: "In God we trust, all others must bring data!"