Safety and security in mission critical IoT systems

37
Safety and security in Mission Critical IoT Systems - Supporting human decision makers in dynamic environments Einar Landre

Transcript of Safety and security in mission critical IoT systems

Page 1: Safety and security in mission critical IoT systems

Safety and security in Mission Critical IoT Systems- Supporting human decision makers in dynamic environments

Einar Landre

Page 2: Safety and security in mission critical IoT systems

Statoil

Page 3: Safety and security in mission critical IoT systems

Motivation

Page 4: Safety and security in mission critical IoT systems

Failed Safety Critical Decisions

- Situational awareness- Trustworthiness- Culture- Decision quality

Page 5: Safety and security in mission critical IoT systems

Human brain - planets most sophisticatedand vulnerable decision maker

the weakest point

• Emotions trumps facts (irrationality)

• Limited processing capacity

• Need to rest, easily bored

• Inconsistency across exemplars

• Creative, easily distracted

• Values (ethics and morale)

• Mental illness

How to compensate?

Page 6: Safety and security in mission critical IoT systems

Things

Page 7: Safety and security in mission critical IoT systems

Troll A, 472 meters, the largest man made “thing” ever moved

Software was an alien concept

things anno 1995

Page 8: Safety and security in mission critical IoT systems

things anno 2015

Asgard subsea compression runs on software

Size = a football field

Page 9: Safety and security in mission critical IoT systems

things anno 2025

Internet of critical things

Page 10: Safety and security in mission critical IoT systems

critical thingsThings or networks of things where failure could lead to an accident

- Pressure vessels- Oil & Gas wells- Boilers

- Industrial Instrumentation & Control - Emergency shutdown - Fire and gas leak detection

- Life support devices- Pacemakers- Infusion pumps

form critical systems

Page 11: Safety and security in mission critical IoT systems

system criticalityNon - Critical

Useful system- Low dependability

- System does not need to be trusted

Business - Critical Mission - Critical Safety - Critical

High Availability- Focus on cost s of

failure caused by system downtime, cost of spares, repair equipment and personnel and warranty claims

High Reliability- Increase the

probability of failure free system operation over a specified time in a given environment for a given purpose

High Safety & Integrity Level- High reliability

- High availability

- High security

- Focus is not on cost, but on preserving life and nature

Page 12: Safety and security in mission critical IoT systems

Case Study

Page 13: Safety and security in mission critical IoT systems

Drill string

Drilling Control System

Weight on Bit

Rotation Mud Circulation

Manual Control- Interpret data- Perform tasks

A manually controlled process

drilling

Page 14: Safety and security in mission critical IoT systems

• I have to make frequent decisions and many of them depend upon readings from sensors that can be correct, noisy, random, unavailable, or in some other state.

• The decisions I have to make often have safety consequences, they certainly have economic consequences, and some are irreversible.

• At any point in time there may be three or four actions I could take based on my sense of what’s happening on the rig

• I would like better support to determine how trustworthy my readings are, what the possible situations are and the consequences of each action.

What is the best actionto take?

enhance human decision making

Page 15: Safety and security in mission critical IoT systems

systems of action

• Can sense or observe a phenomena, process or machine

• Process observations and search for anomalies, undesired statechanges and other deviations that must be dealt with.

• Plan and execute / (recommend execution of) actions to bring the observedphenomena, process or machine back to its desired operational state.

• Monitor effects of actions and re-plan if action did not have intended effecton process state

Computer systems that

making better decisions under stress and uncertainty

Page 16: Safety and security in mission critical IoT systems

“Drillers Buddy”

Real-time data

Manual Control

Recommend actions incontext of process state

add active computer support

Drill string

Drilling Control System

Weight on Bit

Rotation Mud Circulation

Page 17: Safety and security in mission critical IoT systems

Drillers Buddy

State & Events

Drilling Simulator• Hydraulic model• Mechanical model• Temperature model

Drilling Advisor• Uncertainty model• Causality model• Reasoning• Planning model

Drilling Control System

Real-Time DataActions

technical building blocks

Action to be executed by human, but concept opens up for more computer control in the future.

i.e. Drilling advisor can be turned into “synthetic driller”.

Historical Data

Page 18: Safety and security in mission critical IoT systems

What is the best action to take for the business?

What is the best action to take for control or safety?

What is the process state and where is it heading?

What do we know for certain and what are we estimating?

What are we measuring directly, with what accuracy?

What can we infer about performance and changes in the physical system?

Local Action Optimization

Situational Awareness

Uncertainty and Validation

Physical System Behavior

Physical System Sensing

Global Action Optimization

Incr

easin

gly

Acti

onab

le In

form

ation

expressed in capabilities

Page 19: Safety and security in mission critical IoT systems

Local Action Optimization

Situational Awareness

Uncertainty and Validation

Physical System Behavior

Physical System Sensing

Global Action Optimization

Machine learning

(Bayesian)+

Physics(Cyb)

Decision /

game theory

Automated planning

and schedulin

g

Rational agent

• has goals• models uncertainty• chooses action with

optimal expected outcome for itself

• Examples: − human (on a good

day)− intelligent software

agent

more sophisticated technology

Sensors

Page 20: Safety and security in mission critical IoT systems

solution creates new challenges

What parts are safety critical?

What parts are only business critical?

How to assess and protect against cyber threats?

How does failure in non-safety part influence safety and security?

What dependencies do we have?

Industry become software dependent

How to design software that tackles mechanical failures?

Page 21: Safety and security in mission critical IoT systems

21 2014-04-24

how to build trustworthy software?

Page 22: Safety and security in mission critical IoT systems

Software

Page 23: Safety and security in mission critical IoT systems

before softwareTangible control logic

• Design level

• Implementation level

• Verification & test level

No cyber threats

• Intrusion

• Viruses

• Theft

• Identity

Page 24: Safety and security in mission critical IoT systems

two unique propertiesInspection & Test • Software can’t be inspected and

tested as analogous components

CPU – the single point of failure• All signals are threaded through the

one single element.

• Execution sequence is un-known

• Same defect is systemized acrossmultiple instances

Impacts how we must manage software for critical systems

Page 25: Safety and security in mission critical IoT systems

some specific challengesCommon mode failure

Malware, Viruses and Hacking

Human Factors

Blurred boundaries

Page 26: Safety and security in mission critical IoT systems

common mode failure“results from an event which because of dependencies causes a coincidence of failure states of components in two or more separate channels of a redundancy system, leading to the defined systems failing to perform its intended function”.

Ariane 5 test launch, 1996

Page 27: Safety and security in mission critical IoT systems

malware, viruses and hacking

Motivated by financial, political, criminal or idealistic interests

Software created to cause harm• Change of system behaviour• Steal / destroy data or machines

Exploits weaknesses in• Human character• Technical designs

Horror stories:• Stuxnet and the Iranian centrifuges (Siemens control system)• Saudi Aramco hack of 35000 computers (Windows back office)

Page 28: Safety and security in mission critical IoT systems

human factors

How to minimize the effects of human error?

Mistakes occur everywhere• Specification• Design• Implementation• Deployment• Operations

Humans make mistakes• By commission • By omission• By carelessness

Page 29: Safety and security in mission critical IoT systems

blurred boundaries

Conflicting interests, divergent situational understanding acrossdisciplines and roles.

Architects thinks and designs in terms of hierarchy and layering

Programmers thinks and designs in terms of threads of execution

Users need systems that works and solves a real world problems

Operations needs to get the job done

Page 30: Safety and security in mission critical IoT systems

Tools

Page 31: Safety and security in mission critical IoT systems

systems engineeringArchitecture centric• Design• Implementation• Deployment• Usage

Risk based• Requirements• Design• Implementation• Commissioning• Usage

Forging “design thinking” with “high-integrity systems” practices

Page 32: Safety and security in mission critical IoT systems

architecture

Separation and protection of critical functions

Local Action Optimization

Situational Awareness

Uncertainty and Validation

Physical System Behavior

Physical System Sensing

Global Action Optimization

Page 33: Safety and security in mission critical IoT systems

standards

IEC 61508 Functional safety of safety instrumented systems for the process industry sector

IEC 61511 Safety instrumented systems for the process industry sector

DO-178C Software considerations in airborne systems and equipment certification

The good thing about standards is that there are so many to choose fromAndrew S. Tanenbaum

Not sufficient on their own

Represents insights

Must be tailored to be useful

Page 34: Safety and security in mission critical IoT systems

evidence based safety & security

Thanks to professor Tim Kelly @ University of York

Page 35: Safety and security in mission critical IoT systems

Summary

Page 36: Safety and security in mission critical IoT systems

summaryThings run on software

Critical things form critical / high-integrity systems

Cognitive functions make software inherent complicated

Holistic, architecture centric Systems Engineering

Software is used to offload and support human operators2nd and 3d order failure effects must be addressed upfront

Forging design thinking with high-integrity systems practices

Page 37: Safety and security in mission critical IoT systems

Safety and security in mission critical IoT systems

Einar LandreLead AnalystE-mail [email protected]: +4741470537

www.statoil.com

Thank you