Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module...

20
Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li-Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang, Department of Electrical Engineering National Cheng Kung University, Tainan, Taiwan 10th IAA Symposium on Small Satellites for Earth Observation

Transcript of Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module...

Page 1: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Implementation of the Fault Tolerance Module in PHOENIX CubeSat

Author: Li-Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Department of Electrical Engineering National Cheng Kung University, Tainan, Taiwan

10th IAA Symposium on Small Satellites for Earth Observation

Page 2: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Outline

• Introduction • Phoenix Nanosatellite • Flight Software Design for Phoenix • FDIR Module • Conclusion

2015/04/21 IAA 2015 2

Page 3: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Introduction • A CubeSat is a type of Nanosatellite (10x10x10 cm,

weight 1 kg) which offers all the standard functions of a normal satellite.

2015/04/21 IAA 2015 3

• Command and Data Handling System(C&DH):

- Schedule Mission Tasks - Payload Operation and Data Handling - House-Keeping - Ground Command Handling

Phoenix, 2U CubeSat

Page 4: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

QB50 Mission

2015/04/21 IAA 2015 4

An international network of 50 CubeSats for multi-point, in-situ, long duration measurements and in-orbit demonstration in the lower thermosphere.

Images from VKI

Page 5: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Phoenix Nanosatellite

2015/04/21 IAA 2015 5

INMS Science Unit

ADCS

C&DH

EPS

TT&C

Solar EUV Sensors

Solar EUV Board

Payloads: 1. Ion and Neutral Mass Spectrometer (INMS) Payloads: 2. Solar EUV Sensors

Page 6: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

C&DH Module

2015/04/21 IAA 2015 6

Images from GOMSpace

NanoMind A712D is a space proven COTS manufactured by GomSpace

• Main Features: - 32-bit ARM7 RISC CPU - Compatible with FreeRTOS - 2GB MicroSD card - I2C interface - USART interfaces - GPIO pins - Real Time Clock

Page 7: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Flight Software Feature

2015/04/21 IAA 2015 7

Page 8: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Failure Mode Analysis

2015/04/21 IAA 2015 8

• Hardware Damaged • Single Event Upset(SEU) • Single Event Latch-Up (SEL)

Hardware Issue

• Parameters changed by SEU • Task Execution Failure • Execute illegal Ground commands

Software Issue

• Set improper configuration on the CubeSat. • Send improper commands…,etc.

Human Factor

Page 9: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Failure Mode Analysis

2015/04/21 IAA 2015 9

• Hardware Damaged • Single Event Upset(SEU) • Single Event Latch-Up (SEL)

Hardware Issue

• Parameters changed by SEU • Task Execution Failure • Execute illegal Ground commands

Software Issue

• Set improper configuration on the CubeSat. • Send unauthorized commands…,etc.

Human Factor

Software Isolation

Power Reset

Acceptance Check

Define the margin for each parameter

Authority Check

Page 10: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

FDIR Implementation

2015/04/21 IAA 2015 10

Data Redundancy

Watch Dog Timer

Software Supervisor

Page 11: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

FDIR Module Implementation

2015/04/21 IAA 2015 11

Data Redundancy

• To against Single Event Upset(SEU) - Cyclic Redundancy Check(CRC) algorithm in file-system - One additional backup copy

Page 12: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

FDIR Module Implementation

2015/04/21 IAA 2015 12

Watch Dog Timer

• As the final defense • Against communication latch-up - Should be reset by software in a period - If somehow no one reset it, power reset the satellite.

Page 13: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

FDIR Module Implementation

2015/04/21 IAA 2015 13

Software Supervisor

• To against single events & Task Failures -Routinely scan important parameters -Reset watch dog timer -Supervise target tasks -Recover failure according to library -Generate error report -Library can be updated by uploading scripts

Page 14: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Supervisor Operation Flow

2015/04/21 IAA 2015 14

Activate by Task Scheduler

Work Tasks Supervisor

Tasks

Activate by Task Scheduler

(option)

Page 15: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Supervisor Operation Flow

2015/04/21 IAA 2015 15

Work Tasks Supervisor

Tasks

• Stage 1 • Stage 2 • Stage 3 • Stage 4 • Stage 5

LOOP

Message Queue

• Check Stage 1 • Check Stage 2 • Check Stage 3 • Check Stage 4 • Check Stage 5

LOOP

Send S1 status

Get S1 status

Page 16: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Supervisor Operation Flow

2015/04/21 IAA 2015 16

Work Tasks Supervisor

Tasks

• Stage 1 • Stage 2 • Stage 3 • Stage 4 • Stage 5

LOOP

Message Queue

• Check Stage 1 • Check Stage 2 • Check Stage 3 • Check Stage 4 • Check Stage 5

LOOP If every thing

goes well

Page 17: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Supervisor Operation Flow

2015/04/21 IAA 2015 17

Supervisor Tasks

• Check Stage 1 • Check Stage 2 • Check Stage 3 • Check Stage 4 • Check Stage 5

LOOP

Get error code

Hey, something is wrong!

Page 18: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Error Recovery Script

• Library can be updated by uploading scripts

2015/04/21 IAA 2015 18

Task ID Stage Error code Solution ID Solution command Checksum

Work Task ID

The failure stage

Describe the error type

The steps of solutions

Ex: Reboot a Subsystem, Configure Parameters, Power control

CRC16

Page 19: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

Conclusion • Three FDIR method are discussed here: Data Redundancy, Watch Dog Timer and Software Supervisor. • SEU can be detected and corrected by Data

Redundancy and Software Supervisor • Watch Dog Timer is the final defense. • Solution library provides flexibility when dealing with

failures.

2015/04/21 IAA 2015 19

Page 20: Implementation of the Fault Tolerance Module in …...Implementation of the Fault Tolerance Module in PHOENIX CubeSat Author: Li -Wei Chen, Te-Chuan Huang Advisor: Jyh-Ching Juang,

THANK YOU FOR YOUR ATTENTION

[email protected] satellite.ncku.edu.tw

10th IAA Symposium on Small Satellites for Earth Observation