LBTO software startup/shutdown and troubleshooting July 18, 2006 Chris Biddick 1 cjb.


LBTO software startup/shutdown and troubleshooting

July 18, 2006

Chris Biddick

TCS servers

lbtmu01 (TO station)
lbtmu102
lbtmu103
lbtmu104
lbtmu105
lbtmu107
jet (special)

Start TCS networks

netconfig start (on all servers, but not jet)
• networkserver: subsystem manager
• rpcserver: RPC manager
• gshmserver: reflective memory manager

netconfig start passive (TO station only)
• Prevents subsystems from running on the TO station

Start MCSPU on jet

User: telescope

rpcconfig start 10.144.0.20 [passive]

./gotemp (used)
• Starts mcstemp (most recent control program) and the engineering interface program mcsDisplay

./go (not used)
• Starts mcsworking (stable control program) and the engineering interface program

Start subsystems

netconfig start [left | right] SUB
• Starts subsystem SUB on the most lightly loaded server

netconfig start [left | right] SUB on address
• Starts subsystem SUB on server ‘address’ (name or IP address)

netconfig start [left | right] SUB on local
• Starts subsystem SUB attached to the local terminal

Alternate side notation:
• left SUB is equivalent to SUBL
• right SUB is equivalent to SUBR
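
The three start forms above differ only in the optional side and the optional ‘on ...’ clause. A minimal Python sketch of assembling them follows; the helper name netconfig_start is hypothetical and not part of the TCS software, and the sketch only builds the argument list without running anything.

```python
from typing import List, Optional


def netconfig_start(sub: str, side: Optional[str] = None,
                    where: Optional[str] = None) -> List[str]:
    """Build the argument list for 'netconfig start [left | right] SUB [on WHERE]'."""
    if side not in (None, "left", "right"):
        raise ValueError("side must be 'left', 'right', or None")
    argv = ["netconfig", "start"]
    if side:
        argv.append(side)
    argv.append(sub)
    if where:
        # 'where' is a server name, an IP address, or the literal word 'local'
        argv += ["on", where]
    return argv


# Print the command lines instead of executing them
print(" ".join(netconfig_start("PSF", side="left")))
print(" ".join(netconfig_start("MCS", where="local")))
```

Passing the returned list to subprocess.call would run the command on a server where netconfig is installed.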


Start subsystems

When subsystems are started normally
• stdout is directed to /dev/null
• stderr is available (but not much use)

When started ‘on local’
• both stdout and stderr are available and attached to the controlling terminal

Current subsystems

LSS         Logging control
ECS         Enclosure control
CSQ         IIF receiver
PMC left    Primary mirror control
PSF left    Active optics control
MCS         Mount control
PCS         Pointing control
SDB         Star database

Start GUIs

GUIs are normal executables installed in a system path

ECSGUI [READ_WRITE]
ECSGUI [READ_WRITE] &

For sided subsystems add ‘left’ or ‘right’ as argument:
PSFGUI left

Current GUIs

ECSGUI [READ_WRITE]
PMCGUI {left | right}
PSFGUI {left | right}
MCSGUI (old)
mcsgui (OpenGL)
PCSGUI (old)
pcsgui (OpenGL)
presetgui (OpenGL)
WXGUI

TO station start buttons

Button       Starts
ECSGUI       TEL, ECS, ECSGUI READ_WRITE
MATGUI       CSQ, MATGUI
MCSGUI       MCS, mcsgui (OpenGL)
PCSGUI       PCS, pcsgui (OpenGL)
PMCGUI       TEL, LSS, left PMC, PMCGUI left
PRESETGUI    PCS, MCS, SDB, presetgui
PSFGUI       TEL, LSS, left PSF, PSFGUI left
TELGUI       TEL, TELGUI
WXGUI        ECS, WXGUI

Stop GUIs

Use ‘File exit’ option or title bar ‘X’


Stop subsystems

netconfig stop [left | right] SUB
• Stops subsystem SUB; times out in seven seconds

netconfig stop all
• Stops all subsystems

netconfig kill [left | right] SUB
• Sends SIGKILL (9) instead of SIGINT (2)

If a GUI is left running when SUB is stopped, the GUI should turn red
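
One way to combine the stop and kill commands above is to try the SIGINT stop first and fall back to kill only if the subsystem is still running after the seven-second timeout. The Python sketch below assumes that policy, which the slides do not prescribe. The function stop_subsystem and its is_running callback are hypothetical; how to detect a running subsystem (for example from ‘netconfig ps’) is left to the caller because the output format is not given here.

```python
import subprocess
import time
from typing import Callable, List, Optional

STOP_TIMEOUT_S = 7  # the slide states that 'netconfig stop' times out in seven seconds


def stop_subsystem(sub: str,
                   is_running: Callable[[], bool],
                   side: Optional[str] = None,
                   run: Callable[[List[str]], int] = subprocess.call,
                   sleep: Callable[[float], None] = time.sleep) -> List[List[str]]:
    """Issue 'netconfig stop'; if the subsystem survives, issue 'netconfig kill'.

    Returns the list of commands that were issued, in order.
    """
    target = ([side] if side else []) + [sub]
    issued = []

    stop_cmd = ["netconfig", "stop"] + target   # sends SIGINT (2)
    run(stop_cmd)
    issued.append(stop_cmd)

    sleep(STOP_TIMEOUT_S)                       # give the stop time to take effect
    if is_running():
        kill_cmd = ["netconfig", "kill"] + target  # sends SIGKILL (9)
        run(kill_cmd)
        issued.append(kill_cmd)
    return issued


# Dry run: nothing is executed, and the subsystem is pretended to be stuck
issued = stop_subsystem("PSF", is_running=lambda: True, side="left",
                        run=lambda cmd: 0, sleep=lambda s: None)
for cmd in issued:
    print(" ".join(cmd))
```

The run and sleep parameters exist only so the sequence can be exercised without a TCS server; with the defaults the commands are executed for real.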


TO station stop buttons

Button       Stops
TCSstop      all subsystems

Stop MCSPU on jet

Command ‘stop’ in engineering interface will stop just the interface

Command ‘stopall’ will stop both the engineering interface and mcstemp


Stop TCS networks

netconfig stop (on each server)
• Stops all TCS network processes on the local server
• You may stop TCS networks without stopping subsystems; network state is kept in files on disk

TCS state files

LBT.conf
• Overall configuration parameters, in /usr/commonsw/NetConfig

rpc_IPAddr.conf
• RPC functions and aliases on local server, in /var/state/LBT

nameprocs.conf
• Subsystems running on local server, in /var/state/LBT

LSSEventClientList.dat
• List of event callbacks on local server, in /var/state/LBT

Troubleshooting - netconfig

netconfig ps
• Shows all TCS processes on local server

netconfig -l
• Shows all known TCS subsystems on all servers

netconfig -s
• Shows all servers with subsystem count and load

Troubleshooting - rpcconfig

rpcconfig stop [address]
• Stops rpcserver on ‘address’

rpcconfig start address [cold] [passive]
• Starts rpcserver on ‘address’
• ‘cold’ skips reading state file
• ‘passive’ disables rpc health check

rpcconfig -l
• Shows number of registered functions and aliases on local server

rpcconfig -f [address]
• Lists all functions on server address

rpcconfig -a [address]
• Lists all aliases on server address

Troubleshooting - gshmconfig

gshmconfig stop
• Stops gshmserver on local server

gshmconfig start
• Starts gshmserver on local server

gshmconfig -b
• Shows reflective memory IP broadcast address on local server

gshmconfig -l
• Lists segment key, size, and all servers currently on the reflective memory network
• Repeat and check that all servers stay at same ID
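
The ‘repeat and check’ step can be sketched in Python as a comparison of successive listings. The output format of ‘gshmconfig -l’ is not given here, so parsing is left out: each snapshot is assumed to have already been reduced by hand to a server-to-ID mapping. The function name unstable_servers and the ID values are hypothetical.

```python
from typing import Dict, List


def unstable_servers(snapshots: List[Dict[str, str]]) -> List[str]:
    """Return servers whose ID changed, or that are missing, in any snapshot."""
    names = set()
    for snap in snapshots:
        names.update(snap)

    bad = set()
    for name in names:
        # A server absent from a snapshot shows up as None in this set
        ids = {snap.get(name) for snap in snapshots}
        if len(ids) != 1:
            bad.add(name)
    return sorted(bad)


# Two hand-entered snapshots with made-up IDs
first = {"lbtmu102": "3", "lbtmu103": "4", "lbtmu104": "5"}
second = {"lbtmu102": "3", "lbtmu103": "7"}
print(unstable_servers([first, second]))
```

An empty result means every server kept the same ID across all listings.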


Troubleshooting - monitor

monitor -o
• Shows computers as they join or drop off the shared memory network
• Shows restart requests

Troubleshooting - syslog

/var/log/messages on local server

‘tcslog’ will show messages with LBT designation as they occur
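
For a saved copy of the log, the same kind of filtering can be approximated offline. The Python sketch below assumes that the ‘LBT designation’ appears as the literal text LBT somewhere in the line; the slides do not state the exact format, and the sample lines are made up for illustration. The function name lbt_lines is hypothetical.

```python
from typing import Iterable, Iterator


def lbt_lines(lines: Iterable[str], tag: str = "LBT") -> Iterator[str]:
    """Yield only the lines that contain the tag, without the trailing newline."""
    for line in lines:
        if tag in line:
            yield line.rstrip("\n")


# Made-up sample lines; real entries in /var/log/messages may look different
sample = [
    "Jul 18 20:15:02 lbtmu102 LBT PSF: subsystem started\n",
    "Jul 18 20:15:03 lbtmu102 sshd[412]: session opened\n",
    "Jul 18 20:15:09 lbtmu102 LBT MCS: subsystem started\n",
]
for line in lbt_lines(sample):
    print(line)
```

To filter a real file, pass an open file object, for example open("/var/log/messages"), in place of the sample list.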


Troubleshooting - DDViewerGUI

DDViewerGUI shows hierarchical view of reflective memory on local server

Available on every server and TO station

Updates about every two seconds

Troubleshooting - Events

No LSS - XML files on local server
• /tmp/SUB.log

LSS - MySQL database
• mysql -u lss -p -h mysql LBT_log
• password lss
• select date,description from Event where (date like '%Jul 18%' and description like '%left PSF%');
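
The select statement above can also be issued from a script with the two patterns passed as parameters. The Python sketch below uses an in-memory sqlite3 database as a stand-in for the MySQL server so that it runs anywhere. Only the table name Event and the columns date and description come from the query above; the sample rows, the text date format, and the function name find_events are assumptions.

```python
import sqlite3


def find_events(conn, date_part: str, text_part: str):
    """Run the slide's query with the two LIKE patterns supplied as parameters."""
    sql = ("select date, description from Event "
           "where (date like ? and description like ?)")
    cur = conn.execute(sql, ("%" + date_part + "%", "%" + text_part + "%"))
    return cur.fetchall()


# Stand-in database with made-up rows
conn = sqlite3.connect(":memory:")
conn.execute("create table Event (date text, description text)")
conn.executemany("insert into Event values (?, ?)", [
    ("Tue Jul 18 20:15:02 2006", "left PSF started"),
    ("Tue Jul 18 20:16:40 2006", "MCS started"),
    ("Mon Jul 17 19:02:11 2006", "left PSF stopped"),
])

for date, description in find_events(conn, "Jul 18", "left PSF"):
    print(date, description)
```

Against the real database a MySQL driver would replace sqlite3, and most such drivers use %s rather than ? as the parameter placeholder.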


Mountain subsystem debugging

Use your account on a server
Get current CVS tree
Modify subsystem code
Build and install (may require removal of existing package)
netconfig stop SUB
netconfig start SUB on server

Mountain GUI debugging

Use your account on a server or TO station
Get current CVS tree
Modify GUI code
Build GUI (install not needed)
Run GUI directly