analyzer

102
H S C analyzer Visual Documentation v1.0.4.1

description

C. H. S. analyzer. Visual Documentation v1.0.4.1. User Interface. Active class (for selection and some processes). Number of wells associated with the active class (current plate only). Current activated plate. Process pipeline. Wells coordinate. Currently activated descriptor. - PowerPoint PPT Presentation

Transcript of analyzer

Page 1: analyzer

H SCanalyzer

Visual Documentationv1.0.4.1

Page 2: analyzer

User Interface

Page 3: analyzer

Active class(for selection and some processes)

Page 4: analyzer

Number of wells associated with the active class(current plate only)

Page 5: analyzer

Current activated plate

Page 6: analyzer

Process pipeline

Page 7: analyzer

Wells coordinate

Page 8: analyzer

Currently activateddescriptor

Page 9: analyzer

Current plate readouts display corresponding to the selected

descriptor

Page 10: analyzer

Look up tablerelated to the current plate and the current

descriptor

Page 11: analyzer

Available and activated descriptorsWhen inactivated, the descriptors

will not be taken into account for any process.

Page 12: analyzer

Zoom in and Zoom out(can also be performed with the mouse

wheel)

Page 13: analyzer

When activatedonly class colors are

displayed

Page 14: analyzer

When checkedclass selection is performed on all

the currently activated plates

Page 15: analyzer

When pressed, all the wells will turned into the currently selected

class (including non activated wells)

Page 16: analyzer

When pressed, the wells will turned into the currently selected class

(only activated wells are concerned)

Page 17: analyzer

When the mouse cursor isplaced over a well, a list of

information is displayed. This includes: Well position, name (if so),

list of all the activated descriptors with their corresponding values and

the class

Page 18: analyzer

Contextual menu: Histogram(right click on a well)

Average and standarddeviation of the displayedhistogram

Current value of the selected well

Histogram of the selected histogram within the active

class over the entire list of active plates.

(The histogram color is related to the class)

Page 19: analyzer

Representation of the distribution (if available) of the current well selected

descriptor

Display (if loaded) Name, Information , Locus ID and

Concentration associated to the current well (Info and

Name are editable)

Save new Info and Name

Contextual menu: Info (right click over a well)

Page 20: analyzer

Contextual menu: Kegg (right click over a well)Locus ID is required for this option

Current value of the selected well

Page 21: analyzer

Options

Page 22: analyzer
Page 23: analyzer
Page 24: analyzer

Note: display distribution option

requires the have loaded a distributions

based screen

Page 25: analyzer
Page 26: analyzer
Page 27: analyzer
Page 28: analyzer

File format: scalar format

Page 29: analyzer

The first row of the .csv data table should contain the name of every parameter. The order is not important has it can be change during the loading process. However, a column for the plate name, the well position and at least one

descriptor (double format) are mandatory.

MS

Exce

l vie

wN

otep

ad v

iew

Page 30: analyzer

Then, down below, the values are listed: double format for the descriptors and concentration, integer for the locus ID and class (<=10), and string for the others. If a descriptor value is missing or a parameter not formatted in the correct

format, the well will be rejected from the screening.

MS

Exce

l vie

wN

otep

ad v

iew

Page 31: analyzer

File format: histogram format

Page 32: analyzer

A general directory containing every plateof the screening has to be created

Every plate is defined by its own sub-directory. The name of the directory will be used as the plate name.

Page 33: analyzer

A general directory containing every plateof the screening has to be created

Each plate directory should contain a .txt file for every well.The file name is defined as follow:

“MxN.txt”

Where M and N are respectively the column and row position (1 indexed)

Page 34: analyzer

For each well, every row describes a descriptor histogram: 1st column is the name, then the remaining columns contain the histogram values (tab-separated).

Note: consistency between each .txt file is required.

MS Excel view

Importing such data can be operated trough the File->Load Histogram Based Screen menu.

Page 35: analyzer

Importing Data

Page 36: analyzer
Page 37: analyzer

Plate dimensions

Page 38: analyzer

Readouts preview

Page 39: analyzer

Names that be associatedwith the data

Page 40: analyzer

If unchecked,the corresponding data

will not be loaded

Page 41: analyzer

Data type.Mandatory: Plate Name,well position and at least

one descriptor.Note: Locus ID should be

defined as an integer.

Page 42: analyzer

Generating artificialunivariate screening data

Page 43: analyzer
Page 44: analyzer
Page 45: analyzer
Page 46: analyzer

If an effect X is selected, it will be combined to original data by the following way:

Resulting Data = Original Data * (X + Shift_X)

Page 47: analyzer

The higher Shift_X is, the lower X contribution to the generated data

Page 48: analyzer
Page 49: analyzer
Page 50: analyzer

If a parameter X is checked, its value will vary from, increasing with a stepdefined in the options window.

Page 51: analyzer

Generating artificialmultivariate screening data

Page 52: analyzer
Page 53: analyzer

The dimension correspondsto the number of descriptors

Page 54: analyzer

Up to 10 different clouds canbe generated

Page 55: analyzer

Their positions within the plate are defined bycolumn

Page 56: analyzer

Each cloud (phenotype) is modeled by a Gaussian distribution defined by its means in every dimension

Page 57: analyzer

As well as its variances.

Page 58: analyzer

Dimensionality reduction

Page 59: analyzer

Information windows:provide feedback about the process

as well as information and useful links

Page 60: analyzer

Desired new dimension

Page 61: analyzer

Feature selection approaches:Unsupervised: use all the activated wells of the activated plates to reduce the dimension Supervised: keep the descriptors required tocluster the classes (but the neutral class)

Page 62: analyzer

Perform the dimensionality reduction

The descriptors selectionis then automatically adjusted

Page 63: analyzer

Descriptors management

Page 64: analyzer

You can split a distribution and generate a descriptor per bin

Page 65: analyzer
Page 66: analyzer
Page 67: analyzer

Quality controls

Page 68: analyzer

Display the sorted Z-factors of the current plate for each

selected descriptor

Display the sorted Z-factors of the selected descriptor for all the active plates

Page 69: analyzer

Display the normal probability plotof the current selected descriptor values

among the set of points of theselected class.

Note: operate the same wayfor the entire screen.

Page 70: analyzer

Display the correlationmatrix of the active descriptors

And the ranked corresponding histogram (optional)

Page 71: analyzer

Identify the potentialsystematic errors for

each selected plates on each active descriptor

Page 72: analyzer

Systematic errors identification parameters can be changed by the

options window

Double clickon a row

display the corresponding

readouts

Page 73: analyzer

Plate by plate, and descriptor by descriptor, correction procedurecan be performed using the dedicated function

Page 74: analyzer

Plates can be rejectedregarding a chosen parameter

(here Z-factor on the current descriptor)

before

after

Page 75: analyzer

Evolution (average and standard deviation)of the current descriptorwithin a defined classcan be monitored to identifyabnormalities

Page 76: analyzer

Normalization

Page 77: analyzer

Various approachesfor data normalization

of the screening dataare available

(controls based or not)

before

after

Page 78: analyzer

Clustering and classification

Page 79: analyzer

The user has to define the desired

cluster number.

before

after

Page 80: analyzer

Some clustering methodsallow automated evaluation of the

optimum number of clusters.Note: if more than 10 classes are

detected, the clustering is not operated.

Originalreadouts

Clustering (here:3 classes have identified)

Page 81: analyzer

This class will not be used for the learning, but will be classified.

The user has to define the classes for the training.

Plate by plate: the training step is performed on the each plate

independently.

Page 82: analyzer

If the classification operated is tree based (C4.5) a tree diagram is affected for each plate.

Page 83: analyzer

Hierarchical Tree visualization

Page 84: analyzer

Class color

Note: this operation can be time consuming for large number of data

Well position

Page 85: analyzer
Page 86: analyzer

Pathways analysis

Page 87: analyzer

Recurrence of each specific pathway

Locus IDs have to be associated to each wellNote: this operation

requires internet connection

Page 88: analyzer

Export

Page 89: analyzer

At this stage, names can be modified

Descriptors are checked regarding the selectionbut can be modified for

the export purpose

Page 90: analyzer
Page 91: analyzer
Page 92: analyzer
Page 93: analyzer

Fluo1

Size

Volume

Fluo1 Size Volume

Pearson correlation matrix.

Page 94: analyzer

Weka

HCS Analyzer

Page 95: analyzer

Core Development

Page 96: analyzer

Unpack and launch the solution

Page 97: analyzer

Look, develop, modify, debug, test

Launch and have fun

Page 98: analyzer

Plugins Development

Page 99: analyzer

Add a new HCSAnalyzer Plugin project within the global solution

Note: this template is available on our website, and has to be copied in your dedicated visual studio directory

Page 100: analyzer

Add the two following references to your project

Page 101: analyzer

Add your HCS Analyzer plugins directory in thePost-build event command line

Page 102: analyzer

Add information about your plugin(menu, etc.)

Develop your plugin and do not forget to share it