Français Search

BayesiaLab 4.1 : what's new

  • Data
    - Discretization with Decision Tree
    - Import and associate report
    - Data file encoding
    - Output file encoding
    - Data file path
    - Dictionaries
    - Handling blanks in dictionaries
    - Images
  • Graphs
    - Occurrences matrix
  • Node editor
    - New node editor
    - Classes
    - Properties
    - Values
    - Comment
  • Network
    - Forbidden arcs
    - Constants
  • Interface
    - New icons
    - New search tool
    - New node selection tool
    - New alignment tool
    - Not observable nodes displayed differently
    - Image on nodes
    - Image preview
    - Node comments and arc comments
    - Properties edited from node's menu
    - Reorganization of the Inference menu
  • Security
    - Proxy authentication
  • Monitors
    - Joint probability of the network and others
    - Node's color displayed in monitors
    - Time indicator
  • Learning
    - Learning optimizations
    - Completion optimizations
    - Compression rate
  • Inference
    - Missing values imputation in the associated database
    - Interactive bayesian updating
    - Interactive inference
    - Batch inference
    - Batch labeling
    - New gain analysis tool
    - Complexity reduction algorithm
    - Pearson's correlation
    - Network's skeleton
  • Settings
    - Editing
    - Database
    - Inference

Data


 




Discretization with Decision Tree
The list of the available target variables that can be used for the discretization with decision trees contains now the continuous variables for which the discretization as been defined.

Import and associate report
An importation and association report is available at the end of the loading process.


Data file encoding
It is now possible to choose the encoding in order to import file encoded with non default platform encoding (Shift_JIS, UTF-8, UTF-16, UTF-16BE, UTF-16LE, iso, ibm, windows and many others are supported). BOMs (Byte Order Masks) are supported for the UTF-8 and UTF-16 (BE and LE) encoding.

Encoding


Output file encoding
The settings allow to choose the encoding of the output files such as the saved or generated databases as well as the generated html files (such as reports and others). This encoding format can be changed in the setting panel through the "File encoding" combo box in the option "Database>Save format". An encoding indicator is also added into the meta of the html files.

Encodig settings


Data file path
The path of the imported or associated data file is now kept in order to be quickly reused through the Recent database menu.


Dictionaries
Dictionaries of costs, classes and modality values have been added.

  • a cost can be associated to each node by associating a number to the name of the node (or nothing if the node must be not observable)
  • one or more classes can be associated to one or more nodes
  • numeric values can be associated to the modalities of the nodes by associating a number to the name of the node following by a dot and the name of the modality.

Dictionaries
The dictionaries can be also exported.


Export dictionaries




Handling blanks in dictionaries
If the name of a node or a modality contains a blank, it must be replaced by an underscore in the dictionary file. For example, if you want to link a cost (10) to a node named "Node 1", the dictionary file must contain the line:
Node_1 10


Images
All the images associated to the network such as background image and node images are now included in the network xbl file.

 

Graphs



Occurrences matrix
The results can be displayed along four modes:

  • absolute,
  • percent,
  • line percent,
  • column percent.

Matrix settings

Matrix results

 


 

Node editor




New node editor
The node editor has a new interface. All the properties of a node can be directly edited by selecting the convenient thumbnail.

Node editor




Classes
A new mechanism allowing to create classes of nodes has been added. A class is a subset of nodes with a given name. A node can belong to several classes at the same time. It is very useful to manage properties shared by several nodes (costs, temporal indices, colors, images,...) and to create arc constraints between nodes.
The classes are managed with the node editor:


Class editor




Properties
Color, image, temporal index and cost can be edited in this panel. It is also possible to propagate a property to the nodes that belong to the same classe.

Properties editor




Values
When there is at least a node with associated values, the expected total value of the network and the mean value of the nodes having associated values are displayed below the joint probability.

The values are used quite like Utility nodes. Indeed, an expected numerical value can be obtained by associating an Utility node to each node, except that the modalities without values can't be represented with this kind of node. Thus, these values are used to evaluate the network, to measure the impact of such lever on the quality of the network. However, unlike Utility nodes, these values are not taken in account during the action policies learning.

This is these values that are used to compute the Pearson's linear correlation coefficient.

 

Values editor

Satisfaction



Comment
The comment can be edited within the node editor.

Comment editor


 

Network



Forbidden arcs
A new editor allows to create and manage constraints over the arcs. The created constraints will be taken into account by the learning algorithms.

Constraint editor


You can forbid probabilistic relation, in one or both directions, between two nodes, between a node and a class of node, between two classes of node or between a class of node and a node.

Arc constraint




Constants
A constant editor has been added. These constants will be used in the formulas that generate the conditional probability tables. A constant has a type (real, integer, boolean and string) and a value. The constants are managed by this editor:

Constant editor


To create a new constant you must choose a name that is not already used by a node or another constant, a type and a value. Once the constant is created, its value can be modified and the conditional probability tables will be regenerated according to the formulas that use the modified constants.

Add a constant


 

Interface



New icons Tool bar




New search tool
The tool that allows to search nodes and arcs takes not only into account the node nameS but also the classes that have been defined..

Search tools

Search tools

 




New node selection tool
The node selection tool available from the node contextual menu allows to select all the nodes that belong to the same classe.

Class selection




New alignment tool
The alignment tool available from the node contextual menu allows to define a layout where the selected nodes are equally distributed, horizontally or vertically.

Alignment




Not observable nodes displayed differently
The not observable nodes and their monitors are displayed now with a mauve color in order to be identified immediately.

Not observable node

Not observable node monitor

 




Image on nodes
Now, you can display an image instead of the default node representation.

Image node


The chosen image can be propagated to the nodes of the same classes if needed. The images are saved in the network file.

We can switch between standard display and images display (if there are images) with the button:


Image button




Image preview
An image preview has been added into the file chooser when the selected file is a valid image format. The dimensions of the image are also displayed.

 


 



Node comments and arc comments
The comments of the nodes and arcs can be displayed separately.

Comment buttons




Properties edited from node's menu
All node's properties can be directly edited from the contextual menu.

Properties menu




Reorganization of the Inference menu
The Inference menu has been split into two menus: the Analysis menu containing all the graphic and report analysis and the Inference menu containing the batch labeling and inference and the adaptive questionnaire.

Inference menu

Analysis menu

 


 

Security


Proxy authentication
When the automatic validation process is used, you can configure the use of a proxy for the Internet connection by giving the login and password for authentication.


 

Monitors


Joint probability of the network and others
The joint probability of the network is displayed at the top of the monitor panel. If the network has a database associated, the number of cases is also displayed. If the different modalities of the nodes have associated values, the total value of the network and its mean are displayed.

Monitor info

 




Node's color displayed in monitors
If a color is associated to a node, it will be displayed as a border in the corresponding monitor.

Monitor color




Time indicator
The time indicator is no more represented by a node but has been included as an icon at the bottom right of the network frame. Clicking on it allows to remove the use of the time in the network.

Time indicator


Learning




Learning optimizations
The EQ and SopLEQ algorithms have been completely rewritten and are much more efficient. The learning time has been reduced by 10% in general.


Completion optimizations
Switching to different completion mode is faster.
The completion methods has been improved during learning.



Compression rate
A compression rate is available in the console. This new indicator measures the data compression obtained by the network with respect to the previous network (usually, the unconnected network). This rate then not only gives an indication on the probabilistic links that are in the network, but also the strength of these links.
For example, with a database containing two binary variables that are strictly identical, the corresponding network will link these variables and describe in the conditional probability table that the value of the second variable is deterministically defined by the first variable. The compression rate will be then equal to 50%.

 

Inference



Missing values imputation in the associated database
In the Data menu, you can use the new Imputation menu that allows to perform imputation on the currently loaded database. You can choose the values according to the law or according to the maximum probability. The generated database will be saved in a file.

Imputation




Interactive bayesian updating
The interactive bayesian updating allows to use the associated database as a file of observations. This file can then be used to update the probability distribution of the nodes that have been declared as "not observable", with respect to the observations that are interactively read from this file. Whereas the probability distribution of all the unobserved nodes can be impacted by these observations, we just update those of the "not observable" nodes after each observation. This mode displays a new toolbar that allows to perform a step by step updating and also complete updating over the database:

Actualisation interactive toolbar
The button  Firstallows to come back to the first example of the database and to reset the probability distributions of the "not observable" nodes. The button Last performs an updating from the current index to the last in the database. This process can be stop while running by clicking on the red light of the Status bar. The button Next steps to the next example. The text field indicates the index of the current example. It is possible to enter an index in the field to perform updating from the current index to the new index. If the new index is lower than the current one, the probability distributions are reset and the updating goes from the index 0 to the specified one. The button Validate validates the updated conditional probability tables. The buttonStop  stops the interactive updating and reinitialize the conditional probability tables. It also removes all the observations.




Interactive inference
The interactive inference allows to use the associated database as a file of observations. This mode displays a new toolbar that allows to navigate through the different cases contained in the database:

Inference interactive ToolBar
The button  Firstallows to come back to the first example of the database, the button Last navigates to the last one. The button Previous goes to the previous example if it is possible and the button Next goes to the next one. The text field indicates the index of the current example. It is possible to enter an index in the field to go to it directly. The button Stop stops the interactive inference and removes all the observations.

At each example, the nodes are observed with the corresponding value in the database except if this value is missing or if the nodes are declared as not observable or as target node. The probability distributions of these unobserved nodes are computed and displayed in the monitors. When a node is unobserved but has a corresponding value in the database, this value is indicated in the monitor with the blue sky bar. The joint probability and the corresponding number of cases are also computed again.

In the following picture, Cancer is the target node (pink background) and is not observed. The corresponding value in the database is No (blue sky) and corresponds to the value predicted by the network (99,97%). The node TbOrCa is not observed because it is declared as not observable (mauve background) et the corresponding value in the database is False (blue sky). The node Smoking is not observed because the corresponding value is missing in the database:

Inference interactive monitors

This mode allows then to see interactively the behavior of the network and to check its validity.




Batch inference
The batch inference has been added in order to infer the probability distributions of the nodes declared as "not observable" based on the cases that are describe in a database. The batch inference process can be interrupted at any time without loosing the computed data. The already generated data are saved in the output database.


Batch labeling
The batch labeling process can be interrupted at any time without loosing the computed data. The already generated data are saved in the database. A node with "not observable" cost is not observed, even if its values are in the file. 


New gain analysis tool
The gain curve has been extended ("Performance of the network" analysis toolbox) in order to automatically analyze the expected economical gains with the evaluated model. These computations follow the definition of the unit costs corresponding to the treatment of each individual (x-axis), of unit gains corresponding to each positive answer (y-axis), and finally of a target population's size. The economical gain is then defined as the difference between the profit corresponding to the treatment of x% of the population and the profit corresponding to the treatment of the whole population. As the following screen captures shows it, the result is displayed as a curve (blue curve) and as a gradient of color (the closer we are to the yellow, the more we are close to optimality).

Gain tool
The economical parameters can be modified with the following dialog:


Gaint tool options




Complexity reduction algorithm
We have developed a new algorithm to reduce the complexity of the graph that are too connected to allow the construction of the junction tree, and then, that prevent exact inference. This algorithm incrementally simplifies the network structure until the exact inference can be performed.

Complexity reducer
After reduction, a report containing all the removed arcs is displayed.




Pearson's correlation
The association of values to nodes' modalities allows to compute R the Pearson's linear correlation coefficient between two nodes linked by an arc. If the modalities don't have associated values, default values are defined in order to compute R (from 0 to n-1 for a node with n modalities). The thickness of an arc is directly proportional to the absolute value of R, its color represents the sign of R (blue if positive and red otherwise). The exact value of the correlation for each arc is temporary displayed in the comment of the arc. The Pearson's correlation has also been added to the relationships analysis report.


Network's skeleton
 In Validation mode, the network can be displayed without the head of the arcs in order to avoid any erroneous causal analysis of the direction of the arrows. This option is activated by pressing the convenient button in the toolbar:

Skeleton

 


 

Settings


Editing
An option has been added in the Settings to choose the behavior of the software when a node has been created:
- once a node or an arc has been created, the software go back to selection mode automatically
- the software goes back to selection mode when the user do a right click.

Editing settings




Database
It is possible to set default values for some options of the importation wizard:
- completion algorithm for the missing values (static, dynamic or Structural EM),
- discretization method (equal distances or equal frequencies),
- number of intervals.

Setting database




Inference
It is possible to modify the parameter of the network complexity reducer algorithm. Based on the available RAM memory, one can increase or decrease the reduction rate. 

Setting