5.4 Model Detail Views
The GET_*
interfaces are replaced by model views, and Oracle recommends that users leverage the views instead.
The following are the new model views:
Association:
Classification, Regression, and Anomaly Detection:
Clustering:
Feature Extraction:
Feature Selection:
Data Preparation and Other:
5.4.1 Model Detail Views for Association Rules
Model detail views for Association Rules describes the rule view for Association Rules. Oracle recommends that users leverage the model details views instead of the GET_ASSOCIATION_RULES
function.
The rule view DM$VR
model_name describes the generated rules for Association Rules. Depending on the settings of the model, the rule has different set of columns. The following views are displayed when different Global settings are applied without aggregates for transactional and 2–Dimensional inputs.
Transactional Input Without ASSO_AGGREGATES Setting
ODMS_ITEM_ID_COLUMN_NAME
is set and ITEM_VALUE
(ODMS_ITEM_VALUE_COLUMN_NAME
) is not set, the following is the transactional view:Name Type
----------------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
RULE_ID NUMBER
RULE_SUPPORT NUMBER
RULE_CONFIDENCE NUMBER
RULE_LIFT NUMBER
RULE_REVCONFIDENCE NUMBER
ANTECEDENT_SUPPORT NUMBER
NUMBER_OF_ITEMS NUMBER
CONSEQUENT_SUPPORT NUMBER
CONSEQUENT_NAME VARCHAR2(4000)
ANTECEDENT SYS.XMLTYPE
Table 5-11 Rule View Columns for Transactional Inputs
Column Name | Description |
---|---|
|
A partition in a partitioned model to retrieve details |
|
Name or identifier of the target |
|
The number of transactions that satisfy the rule. |
|
The likelihood of a transaction satisfying the rule. |
|
The degree of improvement in the prediction over random chance when the rule is satisfied. |
|
The number of transactions in which the rule occurs divided by the number of transactions in which the consequent occurs. |
|
The ratio of the number of transactions that satisfy the antecedent to the total number of transactions. |
|
The total number of attributes referenced in the antecedent and consequent of the rule. |
|
The ratio of the number of transactions that satisfy the consequent to the total number of transactions. |
|
Name of the consequent |
|
For two-dimensional inputs, For two-dimensional inputs, when The view uses three columns for consequent. The rule view has the following columns:
Note: All the types for three parts areVARCHAR2 . This column is not applicable when ASSO_AGGREGATES is set.
|
|
Value of the consequent when In the following view, the
When In the following view, the
|
|
The independent condition in the rule. When this condition exists, the dependent condition in the consequent also exists. The condition is a combination of attribute values called a predicate (
Note: The occurrence of the attribute as aDM_PREDICATE indicates the presence of the item in the transaction. The actual value for attribute_num_value or attribute_str_value is meaningless. For example, the following predicate indicates that 'Mouse Pad' is present in the transaction even though the attribute value is NULL.
|
Transactional Input With ASSO_AGGREGATES Setting
-
Rule view when
ODMS_ITEM_ID_COLUMN_NAME
is set andItem_value
(ODMS_ITEM_VALUE_COLUMN_NAME
) is not set. -
Rule view when
ODMS_ITEM_ID_COLUMN_NAME
is set andItem_value
(ODMS_ITEM_VALUE_COLUMN_NAME
) is set withTYPE
as numerical, the view has aCONSEQUENT_VALUE
column. -
Rule view when
ODMS_ITEM_ID_COLUMN_NAME
is set andItem_value
(ODMS_ITEM_VALUE_COLUMN_NAME
) is set withTYPE
as categorical, the view has aCONSEQUENT_VALUE
column.
ASSO_AGGREGATES
has columns for the aggregates output (four columns per aggregate). The 2–Dimensional input does not allow aggregates setting.
Example 5-13 Examples
The following example shows profit and sales set to be aggregated:
Name Type
----------------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
RULE_ID NUMBER
RULE_SUPPORT NUMBER
RULE_CONFIDENCE NUMBER
RULE_LIFT NUMBER
RULE_REVCONFIDENCE NUMBER
ANTECEDENT_SUPPORT NUMBER
NUMBER_OF_ITEMS NUMBER
CONSEQUENT_SUPPORT NUMBER
CONSEQUENT_NAME VARCHAR2(4000)
ANTECEDENT SYS.XMLTYPE
ANT_RULE_PROFIT BINARY_DOUBLE
CON_RULE_PROFIT BINARY_DOUBLE
ANT_PROFIT BINARY_DOUBLE
CON_PROFIT BINARY_DOUBLE
ANT_RULE_SALES BINARY_DOUBLE
CON_RULE_SALES BINARY_DOUBLE
ANT_SALES BINARY_DOUBLE
CON_SALES BINARY_DOUBLE
ODMS_ITEM_ID_COLUMN_NAME
is set and Item_value
(ODMS_ITEM_VALUE_COLUMN_NAME
) is set with TYPE
as numerical, the view has a CONSEQUENT_VALUE
column.Name Null? Type
----------------------------------------- -------- ----------------------------
PARTITION_NAME VARCHAR2(128)
RULE_ID NUMBER
RULE_SUPPORT NUMBER
RULE_CONFIDENCE NUMBER
RULE_LIFT NUMBER
RULE_REVCONFIDENCE NUMBER
ANTECEDENT_SUPPORT NUMBER
NUMBER_OF_ITEMS NUMBER
CONSEQUENT_SUPPORT NUMBER
CONSEQUENT_NAME VARCHAR2(4000)
CONSEQUENT_VALUE NUMBER
ANTECEDENT SYS.XMLTYPE
ANT_RULE_PROFIT BINARY_DOUBLE
CON_RULE_PROFIT BINARY_DOUBLE
ANT_PROFIT BINARY_DOUBLE
CON_PROFIT BINARY_DOUBLE
ANT_RULE_SALES BINARY_DOUBLE
CON_RULE_SALES BINARY_DOUBLE
ANT_SALES BINARY_DOUBLE
CON_SALES BINARY_DOUBLE
Rule view when ODMS_ITEM_ID_COLUMN_NAME
is set and Item_value
(ODMS_ITEM_VALUE_COLUMN_NAME
) is set with TYPE
as categorical, the view has a CONSEQUENT_VALUE
column.
Name Null? Type
----------------------------------------- -------- ----------------------------
PARTITION_NAME VARCHAR2(128)
RULE_ID NUMBER
RULE_SUPPORT NUMBER
RULE_CONFIDENCE NUMBER
RULE_LIFT NUMBER
RULE_REVCONFIDENCE NUMBER
ANTECEDENT_SUPPORT NUMBER
NUMBER_OF_ITEMS NUMBER
CONSEQUENT_SUPPORT NUMBER
CONSEQUENT_NAME VARCHAR2(4000)
CONSEQUENT_VALUE VARCHAR2(4000)
ANTECEDENT SYS.XMLTYPE
ANT_RULE_PROFIT BINARY_DOUBLE
CON_RULE_PROFIT BINARY_DOUBLE
ANT_PROFIT BINARY_DOUBLE
CON_PROFIT BINARY_DOUBLE
ANT_RULE_SALES BINARY_DOUBLE
CON_RULE_SALES BINARY_DOUBLE
ANT_SALES BINARY_DOUBLE
CON_SALES BINARY_DOUBLE
Global Detail for Association Rules
A single global detail is produced by an Association model. The following table describes a global detail returned for Association Rules model.
Table 5-12 Global Detail for Association Rules
Name | Description |
---|---|
|
The number of itemsets generated |
|
The maximum support |
|
The total number of rows used in the build |
|
The number of association rules in the model generated |
|
The number of the transactions in input data |
5.4.2 Model Detail View for Frequent Itemsets
Model detail view for Frequent Itemsets describes the frequent itemsets view. Oracle recommends that you leverage model details view instead of the GET_FREQUENT_ITEMSETS
function.
The frequent itemsets view DM$VI
model_name has the following schema:
Name Type
------------- ------------------
PARTITION_NAME VARCHAR2 (128)
ITEMSET_ID NUMBER
SUPPORT NUMBER
NUMBER_OF_ITEMS NUMBER
ITEMSET SYS.XMLTYPE
Table 5-13 Frequent Itemsets View
Column Name | Description |
---|---|
|
A partition in a partitioned model |
|
Itemset identifier |
|
Support of the itemset |
|
Number of items in the itemset |
|
Frequent itemset The structure of the |
5.4.3 Model Detail View for Transactional Itemsets
Model detail view for Transactional Itemsets describes the transactional itemsets view. Oracle recommends that users leverage the model details views.
For the very common case of transactional data without aggregates, DM$VT
model_name view provides the itemsets information in transactional format. This view can help improve performance for some queries as compared to the view with the XML column. The transactional itemsets view has the following schema:
Name Type
----------------- -----------------
PARTITION_NAME VARCHAR2(128)
ITEMSET_ID NUMBER
ITEM_ID NUMBER
SUPPORT NUMBER
NUMBER_OF_ITEMS NUMBER
ITEM_NAME VARCHAR2(4000)
Table 5-14 Transactional Itemsets View
Column Name | Description |
---|---|
|
A partition in a partitioned model |
|
Itemset identifier |
|
Item identifier |
|
Support of the itemset |
|
Number of items in the itemset |
|
The name of the item |
5.4.4 Model Detail View for Transactional Rule
Model detail view for Transactional Rule describes the transactional rule view and transactional itemsets view. Oracle recommends that you leverage model details views.
Transactional data without aggregates also has a transactional rule view DM$VA
model_name. This view can improve performance for some queries as compared to the view with the XML column. The transactional rule view has the following schema:
Name Type
----------------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
RULE_ID NUMBER
ANTECEDENT_PREDICATE VARCHAR2(4000)
CONSEQUENT_PREDICATE VARCHAR2(4000)
RULE_SUPPORT NUMBER
RULE_CONFIDENCE NUMBER
RULE_LIFT NUMBER
RULE_REVCONFIDENCE NUMBER
RULE_ITEMSET_ID NUMBER
ANTECEDENT_SUPPORT NUMBER
CONSEQUENT_SUPPORT NUMBER
NUMBER_OF_ITEMS NUMBER
Table 5-15 Transactional Rule View
Column Name | Description |
---|---|
|
A partition in a partitioned model |
|
Rule identifier |
|
Name of the Antecedent item. |
|
Name of the Consequent item |
|
Support of the rule |
|
The likelihood a transaction satisfies the rule when it contains the Antecedent. |
|
The degree of improvement in the prediction over random chance when the rule is satisfied |
|
The number of transactions in which the rule occurs divided by the number of transactions in which the consequent occurs |
|
Itemset identifier |
|
The ratio of the number of transactions that satisfy the antecedent to the total number of transactions |
|
The ratio of the number of transactions that satisfy the consequent to the total number of transactions |
|
Number of items in the rule |
5.4.5 Model Detail Views for Classification Algorithms
Model detail view for Classification algorithms describe target map view and scoring cost view which are applicable to all Classification algorithms. Oracle recommends that users leverage the model details views instead of the GET_*
function.
The target map view DM$VT
model_name describes the target distribution for Classification models. The view has the following schema:
Name Type
----------------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
TARGET_VALUE NUMBER/VARCHAR2
TARGET_COUNT NUMBER
TARGET_WEIGHT NUMBER
Table 5-16 Target Map View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Target value, numerical or categorical |
|
Number of rows for a given |
|
Weight for a given |
The scoring cost view DM$VC
model_name describes the scoring cost matrix for Classification models. The view has the following schema:
Name Type
----------------------------------------- --------------------------------
PARTITION_NAME VARCHAR2(128)
ACTUAL_TARGET_VALUE NUMBER/VARCHAR2
PREDICTED_TARGET_VALUE NUMBER/VARCHAR2
COST NUMBER
Table 5-17 Scoring Cost View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
A valid target value |
|
Predicted target value |
|
Associated cost for the actual and predicted target value pair |
5.4.6 Model Detail Views for Decision Tree
Model detail view for Decision Tree describes the split information view, node statistics view, node description view, and the cost matrix view. Oracle recommends that users leverage the model details views instead of GET_MODEL_DETAILS_XML
function.
The split information view DM$VP
model_name describes the decision tree hierarchy and the split information for each level in the Decision Tree. The view has the following schema:
Name Type
---------------------------------- ---------------------------
PARTITION_NAME VARCHAR2(128)
PARENT NUMBER
SPLIT_TYPE VARCHAR2
NODE NUMBER
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
OPERATOR VARCHAR2
VALUE SYS.XMLTYPE
Table 5-18 Split Information View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Node ID of the parent |
|
The main or surrogate split |
|
The node ID |
|
The attribute used as the splitting criterion at the parent node to produce this node. |
|
Split attribute subname. The value is null for non-nested columns. |
|
Split operator |
|
Value used as the splitting criterion. This is an XML element described using the For example, |
The node statistics view DM$VI
model_name describes the statistics associated with individual tree nodes. The statistics include a target histogram for the data in the node. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
NODE NUMBER
NODE_SUPPORT NUMBER
PREDICTED_TARGET_VALUE NUMBER/VARCHAR2
TARGET_VALUE NUMBER/VARCHAR2
TARGET_SUPPORT NUMBER
Table 5-19 Node Statistics View
Parameter | Description |
---|---|
|
Partition name in a partitioned model |
|
The node ID |
|
Number of records in the training set that belong to the node |
|
Predicted Target value |
|
A target value seen in the training data |
|
The number of records that belong to the node and have the value specified in the |
Higher level node description can be found in DM$VO
model_name view. The DM$VO
model_name has the following schema:
ame Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
NODE NUMBER
NODE_SUPPORT NUMBER
PREDICTED_TARGET_VALUE NUMBER/VARCHAR2
PARENT NUMBER
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
OPERATOR VARCHAR2
VALUE SYS.XMLTYPE
Table 5-20 Node Description View
Parameter | Description |
---|---|
|
Partition name in a partitioned model |
|
The node ID |
|
Number of records in the training set that belong to the node |
|
Predicted Target value |
|
The ID of the parent |
|
Specifies the attribute name |
|
Specifies the attribute subname |
|
Attribute predicate operator - a conditional operator taking the following values: IN, = , <>, < , >, <=, and >= |
|
Value used as the description criterion. This is an XML element described using the For example, |
The DM$VM
model_name view describes the cost matrix used by the Decision Tree build. The DM$VM
model_name view has the following schema:
Name Type
----------------------------------------- --------------------------------
PARTITION_NAME VARCHAR2(128)
ACTUAL_TARGET_VALUE NUMBER/VARCHAR2
PREDICTED_TARGET_VALUE NUMBER/VARCHAR2
COST NUMBER
Table 5-21 Cost Matrix View
Parameter | Description |
---|---|
|
Partition name in a partitioned model |
|
Valid target value |
|
Predicted Target value |
|
Associated cost for the actual and predicted target value pair |
Table 5-22 Decision Tree Statistics Information In Model Global View
Name | Description |
---|---|
|
The total number of rows used in the build |
5.4.7 Model Detail Views for Generalized Linear Model
Model details views for Generalized Linear Model (GLM) describes the model details view and row diagnostic view for Linear and Logistic Regression. Oracle recommends that users leverage model details views than the GET_MODEL_DETAILS_GLM
function.
The model details view DM$VD
model_name describes the final model information for both Linear Regression models and Logistic Regression models.
For Linear Regression, the view DM$VD
model_name has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
FEATURE_EXPRESSION VARCHAR2(4000)
COEFFICIENT BINARY_DOUBLE
STD_ERROR BINARY_DOUBLE
TEST_STATISTIC BINARY_DOUBLE
P_VALUE BINARY_DOUBLE
VIF BINARY_DOUBLE
STD_COEFFICIENT BINARY_DOUBLE
LOWER_COEFF_LIMIT BINARY_DOUBLE
UPPER_COEFF_LIMIT BINARY_DOUBLE
For Logistic Regression, the view DM$VD
model_name has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
TARGET_VALUE NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
FEATURE_EXPRESSION VARCHAR2(4000)
COEFFICIENT BINARY_DOUBLE
STD_ERROR BINARY_DOUBLE
TEST_STATISTIC BINARY_DOUBLE
P_VALUE BINARY_DOUBLE
STD_COEFFICIENT BINARY_DOUBLE
LOWER_COEFF_LIMIT BINARY_DOUBLE
UPPER_COEFF_LIMIT BINARY_DOUBLE
EXP_COEFFICIENT BINARY_DOUBLE
EXP_LOWER_COEFF_LIMIT BINARY_DOUBLE
EXP_UPPER_COEFF_LIMIT BINARY_DOUBLE
Table 5-23 Model View for Linear and Logistic Regression Models
Column Name | Description |
---|---|
|
The name of a feature in the model |
|
Valid target value |
|
The attribute name when there is no subname, or first part of the attribute name when there is a subname. |
|
Nested column subname. The value is null for non-nested columns. When the nested column is numeric, the mining attribute is identified by the combination |
|
A unique value that can be assumed by a categorical column or nested categorical column. For categorical columns, a mining attribute is identified by a unique |
|
The feature name constructed by the algorithm when feature selection is enabled. If feature selection is not enabled, the feature name is simply the fully-qualified attribute name (attribute_name.attribute_subname if the attribute is in a nested column). For categorical attributes, the algorithm constructs a feature name that has the following form: fully-qualified_attribute_name.attribute_value When feature generation is enabled, a term in the model can be a single mining attribute or the product of up to 3 mining attributes. Component mining attributes can be repeated within a single term. If feature generation is not enabled or, if feature generation is enabled, but no multiple component terms are discovered by the Note: In Oracle Database 12c Release 2, the algorithm does not subtract the mean from numerical components. |
|
The estimated coefficient. |
|
Standard error of the coefficient estimate. |
|
For Linear Regression, the t-value of the coefficient estimate. For Logistic Regression, the Wald chi-square value of the coefficient estimate. |
|
Probability of the |
|
Variance Inflation Factor. The value is zero for the intercept. For Logistic Regression, |
|
Standardized estimate of the coefficient. |
|
Lower confidence bound of the coefficient. |
|
Upper confidence bound of the coefficient. |
|
Exponentiated coefficient for Logistic Regression. For linear regression, |
|
Exponentiated coefficient for lower confidence bound of the coefficient for Logistic Regression. For Linear Regression, |
|
Exponentiated coefficient for upper confidence bound of the coefficient for Logistic Regression. For Linear Regression, |
The row diagnostic view DM$VA
model_name describes row level information for both Linear Regression models and Logistic Regression models. For Linear Regression, the view DM$VA
model_name has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
CASE_ID NUMBER/VARHCAR2
TARGET_VALUE BINARY_DOUBLE
PREDICTED_TARGET_VALUE BINARY_DOUBLE
Hat BINARY_DOUBLE
RESIDUAL BINARY_DOUBLE
STD_ERR_RESIDUAL BINARY_DOUBLE
STUDENTIZED_RESIDUAL BINARY_DOUBLE
PRED_RES BINARY_DOUBLE
COOKS_D BINARY_DOUBLE
Table 5-24 Row Diagnostic View for Linear Regression
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Name of the case identifier |
|
The actual target value as taken from the input row |
|
The model predicted target value for the row |
|
The diagonal element of the n*n (n=number of rows) that the Hat matrix identifies with a specific input row. The model predictions for the input data are the product of the Hat matrix and vector of input target values. The diagonal elements (Hat values) represent the influence of the ith row on the ith fitted value. Large Hat values are indicators that the ith row is a point of high leverage, a potential outlier. |
|
The difference between the predicted and actual target value for a specific input row. |
|
The standard error residual, sometimes called the Studentized residual, re-scales the residual to have constant variance across all input rows in an effort to make the input row residuals comparable. The process multiplies the residual by square root of the row weight divided by the product of the model mean square error and 1 minus the Hat value. |
|
Studentized deletion residual adjusts the standard error residual for the influence of the current row. |
|
The predictive residual is the weighted square of the deletion residuals, computed as the row weight multiplied by the square of the residual divided by 1 minus the Hat value. |
|
Cook's distance is a measure of the combined impact of the ith case on all of the estimated regression coefficients. |
For Logistic Regression, the view DM$VA
model_name has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
CASE_ID NUMBER/VARHCAR2
TARGET_VALUE NUMBER/VARCHAR2
TARGET_VALUE_PROB BINARY_DOUBLE
Hat BINARY_DOUBLE
WORKING_RESIDUAL BINARY_DOUBLE
PEARSON_RESIDUAL BINARY_DOUBLE
DEVIANCE_RESIDUAL BINARY_DOUBLE
C BINARY_DOUBLE
CBAR BINARY_DOUBLE
DIFDEV BINARY_DOUBLE
DIFCHISQ BINARY_DOUBLE
Table 5-25 Row Diagnostic View for Logistic Regression
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Name of the case identifier |
|
The actual target value as taken from the input row |
|
Model estimate of the probability of the predicted target value. |
|
The Hat value concept from Linear Regression is extended to Logistic Regression by multiplying the Linear Regression Hat value by the variance function for Logistic Regression, the predicted probability multiplied by 1 minus the predicted probability. |
|
The working residual is the residual of the working response. The working response is the response on the linearized scale. For Logistic Regression it has the form: the ith row residual divided by the variance of the ith row prediction. The variance of the prediction is the predicted probability multiplied by 1 minus the predicted probability.
|
|
The Pearson residual is a re-scaled version of the working residual, accounting for the weight. For Logistic Regression, the Pearson residual multiplies the residual by a factor that is computed as square root of the weight divided by the variance of the predicted probability for the ith row.
|
|
The |
|
Measures the overall change in the fitted logits due to the deletion of the ith observation for all points including the one deleted (the ith point). It is computed as the square of the Pearson residual multiplied by the Hat value divided by the square of 1 minus the Hat value. Confidence interval displacement diagnostics that provides scalar measure of the influence of individual observations. |
|
C and CBAR are extensions of Cooks’ distance for Logistic Regression. CBAR measures the overall change in the fitted logits due to the deletion of the ith observation for all points excluding the one deleted (the ith point). It is computed as the square of the Pearson residual multiplied by the Hat value divided by (1 minus the Hat value)
Confidence interval displacement diagnostic which measures the influence of deleting an individual observation. |
|
A statistic that measures the change in deviance that occurs when an observation is deleted from the input. It is computed as the square of the deviance residual plus |
|
A statistic that measures the change in the Pearson chi-square statistic that occurs when an observation is deleted from the input. It is computed as |
Global Details for GLM: Linear Regression
The following table describes global details returned by a Linear Regression model.
Table 5-26 Global Details for Linear Regression
Name | Description |
---|---|
|
Adjusted R-Square |
|
Akaike's information criterion |
|
Coefficient of variation |
|
Indicates whether the model build process has converged to specified tolerance. The following are the possible values:
|
|
Corrected total degrees of freedom |
|
Corrected total sum of squares |
|
Dependent mean |
|
Error degrees of freedom |
|
Error mean square |
|
Error sum of squares |
|
Model F value statistic |
|
Estimated mean square error of the prediction, assuming multivariate normality |
|
Hocking Sp statistic |
|
Tracks the number of SGD iterations. Applicable only when the solver is SGD. |
|
JP statistic (the final prediction error) |
|
Model degrees of freedom |
|
Model F value probability |
|
Model mean square error |
|
Model sum of square errors |
|
Number of parameters (the number of coefficients, including the intercept) |
|
Number of rows |
|
R-Square |
|
The number of predictors excluded from the model due to multi-collinearity |
|
Root mean square error |
|
Schwarz's Bayesian information criterion |
Global Details for GLM: Logistic Regression
The following table returns global details returned by a Logistic Regression model.
Table 5-27 Global Details for Logistic Regression
Name | Description |
---|---|
|
Akaike's criterion for the fit of the baseline, intercept-only, model |
|
Akaike's criterion for the fit of the intercept and the covariates (predictors) mode |
|
Indicates whether the model build process has converged to specified tolerance. The following are the possible values:
|
|
Dependent mean |
|
Tracks the number of SGD iterations (number of IRLS iterations). Applicable only when the solver is SGD. |
|
Likelihood ratio degrees of freedom |
|
Likelihood ratio chi-square value |
|
Likelihood ratio chi-square probability value |
|
-2 log likelihood of the baseline, intercept-only, model |
|
-2 log likelihood of the model |
|
Number of parameters (the number of coefficients, including the intercept) |
|
Number of rows |
|
Percent of correct predictions |
|
Percent of incorrectly predicted rows |
|
Percent of cases where the estimated probabilities are equal for both target classes |
|
Pseudo R-square Cox and Snell |
|
Pseudo R-square Nagelkerke |
|
The number of predictors excluded from the model due to multi-collinearity |
|
Schwarz's Criterion for the fit of the baseline, intercept-only, model |
|
Schwarz's Criterion for the fit of the intercept and the covariates (predictors) model |
Note:
-
When Ridge Regression is enabled, fewer global details are returned. For information about ridge, see Oracle Data Mining Concepts.
-
When the value is
NULL
for a partitioned model, an exception is thrown. When the value is not null, it must contain the desired partition name.
Related Topics
5.4.8 Model Detail Views for Naive Bayes
Model Detail Views for Naive Bayes describes prior view and result view. Oracle recommends that users leverage the model details views instead of the GET_MODEL_DETAILS_NB
function.
The prior view DM$VP
model_name describes the priors of the targets for Naïve Bayes. The view has the following schema:
Name Type
----------------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
TARGET_NAME VARCHAR2(128)
TARGET_VALUE NUMBER/VARCHAR2
PRIOR_PROBABILITY BINARY_DOUBLE
COUNT NUMBER
Table 5-28 Prior View for Naive Bayes
Column Name | Description |
---|---|
|
The name of a feature in the model |
|
Name of the target column |
|
Target value, numerical or categorical |
|
Prior probability for a given |
|
Number of rows for a given |
The Naïve Bayes result view DM$VV
model_view describes the conditional probabilities of the Naïve Bayes model. The view has the following schema:
Name Type
----------------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
TARGET_NAME VARCHAR2(128)
TARGET_VALUE NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
CONDITIONAL_PROBABILITY BINARY_DOUBLE
COUNT NUMBER
Table 5-29 Result View for Naive Bayes
Column Name | Description |
---|---|
|
The name of a feature in the model |
|
Name of the target column |
|
Target value, numerical or categorical |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Mining attribute value for the column |
|
Conditional probability of a mining attribute for a given target |
|
Number of rows for a given mining attribute and a given target |
Table 5-30 Naive Bayes Statistics Information In Model Global View
Name | Description |
---|---|
|
The total number of rows used in the build |
5.4.9 Model Detail View for Support Vector Machine
Model Detail View for Support Vector Machine describes linear coefficient view. Oracle recommends that users leverage the model details views instead of the GET_MODEL_DETAILS_SVM
function.
The linear coefficient view DM$VL
model_name describes the coefficients of a linear SVM algorithm. The target_value field in the view is present only for Classification and has the type of the target. Regression models do not have a target_value field.
The reversed_coefficient field shows the value of the coefficient after reversing the automatic data preparation transformations. If data preparation is disabled, then coefficient and reversed_coefficient have the same value. The view has the following schema:
Name Type
----------------------------------------- --------------------------------
PARTITION_NAME VARCHAR2(128)
TARGET_VALUE NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
COEFFICIENT BINARY_DOUBLE
REVERSED_COEFFICIENT BINARY_DOUBLE
Table 5-31 Linear Coefficient View for Support Vector Machine
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Target value, numerical or categorical |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Value of a categorical attribute |
|
Projection coefficient value |
|
Coefficient transformed on the original scale |
Table 5-32 Support Vector Statistics Information In Model Global View
Name | Description |
---|---|
|
Indicates whether the model build process has converged to specified tolerance:
|
|
Number of iterations performed during build |
|
Number of rows used for the build |
|
Number of rows removed due to 0 norm. This applies to one-class linear models only. |
5.4.10 Model Detail Views for Clustering Algorithms
Oracle Data Mining supports these clustering algorithms: Expectation Maximization, k-Means, and Orthogonal Partitioning Clustering (O-Cluster).
All clustering algorithms share the following views:
-
Cluster description
DM$VD
model_name -
Attribute statistics
DM$VA
model_name -
Histogram statistics
DM$VH
model_name -
Rule statistics
DM$VR
model_name
The cluster description view DM$VD
model_name describes cluster level information about a clustering model. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
CLUSTER_ID NUMBER
CLUSTER_NAME NUMBER/VARCHAR2
RECORD_COUNT NUMBER
PARENT NUMBER
TREE_LEVEL NUMBER
LEFT_CHILD_ID NUMBER
RIGHT_CHILD_ID NUMBER
Table 5-33 Cluster Description View for Clustering Algorithm
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
The ID of a cluster in the model |
|
Specifies the label of the cluster |
|
Specifies the number of records |
|
The ID of the parent |
|
Specifies the number of splits from the root |
|
The ID of the child cluster on the left side of the split |
|
The ID of the child cluster on the right side of the split |
The attribute view DM$VA
model_name describes attribute level information about a Clustering model. The values of the mean, variance, and mode for a particular cluster can be obtained from this view. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
CLUSTER_ID NUMBER
CLUSTER_NAME NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
MEAN BINARY_DOUBLE
VARIANCE BINARY_DOUBLE
MODE_VALUE VARCHAR2(4000)
Table 5-34 Attribute View for Clustering Algorithm
Column Name | Description |
---|---|
|
A partition in a partitioned model |
|
The ID of a cluster in the model |
|
Specifies the label of the cluster |
|
Specifies the attribute name |
|
Specifies the attribute subname |
|
The field returns the average value of a numeric attribute |
|
The variance of a numeric attribute |
|
The mode is the most frequent value of a categorical attribute |
The histogram view DM$VH
model_name describes histogram level information about a Clustering model. The bin information as well as bin counts can be obtained from this view. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
CLUSTER_ID NUMBER
CLUSTER_NAME NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
BIN_ID NUMBER
LOWER_BIN_BOUNDARY BINARY_DOUBLE
UPPER_BIN_BOUNDARY BINARY_DOUBLE
ATTRIBUTE_VALUE VARCHAR2(4000)
COUNT NUMBER
Table 5-35 Histogram View for Clustering Algorithm
Column Name | Description |
---|---|
|
A partition in a partitioned model |
|
The ID of a cluster in the model |
|
Specifies the label of the cluster |
|
Specifies the attribute name |
|
Specifies the attribute subname |
|
Bin ID |
|
Numeric lower bin boundary |
|
Numeric upper bin boundary |
|
Categorical attribute value |
|
Histogram count |
The rule view DM$VR
model_name describes the rule level information about a Clustering model. The information is provided at attribute predicate level. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
CLUSTER_ID NUMBER
CLUSTER_NAME NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
OPERATOR VARCHAR2(2)
NUMERIC_VALUE NUMBER
ATTRIBUTE_VALUE VARCHAR2(4000)
SUPPORT NUMBER
CONFIDENCE BINARY_DOUBLE
RULE_SUPPORT NUMBER
RULE_CONFIDENCE BINARY_DOUBLE
Table 5-36 Rule View for Clustering Algorithm
Column Name | Description |
---|---|
|
A partition in a partitioned model |
|
The ID of a cluster in the model |
|
Specifies the label of the cluster |
|
Specifies the attribute name |
|
Specifies the attribute subname |
|
Attribute predicate operator - a conditional operator taking the following values: IN, = , <>, < , >, <=, and >= |
|
Numeric lower bin boundary |
|
Categorical attribute value |
|
Attribute predicate support |
|
Attribute predicate confidence |
|
Rule level support |
|
Rule level confidence |
5.4.11 Model Detail Views for Expectation Maximization
Model detail views for Expectation Maximization (EM) describes the differences in the views for EM against those of Clustering views. Oracle recommends that user leverage the model details views instead of the GET_MODEL_DETAILS_EM
function.
The following views are the differences in the views for Expectation Maximization against Clustering views. For an overview of the different Clustering views, refer to "Model Detail Views for Clustering Algorithms".
The component view DM$VO
model_name describes the EM components. The component view contains information about their prior probabilities and what cluster they map to. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
COMPONENT_ID NUMBER
CLUSTER_ID NUMBER
PRIOR_PROBABILITY BINARY_DOUBLE
Table 5-37 Component View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Unique identifier of a component |
|
The ID of a cluster in the model |
|
Component prior probability |
The mean and variance component view DM$VM
model_name provides information about the mean and variance parameters for the attributes by Gaussian distribution models. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
COMPONENT_ID NUMBER
ATTRIBUTE_NAME VARCHAR2(4000)
MEAN BINARY_DOUBLE
VARIANCE BINARY_DOUBLE
The frequency component view DM$VF
model_name provides information about the parameters of the multi-valued Bernoulli distributions used by the EM model. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
COMPONENT_ID NUMBER
ATTRIBUTE_NAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
FREQUENCY BINARY_DOUBLE
Table 5-38 Frequency Component View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Unique identifier of a component |
|
Column name |
|
Categorical attribute value |
|
The frequency of the multivalued Bernoulli distribution for the attribute/value combination specified by |
For 2-Dimensional columns, EM provides an attribute ranking similar to that of Attribute Importance. This ranking is based on a rank-weighted average over Kullback–Leibler divergence computed for pairs of columns. This unsupervised Attribute Importance is shown in the DM$VI
model_name view and has the following schema:
Name Type
----------------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_IMPORTANCE_VALUE BINARY_DOUBLE
ATTRIBUTE_RANK NUMBER
Table 5-39 2–Dimensional Attribute Ranking for Expectation Maximization
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Column name |
|
Importance value |
|
An attribute rank based on the importance value |
The pairwise
Kullback–Leibler divergence is reported in the DM$VB
model_name view. This metric evaluates how much the observed joint distribution of two attributes diverges from the expected distribution under the assumption of independence. That is, the higher the value, the more dependent the two attributes are. The dependency value is scaled based on the size of the grid used for each pairwise computation. That ensures that all values fall within the [0; 1] range and are comparable. The view has the following schema:
Name Type
----------------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
ATTRIBUTE_NAME_1 VARCHAR2(128)
ATTRIBUTE_NAME_2 VARCHAR2(128)
DEPENDENCY BINARY_DOUBLE
Table 5-40 Kullback-Leibler Divergence for Expectation Maximization
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Name of an attribute 1 |
|
Name of an attribute 2 |
|
Scaled pairwise Kullback-Leibler divergence |
The projection table DM$VP
model_name shows the coefficients used by random projections to map nested columns to a lower dimensional space. The view has rows only when nested or text data is present in the build data. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
FEATURE_NAME VARCHAR2(4000)
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
COEFFICIENT NUMBER
Table 5-41 Projection table for Expectation Maximization
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Name of feature |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Categorical attribute value |
|
Projection coefficient. The representation is sparse; only the non-zero coefficients are returned. |
Global Details for Expectation Maximization
The following table describes global details for Expectation Maximization.
Table 5-42 Global Details for Expectation Maximization
Name | Description |
---|---|
|
Indicates whether the model build process has converged to specified tolerance. The possible values are:
|
|
Loglikelihood on the build data |
|
Number of components produced by the model |
|
Number of clusters produced by the model |
|
Number of rows used in the build |
|
The random seed value used for the model build |
|
The number of empty components excluded from the model |
Related Topics
5.4.12 Model Detail Views for k-Means
Model detail views for k-Means (KM) describes cluster description view and scoring view. Oracle recommends that you leverage model details view instead of GET_MODEL_DETAILS_KM
function.
This section describes the differences in the views for k-Means against the Clustering views. For an overview of the different views, refer to "Model Detail Views for Clustering Algorithms". For k-Means, the cluster description view DM$VD
model_name has an additional column:
Name Type
---------------------------------- ----------------------------
DISPERSION BINARY_DOUBLE
Table 5-43 Cluster Description for k-Means
Column Name | Description |
---|---|
|
A measure used to quantify whether a set of observed occurrences are dispersed compared to a standard statistical model. |
The scoring view DM$VC
model_name describes the centroid of each leaf clusters:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
CLUSTER_ID NUMBER
CLUSTER_NAME NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
VALUE BINARY_DOUBLE
Table 5-44 Scoring View for k-Means
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
The ID of a cluster in the model |
|
Specifies the label of the cluster |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Categorical attribute value |
|
Specifies the centroid value |
Table 5-45 k–Means Statistics Information In Model Global View
Name | Description |
---|---|
|
Indicates whether the model build process has converged to specified tolerance. The following are the possible values:
|
|
Number of rows used in the build |
|
Number of rows removed due to 0 norm. This applies only to models using cosine distance. |
Related Topics
5.4.13 Model Detail Views for O-Cluster
Model Detail Views for O-Cluster describes the statistics views. Oracle recommends that user leverage the model details views instead of the GET_MODEL_DETAILS_OC
function.
The following are the differences in the views for O-Cluster against Clustering views. For an overview of the different clustering views, refer to "Model Detail Views for Clustering Algorithms". The OC algorithm uses the same descriptive statistics views as Expectation Maximization (EM) and k-Means (KM). The following are the statistics views:
-
Cluster description
DM$VD
model_name -
Attribute statistics
DM$VA
model_name -
Rule statistics
DM$VR
model_name -
Histogram statistics
DM$VH
model_name
The Cluster description view DM$VD
model_name describes the O-Cluster components. The cluster description view has additional fields that specify the split predicate. The view has the following schema:
Name Type
---------------------------------- ----------------------------
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
OPERATOR VARCHAR2(2)
VALUE SYS.XMLTYPE
Table 5-46 Description View
Column Name | Description |
---|---|
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Split operator |
|
List of split values |
SYS.XMLTYPE
is as follows:<Element>splitval1</Element>
The OC algorithm uses a histogram view DM$VH
model_name with a different schema than EM and k-Means (KM). The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITON_NAME VARCHAR2(128)
CLUSTER_ID NUMBER
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
BIN_ID NUMBER
LABEL VARCHAR2(4000)
COUNT NUMBER
Table 5-47 Histogram Component View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Unique identifier of a component |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Unique identifier |
|
Bin label |
|
Bin histogram count |
Table 5-48 O-Cluster Statistics Information In Model Global View
Name | Description |
---|---|
|
The total number of rows used in the build |
Related Topics
5.4.14 Model Detail Views for Explicit Semantic Analysis
Model Detail Views for Explicit Semantic Analysis (ESA) describes attribute statistics view and feature view. Oracle recommends that users leverage the model details views.
ESA algorithm has the following descriptive statistics views:
-
Attribute statistics
DM$VA
model_name -
Features
DM$VF
model_name
The view DM$VA
model_name has the following schema:
PARTITION_NAME VARCHAR2(128)
FEATURE_ID NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
COEFFICIENT BINARY_DOUBLE
Table 5-49 Attribute View for Explicit Semantic Analysis
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Unique identifier of a feature as it appears in the training data |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Categorical attribute value |
|
A measure of the weight of the attribute with respect to the feature |
The view DM$VF
model_name has a unique row for every feature in one view. This feature is helpful if the model was pre-built and the source training data are not available. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
FEATURE_ID NUMBER/VARCHAR2
Table 5-50 Feature View for Explicit Semantic Analysis
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Unique identifier of a feature as it appears in the training data |
Table 5-51 Explicit Semantic Analysis Statistics Information In Model Global View
Name | Description |
---|---|
|
The total number of input rows |
|
Number of rows removed by filters |
5.4.15 Model Detail Views for Non-Negative Matrix Factorization
Model detail views for Non-Negative Matrix Factorization (NMF) describes encoding H matrix view and H inverse matrix view. Oracle recommends that users leverage the model details views instead of the GET_MODEL_DETAILS_NMF
function.
The NMF algorithm has two matrix content views:
-
Encoding (H) matrix
DM$VE
model_name -
H inverse matrix
DM$VI
model_name
The view DM$VE
model_name describes the encoding (H) matrix of an NMF model. The FEATURE_NAME
column type may be either NUMBER
or VARCHAR2
. The view has the following schema definition.
Name Type
------------------- --------------------------
PARTITION_NAME VARCHAR2(128)
FEATURE_ID NUMBER
FEATURE_NAME NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
COEFFICIENT BINARY_DOUBLE
Table 5-52 Encoding H Matrix View for Non-Negative Matrix Factorization
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
The ID of a feature in the model |
|
The name of a feature in the model |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Specifies the value of attribute |
|
The attribute encoding that represents its contribution to the feature |
The view DM$VI
model_view describes the inverse H matrix of an NMF model. The FEATURE_NAME
column type may be either NUMBER
or VARCHAR2
. The view has the following schema:
Name Type
----------------- ------------------------
PARTITION_NAME VARCHAR2(128)
FEATURE_ID NUMBER
FEATURE_NAME NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
COEFFICIENT BINARY_DOUBLE
Table 5-53 Inverse H Matrix View for Non-Negative Matrix Factorization
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
The ID of a feature in the model |
|
The name of a feature in the model |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Specifies the value of attribute |
|
The attribute encoding that represents its contribution to the feature |
Table 5-54 Non-Negative Matrix Factorization Statistics Information In Model Global View
Name | Description |
---|---|
|
Convergence error |
|
Indicates whether the model build process has converged to specified tolerance. The following are the possible values:
|
|
Number of iterations performed during build |
|
Number of rows used in the build input dataset |
|
Number of rows used by the build |
5.4.16 Model Detail Views for Singular Value Decomposition
Model detail views for Singular Value Decomposition (SVD) describes S Matrix view, right-singular vectors view, and left-singular vector view. Oracle recommends that users leverage the model details views instead of the GET_MODEL_DETAILS_SVD
function.
The DM$VE
model_name view leverages the fact that each singular value in the SVD model has a corresponding principal component in the associated Principal Components Analysis (PCA) model to relate a common set of information for both classes of models. For a SVD model, it describes the content of the S matrix. When PCA scoring is selected as a build setting, the variance and percentage cumulative variance for the corresponding principal components are shown as well. The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
FEATURE_ID NUMBER
FEATURE_NAME NUMBER/VARCHAR2
VALUE BINARY_DOUBLE
VARIANCE BINARY_DOUBLE
PCT_CUM_VARIANCE BINARY_DOUBLE
Table 5-55 S Matrix View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
The ID of a feature in the model |
|
The name of a feature in the model |
|
The matrix entry value |
|
The variance explained by a component. This column is only present for SVD models with setting This column is non-null only if the build data is centered, either manually or because of the following setting: |
|
The percent cumulative variance explained by the components thus far. The components are ranked by the explained variance in descending order. This column is only present for SVD models with setting This column is non-null only if the build data is centered, either manually or because of the following setting: |
The SVD DM$VV
model_view describes the right-singular vectors of SVD model. For a PCA model it describes the principal components (eigenvectors). The view has the following schema:
Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
FEATURE_ID NUMBER
FEATURE_NAME NUMBER/VARCHAR2
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_VALUE VARCHAR2(4000)
VALUE BINARY_DOUBLE
Table 5-56 Right-singular Vectors of Singular Value Decomposition
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
The ID of a feature in the model |
|
The name of a feature in the model |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Categorical attribute value. For numerical attributes, |
|
The matrix entry value |
DM$VU
model_name describes the left-singular vectors of a SVD model. For a PCA model, it describes the projection of the data in the principal components. This view does not exist unless the settings dbms_data_mining.svds_u_matrix_output
is set to dbms_data_mining.svds_u_matrix_enable
. The view has the following schema:Name Type
---------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
CASE_ID NUMBER/VARHCAR2
FEATURE_ID NUMBER
FEATURE_NAME NUMBER/VARCHAR2
VALUE BINARY_DOUBLE
Table 5-57 Left-singular Vectors of Singular Value Decomposition or Projection Data in Principal Components
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Unique identifier of the row in the build data described by the U matrix projection. |
|
The ID of a feature in the model |
|
The name of a feature in the model |
|
The matrix entry value |
Global Details for Singular Value Decomposition
The following table describes a global detail for Singular Value Decomposition.
Table 5-58 Global Details for Singular Value Decomposition
Name | Description |
---|---|
|
Number of features (components) produced by the model |
|
The total number of rows used in the build |
|
Suggested cutoff that indicates how many of the top computed features capture most of the variance in the model. Using only the features below this cutoff would be a reasonable strategy for dimensionality reduction. |
Related Topics
5.4.17 Model Detail View for Minimum Description Length
Model detail view for Minimum Description Length (for calculating Attribute Importance) describes Attribute Importance view. Oracle recommends that users leverage the model details views instead of the GET_MODEL_DETAILS_AI
function.
The Attribute Importance view DM$VA
model_name describes the Attribute Importance as well as the Attribute Importance rank. The view has the following schema:
Name Type
----------------------------------------- ----------------------------
PARTITION_NAME VARCHAR2(128)
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
ATTRIBUTE_IMPORTANCE_VALUE BINARY_DOUBLE
ATTRIBUTE_RANK NUMBER
Table 5-59 Attribute Importance View for Minimum Description Length
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Importance value |
|
Rank based on importance |
Table 5-60 Minimum Description Length Statistics Information In Model Global View
Name | Description |
---|---|
|
The total number of rows used in the build |
5.4.18 Model Detail View for Binning
The binning view DM$VB
describes the bin boundaries used in the automatic data preparation.
The view has the following schema:
Name Type
-------------------- --------------------
PARTITION_NAME VARCHAR2(128)
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
BIN_ID NUMBER
LOWER_BIN_BOUNDARY BINARY_DOUBLE
UPPER_BIN_BOUNDARY BINARY_DOUBLE
ATTRIBUTE_VALUE VARCHAR2(4000)
Table 5-61 Model Details View for Binning
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Specifies the attribute name |
|
Specifies the attribute subname |
|
Bin ID (or bin identifier) |
|
Numeric lower bin boundary |
|
Numeric upper bin boundary |
|
Categorical value |
5.4.19 Model Detail Views for Global Information
Model detail views for Global Information describes global statistics view, alert view, and computed settings view. Oracle recommends that users leverage the model details views instead of GET_MODEL_DETAILS_GLOBAL
function.
The global statistics view DM$VG
model_name describes global statistics related to the model build. Examples include the number of rows used in the build, the convergence status, and the model quality metrics. The view has the following schema:
Name Type
------------------- --------------------
PARTITION_NAME VARCHAR2(128)
NAME VARCHAR2(30)
NUMERIC_VALUE NUMBER
STRING_VALUE VARCHAR2(4000)
Table 5-62 Global Statistics View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Name of the statistic |
|
Numeric value of the statistic |
|
Categorical value of the statistic |
The alert view DM$VW
model_name lists alerts issued during the model build. The view has the following schema:
Name Type
------------------- ----------------------
PARTITION_NAME VARCHAR2(128)
ERROR_NUMBER BINARY_DOUBLE
ERROR_TEXT VARCHAR2(4000)
Table 5-63 Alert View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Error number (valid when event is Error) |
|
Error message |
The computed settings view DM$VS
model_name lists the algorithm computed settings. The view has the following schema:
Name Type
----------------- --------------------
PARTITION_NAME VARCHAR2(128)
SETTING_NAME VARCHAR2(30)
SETTING_VALUE VARCHAR2(4000)
Table 5-64 Computed Settings View
Column Name | Description |
---|---|
|
Partition name in a partitioned model |
|
Name of the setting |
|
Value of the setting |
5.4.20 Model Detail View for Normalization and Missing Value Handling
The Normalization and Missing Value Handling View DM$VN
describes the normalization parameters used in Automatic Data Preparation (ADP) and the missing value replacement when a NULL
value is encountered. Missing value replacement applies only to the twodimensional columns and does not apply to the nested columns.
The view has the following schema:
Name Type
---------------------- -----------------------
PARTITION_NAME VARCHAR2(128)
ATTRIBUTE_NAME VARCHAR2(128)
ATTRIBUTE_SUBNAME VARCHAR2(4000)
NUMERIC_MISSING_VALUE BINARY_DOUBLE
CATEGORICAL_MISSING_VALUE VARCHAR2(4000)
NORMALIZATION_SHIFT BINARY_DOUBLE
NORMALIZATION_SCALE BINARY_DOUBLE
Table 5-65 Normalization and Missing Value Handling View
Column Name | Description |
---|---|
|
A partition in a partitioned model |
|
Column name |
|
Nested column subname. The value is null for non-nested columns. |
|
Numeric missing value replacement |
|
Categorical missing value replacement |
|
Normalization shift value |
|
Normalization scale value |