Changes in This Release for Oracle Data Mining User's Guide
Changes in this release for Oracle Data Mining User’s Guide.
Oracle Data Mining User's Guide is New in This Release
-
This guide is new in release 12c. Oracle Data Mining User's Guide replaces two manuals that were provided in previous releases: Oracle Data Mining Administrator's Guide and Oracle Data Mining Application Developer's Guide.
-
Information about database administration for Oracle Data Mining is now consolidated in Administrative Tasks for Oracle Data Mining . The remaining chapters of this guide are devoted to application development.
-
Information about the Data Mining sample programs is now in The Data Mining Sample Programs.
Changes in Oracle Data Mining 12c Release 2 (12.2)
The following changes are documented in Oracle Data Mining User’s Guide for 12c Release 2 (12.2).
New Features in 12c Release 2
The following features are new in this release:
Partitioned Models
-
Data Mining SQL function
A new Data Mining SQL function
ORA_DM_PARTITION_NAME
is included for partitioned models. The function returns the partition names for a partitioned model.See Data Mining SQL Scoring Functions.
Provided new scoring functions
See Partitioned Model scoring.
See GROUPING Hint.
-
About partitioned model
Description of Partitioned model is added
-
DDL in partitioned model
Explained the newly added Add and Drop partition for maintenance operations.
Model Views
-
Added new Model Detail Views. Model Detail Views are preferred over
GET*
functions.See Model Detail Views.
New Data Dictionary Views. See Data Mining Data Dictionary Views.
Explicit Semantic Analysis
-
Newly added
FEATURE_COMPARE
SQL function -
FEATURE_COMPARE
SQL functionProvides an example of the new SQL function
FEATURE_COMPARE
using ESA algorithm.
Association Rules Aggregates
-
Using retail analysis data
Added enhancements to Association Rules and an example to show the concept of aggregates.
See Using Retail Analysis Data.
See Model Detail Views.
R Extensibility
-
Mining model settings for R
New mining model settings are included for R, to define the characteristics of R models. The mining model settings can be used with generic settings that are independent of algorithms, to specify R model build, score and view.
-
DBMS_DATA_MINING for R
The DBMS_DATA_MINING subprograms that are independent of algorithms, can operate on R model for mining functions such as Classification, Clustering, Feature Extraction, and Regression.
See DBMS_DATA_MINING.
Changes in Oracle Data Mining 12c Release 1 (12.1)
The following changes are documented in Oracle Data Mining User's Guide for 12c Release 1 (12.1).
New Features
The following features are new in this release:
-
Expanded prediction details
The
PREDICTION_DETAILS
function now supports all predictive algorithms and returns more details about the predictors. New functions,CLUSTER_DETAILS
andFEATURE_DETAILS
, are introduced.See Prediction Details.
-
Dynamic scoring
The Data Mining SQL functions now support an analytic clause for scoring data dynamically without a pre-defined model.
See Dynamic Scoring.
-
Significant enhancements in text mining
This enhancement greatly simplifies the data mining process (model build, deployment and scoring) when unstructured text data is present in the input.
-
Manual pre-processing of text data is no longer needed.
-
No text index must be created.
-
Additional data types are supported:
CLOB
,BLOB
,BFILE
. -
Character data can be specified as either categorical values or text.
-
-
New clustering algorithm: Expectation Maximization
See the following:
-
New feature extraction algorithm: Singular Value Decomposition with Principal Component Analysis
See the following:
-
Generalized Linear Models are enhanced to support feature selection and creation.
Desupported Features
The following features are no longer supported by Oracle. See Oracle Database Upgrade Guide for a complete list of desupported features in this release.
-
Oracle Data Mining Java API
-
Adaptive Bayes Network (ABN) algorithm
Other Changes
The following are additional new features in this release:
-
A new SQL function,
CLUSTER_DISTANCE
, is introduced.CLUSTER_DISTANCE
returns the raw distance between each row and the cluster centroid.See Scoring and Deployment .
-
New support for native double data types,
BINARY_DOUBLE
andBINARY_FLOAT
, improves the performance of the SQL scoring functions.See Preparing the Data.
-
Decision Tree algorithm now supports nested data.
See Preparing the Data.