Text Application Developer's Guide
ContentsOpens a new window
Opens a new window
Page 190 of 321

Supervised Classification

With supervised classification, you employ the CTX_CLS.TRAIN procedure to automate the rule writing step. CTX_CLS.TRAIN uses a training set of sample documents to deduce classification rules. This is the major advantage over rule-based classification, in which you must write the classification rules.

However, before you can run the CTX_CLS.TRAIN procedure, you must manually create categories and assign each document in the sample training set to a category.

See Also:

Oracle Text Reference for more information on CTX_CLS.TRAIN

When the rules are generated, you index them to create a CTXRULE index. You can then use the MATCHES operator to classify an incoming stream of new documents.

You may choose between two different classification algorithms for supervised classification:

Previous PageOpens a new window
Page 190 of 321
Next PageOpens a new window