Académique Documents
Professionnel Documents
Culture Documents
INDICE ANALTICO
modelo multidimensional, 49-50, 52 117, 123, 125, 135, 139, 146, 148,
operadores de. Vase operadores 162,258,260,281-285,287-301,
# OLAP 303,316,322,373,389,398,406,
alta dimensionalidad, 5, 30, 79, 116, 118, 445,474,479,486,488,491,495-
8-subsuncin, 308-309, 314, 326 153,163,227,236,354,364,380, 497,499,502,505-507,512-514,
399,445,554,568,605 521,529,554,602,604,609-614,
anlisis correlacional, 12, 120 616-617,619-621,623
Jl anlisis de componentes principales, 65- Area Under the ROC Curve, 287, 470-
67,79-83,93, 118, 122, 124, 142, 472,515,636
ACP. Vase anlisis de componentes
184,189,199,209,343-346,355, asociacin. Vase reglas de asociacin
principales
376,401,604,629 atipico. Vase valor:anmalo
AdaBoost, 378, 380, 489, 497
anlisis de correspondencias, 65-66, 90, atributos relevantes, 99, 445, 457, 587
Adaline. Vase regla Adaline
122-123 aumento de dimensionalidad. Vase
adaptacin
anlisis de residuos. Vase residuos dimensionalidad:aumento
funcin de, 35, 386-391, 393-402,
anlisis de varianza. Vase ANOV A AutoClass, 622
410-411,415,448
medida, 386, 388, 394 anlisis discriminante, 86, 148,203, auto correlacin, 533-535
208-211,213,230,233,496 autoridades, 551, 560-561
medida de, 35, 386,450
lineal, 86
agrupamiento jerrquico, 143,421,436,
439,458 anlisis exploratorio de datos. Vase
exploratory data analysis
(]3
algoritmo Apriori, 146, 148,240-241,
248,253-254,256,319,612-613 anlisis factorial, 65-66, 79, 82-83, 148
anlisis inteligente de datos, 418 bagging, 115,298,488-489,491-492,
algoritmo B, 269, 273, 490 494,497-498,505,604,613,619
anlisis multivariante, XV, 41, 65-66,
algoritmo de retropropagacin, 147, bag-of-words. Vase vector de palabras
332-333, 335-336, 352 79,82,93-94,121-122
base de datos
algoritmo de Wang y Mendel, 407-409 anlisis por modelo lineal, 121
anlisis ROC, 15,39,144,202,211, distribuida, 603
algoritmo EM (Expectation documental, 11
469-471,515,519,601
Maximization),276 espacial, 11, 525-528, 530-531
anmalos, 77,144,189,423, Vase
algoritmo K2, 269, 274 inductiva, 316, 318-319
tambin valor anmalo
algoritmo Nalve Bayes, 258, 260-262, multimedia, 11,539
ANOV A, 92, 122, 178
271,273,275,278-279 relacional, XVI, 9-11,43-44,55-56,
algoritmo PAM, 455, 458, 530 anytime, 154
aprendizaje 127,161,302-303,306,313-314,
algoritmo TAN, 272 317,407,547,557,592,603,616,
algoritmos evolutivos, 34, 135, 383-384, de Hebb, 327, 343-347, 639
631
inductivo, 38,146,148-149,161,
386-387,389-390, 399-400,409, temporal, 11
307,310,553
417-418,450,629 transaccional, XIV, 44-48, 56, 59-61,
no supervisado, 144,252,327,330,
algoritmo s genticos, 27, 35, 148,384, 63,575,590
343,347,351,377,421
387,389-391,393-396,398-402,410, Bayes. Vase teorema de Bayes, Vase
416-417,419,448-450,617,623 supervisado, 144,216,252,317-318,
327,330,332,341,343,351,367, tambin algoritmo Nalve Bayes
alisado, 116,203,216 B-course, 279
377,446-447,537-538,541-542,
almacn de datos, XIII, XIV, XVI, bias, 138,310-311,313-314,325-326,
604
XVIII, 4, 10, 14-15, 19,21-22,41, 328,354
43-52,54-63,65,67-69,97, 101, aprendizaje P AC (Probabilistic
Approximate Correct), 153-154 binarizacin, 145,312,325,376,493,
131-132,149,514,557,575,583, Vase tambin numerizacin
587-590,593-594,597,601-602,605, aprendizaje por consultas, 150
Apriori. Vase algoritmo Apriori bodega de datos. Vase almacn de
615-616,618,640,648 datos
AQ, 289, 297
arquitectura de, 49 bolsa de palabras. Vase vector de
rboles de decisin, 5, 12, 15, 19,25,27,
carga de, 59 palabras
30-32,75,78,88,90,93,99-100,
granularidad de, 49
652 Introduccin a la Minera de Datos
boosting, 298, 487, 489-492, 497-499, de elasticidad, 169 cruce, 34, 386-389, 392, 394, 396, 398,
612-613,619,623,630,634,637, VIF. Vase Variance Inflation Factor 401,414,450
641,643,646-647 (VIF)
'Iil bootstrapping, 37, 115,466,477,488 colinealidad, 181-184, 192, 194,209
1
bsqueda de proyecciones exp10ratorias, combinacin de modelos, 160,459,485- (])
I EPP (Exploratory Projection Pursuit), 486, 500, Vase tambin mtodos
1I Darwin. Vase Oracle Data Mining
82,345-346 multiclasificadores
Suite
11
tambin mtodos basados en estimador de probabilidades, 140-141, hiperplano, 158, 160, 188,354-355,
distancia 202,294,472-473,492 357-360,362,364,378,426
aproximacin relacional, 320 ETL. Vase sistema ETL hiptesis MAP, 259-260, 478
de Mahalanobis, 207, 423, 444 evaluacin, 36-37, 150,459,461-462, histograma, 15,65-67,70-72,77,103,
de Chebychev, 423 612, Vase tambin subajuste, Vase 122,187-188,193,229,612,616
de enlace, 438, 440 tambin sobreajuste HOLAP,55
de Manhattan, 423 anlisis ROe. Vase anlisis ROC hubo Vase concentradores
del coseno, 423 basada en costes, 462, 467, 472 HUGIN,279
eucldea, 34, 422, 429-430, 432-433, bootstrapping. Vase bootstrapping Hyperion Enterprise, 56
449,535 conjunto de entrenamiento. Vase
distribucin conjunto de entrenamiento
binomial, !l6, 195, 197 conjunto de prueba. Vase conjunto 1
de Poisson, 123, 195 de prueba
del error, 195 de la clasificacin, 462 IBM Intelligent Miner for Text, 568
gamma, 195 de la regresin, 476 ID3, 146, 148,262-263,275,284-285,
normal, 94, 174, 187, 194-195,205, del agrupamiento, 480 297,529,611
235,261-262,466 reglas de asociacin, 481 ILP. Vase programacin lgica
DMQ, 127 validacin cruzada. Vase validacin inductiva
Document Object Model, 559, Vase cruzada incremental, 116, 149,240,242,292,
tambin XML executive information system, 46, 62, 333,377-378,454,520-521,602
Document Type Definition, 510, 512 102,588-589,640 incrementalidad, 520, 602, Vase
dril!, 48, 52-54, 57, !lO, 619 expectation maximization. Vase tambin incremental
DSS. Vase sistemas para la toma de algoritmo EM (Expectation IND, 295, 297
decisin Maximization) induccin. Vase aprendizaje inductivo
DTD. Vase Document Type Definition explanation-based leaming, 149 Informix, 56, 616
exploratory data analysis, 102, 630, 649 inteligencia artificial, XIII, 15,257,263,
expresividad de modelos, 116, 137-138, 383,455,547
p, 153, 155-157, 159-163,258,285, Intelligent Miner. Vase DB2 lntelligent
Miner
312,315,317,320,341,403,405,
Easy NN-plus, 622 443 interfaces visuales, 103, 131-133
EDA. Vase exploratory data analysis Extensible Markup Language (XML), I
eigenvalues, 80-81, 84, 199 XV, XVII, 59,131,317,325,510, I
eigenvectors, 80, 82
512,523,525,547,550-551,559,
J
EIS. Vase Executive Information 11
590,617
Systems jBNC, 275, 279
extraccin de informacin, 540, 548
EL VIRA, 275, 278, 635 jerarqua de conceptos, 247-248
ensemble methods. Vase mtodos j-cuadrado, 123,245-246,254,293
multiclasificadores p
Enterprise Miner, XIV
entropa, 90, 287, 400, 480, 483 factorizacin, 82
1(
EPP. Vase bsqueda de proyecciones FFOIL,289,326 K medias, 147,341,421,428,432,434-
exploratorias Fisher. Vase discriminante de Fisher
436,444,448,455,458,494,509,
ERP (Enterprise Resource Planning), fitness. Vase medida de adaptacin 610,612-613,616,620
579,588-589 FOIL, 289, 310, 323, 326, 646
K vecinos. Vase vecinos ms prximos
error cuadrtico medio, 19,26, 38, 99, forest, 487
Kepler, 591, 614-615
150,159,167,179-180,183,222, funciones de activacin, 332-334 kemel. Vase ncleo
293,332,340-341,476,584 funciones de densidad, 194, 204, 206, Kohonen. Vase mapa de Kohonen
error cuadrtico relativo, 476 225,229-230,232,440-442,480
error esperado, 286, 291, 298 funciones de puntuacin, 150
escalabilidad, 15,297,326,395,399, L
419,597,601-602,605
escalado
multidimensional, 65-66, 94-95
q Laplace, correccin de, 116-117, 261-
262,294,473
softmax o sigmoidal, 77, 93-94 GainRatio, 287, 297
lenguajes de consulta, 10,41,44,97,
espacio de caractersticas, 347, 355, 357, generalizacin menos general, 310, 326
126-127,130,316-317,605
361-367,376-377 GID3,287
Igg. Vase generalizacin menos general
estadstica, XIII, XIV, XV, XVI, 3, 5, GOLEM, 310, 326
LIBSVM, 363, 378-379
15,27,86,92, 104, 135, 139, 146, grfica de dispersin, 72-73
limpieza de datos, 23, 43, 67,103,144,
162,165-166,203,212,221,236, 582,587,605
245,263,344,353,383,428,461, J{ link distance. Vase distancia de enlace
535,540, 575, 592-593, 598; 606, LINUS, 89, 304, 315-316, 642
609,621,623, Vase tambin LISP, 161
Hebb. Vase aprendizaje de Hebb
modelizacin estadstica lgica difusa, 35, 383-384, 403, 405-407,
herramientas de minera de datos. Vase
estadstico PRESS. Vase PRESS 409,417-419,623
sistemas de minera de datos
654 Introduccin a la Minera de Datos
,
ndice analtico 655
v 182,191-192
varianza, XV, 75, 80-85, 93, 165, 171-
458,497-499,591,609,613-614
Winrosa, 418
172,174-175,177,179-181,183-185,
VCI. Vase repositorio VCI
VML, 58,131
189-191,199,208,215-216,221,
224-225,293,345,465,535
x
underfitting. Vase subajuste vecinos ms prximos, 34, 77, 147-148,
undersampling. Vase submuestreo Xelopes, 458, 591, 610
155,216,224,316,322,421,425, XMI.. Vase Extensible Marlrup
427,442~,446,448,455,458, Language
o/ 481,613-614,616
vector de palabras, 372, 375, 552-554
validacin cruzada, XIV, 15,36,150, vista minable, 10,22,55,97-101,103, 'Y
180-181,210,222-224,230,368, 107-109,111-112,119,125-129,131,
380,464-466,477,496-498
138,302,304,582,602,617 Yale,417
visualizacin
validacin simple, 36
posterior, 103,504,506