1) Part 1/6
Philippe Bajoit
The subject
Document databases
17 November 2011
An example
Hervé Jamar (MR), Secretary of State for the Modernisation of Finance and the Fight against Tax Fraud
Philippe Bajoit
Licence (degree) in Computer Science, ULg, 1991
1993-1999
Cognitive Systems Europe, BIM: NLP, PDF
CapGemini: generalist
2000-2009
Wang, Getronics, Fujitsu Services, KPN: document management
2009-present
ULg: HR software architecture
Networks
facebook
LinkedIn http://be.linkedin.com/in/philippebajoit
Agenda
6 sessions, Thursday mornings, 2 hours each
Assessment
Open book
Practical exercises
Reflection questions
IT: unavoidable
Operations
Consulting
The concept of a document
The concept of a document
Definition
Physical document
Digital document, metadata
Characteristics of the digital document:
heterogeneous, distributed, evolving, access rights, life cycles, long-term preservation, archives
The concept of a document: Definition
A set formed by a medium and a piece of information, generally recorded permanently, such that it can be read by humans and by machines.
Functions:
Preserving information
Communicating information
Many media:
Paper
Microfiche, audio, digital
Types:
Book (general)
Monograph
Complete works of ..
Reference work
Grey literature
Metadata
Data that describes other data
Applies to documents (to be verified)
Essential for digital documents
The concept of a document: The digital document
Put simply, a document is defined by:
its content
data that describe it
Document
Content (text, images, other content)
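The "content plus descriptive data" view can be sketched in a few lines of Python. This is only an illustration: the class name `Document`, its field names, and the metadata keys are assumptions, not part of the course material.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    """A digital document: some content plus the data that describe it."""
    content: bytes                                # text, images, or any other content
    metadata: dict = field(default_factory=dict)  # descriptive data (hypothetical keys)


invoice = Document(
    content=b"Invoice no. 42",
    metadata={"title": "Invoice 42", "language": "fr", "format": "text/plain"},
)
print(invoice.metadata["title"])  # Invoice 42
```

Keeping the metadata in a separate field means it can be indexed and searched without opening the content itself.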
Versions
Getronics Confidential
Examples:
JSR170: version (single kind)
Nuxeo: version created by choice (optional, minor or major) at check-in
FileNet P8: minor version created at check-out, major version by choice
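A minimal sketch of the check-out / check-in cycle with a choice of version bump at check-in, loosely following the Nuxeo line above. The class `VersionedDocument`, its method names, and the `major.minor` label format are hypothetical; none of this is the API of Nuxeo, FileNet P8, or JSR170.

```python
from dataclasses import dataclass, field


@dataclass
class VersionedDocument:
    """Hypothetical model: a version label is 'major.minor'."""
    content: str = ""
    major: int = 0
    minor: int = 0
    checked_out: bool = False
    history: list = field(default_factory=list)  # labels of the versions created

    def check_out(self) -> None:
        if self.checked_out:
            raise RuntimeError("document is already checked out")
        self.checked_out = True

    def check_in(self, new_content: str, bump: str = "none") -> str:
        """Store new content; bump is 'none', 'minor' or 'major'."""
        if not self.checked_out:
            raise RuntimeError("check out the document before checking it in")
        if bump not in ("none", "minor", "major"):
            raise ValueError(f"unknown bump: {bump!r}")
        self.content = new_content
        if bump == "major":
            self.major, self.minor = self.major + 1, 0
        elif bump == "minor":
            self.minor += 1
        self.checked_out = False
        label = f"{self.major}.{self.minor}"
        if bump != "none":
            self.history.append(label)  # only real version bumps are recorded
        return label


doc = VersionedDocument()
doc.check_out()
print(doc.check_in("first draft", bump="minor"))    # 0.1
doc.check_out()
print(doc.check_in("approved text", bump="major"))  # 1.0
```

A system that creates the minor version at check-out instead, as the FileNet P8 line describes, would move the minor increment into `check_out`.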
BREAK
Questions?
Indexing and metadata
Metadata and indexing
Data that describes other data (the document)
OO formalism
Usefulness of metadata
Functions
Value types (e.g. single value, multiple value)
Links between documents
Object-oriented (OO) formalism
Encapsulation means that an object contains both data and the methods needed to manipulate that data.
Example: Assume we have a "title" object. It would contain the actual words of the title, information about the fonts used, and the methods needed to create, delete, display, print and edit the title. Wherever you chose to plug in this object, you would be able to use the built-in methods that came along with the data.
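The "title" object described above might look like this in Python. The class name, attribute names, and default font are illustrative assumptions; only a subset of the methods listed on the slide is shown.

```python
class Title:
    """Bundles the words of a title, its font data, and the methods that use them."""

    def __init__(self, words: str, font: str = "Helvetica", point_size: int = 24):
        self.words = words
        self.font = font
        self.point_size = point_size

    def edit(self, new_words: str) -> None:
        """Replace the words of the title."""
        self.words = new_words

    def display(self) -> str:
        """Return a textual rendering of the title with its font information."""
        return f"[{self.font} {self.point_size}pt] {self.words}"


title = Title("Document databases")
title.edit("Document databases, part 1")
print(title.display())  # [Helvetica 24pt] Document databases, part 1
```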
Object-oriented (OO) formalism
Inheritance means that once you have defined one type of object, you can define any number of derived object types (sons or daughters), which by default inherit all the characteristics of the parent object type.
Example: Starting with the "title" object type, you could easily derive a "subtitle" object type that inherits all the methods of its parent. You could then introduce a small change in the display and print methods so that a smaller point size is used.
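A minimal Python sketch of the title/subtitle derivation, under the assumption that the point size is stored as a class attribute; all names and sizes are illustrative.

```python
class Title:
    point_size = 24  # default size used by display and print_out

    def __init__(self, words: str):
        self.words = words

    def display(self) -> str:
        return f"[{self.point_size}pt] {self.words}"

    def print_out(self) -> str:
        return f"PRINT {self.display()}"


class Subtitle(Title):
    # Everything is inherited from Title; only the point size changes,
    # which affects both display and print_out.
    point_size = 18


print(Title("Metadata").display())        # [24pt] Metadata
print(Subtitle("Dublin Core").display())  # [18pt] Dublin Core
```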
Object-oriented (OO) formalism
Hierarchy of object types or classes: Perhaps starting with "the mother of all text objects", an object type called "word", you could define an entire hierarchy of text objects, all sharing a number of characteristics and separated by specific, limited differences.
Example: Users of Word for Windows will probably recognize this way of working from the way "styles" are defined in that product.
Object-oriented (OO) formalism
Polymorphism means that a particular method can have the same name across many different object types, yet work differently depending on the object it is called on. As a user, you are not required to know about these differences.
Example: You would be able to call the print method for the "title" and "subtitle" objects without worrying about any differences in their characteristics.
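The same idea as a short Python sketch: the caller invokes one method name and each object type supplies its own behaviour. The class and method names and the output format are assumptions made for the illustration.

```python
class Title:
    def __init__(self, words: str):
        self.words = words

    def print_out(self) -> str:
        return f"24pt: {self.words}"


class Subtitle(Title):
    def print_out(self) -> str:
        # Same method name as the parent, different behaviour.
        return f"18pt: {self.words}"


items = [Title("Metadata"), Subtitle("Dublin Core")]
# The loop does not need to know which concrete type it is handling.
lines = [item.print_out() for item in items]
print(lines)  # ['24pt: Metadata', '18pt: Dublin Core']
```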
Dublin Core
Dublin Core Metadata Initiative (1995) http://dublincore.org/
Aims to give web documents universal metadata
Few constraints (e.g. no mandatory element, all elements repeatable)
Slow adoption
Extensible
DC elements
Title, Creator, Subject, Description, Publisher, Contributor, Date, Type, Format, Identifier, Source, Language, Relation, Coverage, Rights
HTML
<meta name="DC.subject" content="fruits de mer" />
<link rel="DC.relation" hreflang="en" href="http://www.example.org/en/" />
<link rel="DC.relation" hreflang="de" href="http://www.example.org/de/" />
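Tags of this form can be generated from a plain mapping of element names to values. The following Python sketch assumes a hypothetical helper `dc_meta_tags` and only covers `<meta>` elements, not the `<link>` form; values are escaped with the standard library's `html.escape`.

```python
from html import escape


def dc_meta_tags(fields: dict) -> list:
    """Build one <meta> tag per Dublin Core element (hypothetical helper)."""
    return [
        f'<meta name="DC.{escape(name, quote=True)}" '
        f'content="{escape(value, quote=True)}" />'
        for name, value in fields.items()
    ]


tags = dc_meta_tags({"subject": "fruits de mer", "language": "fr"})
for tag in tags:
    print(tag)
# <meta name="DC.subject" content="fruits de mer" />
# <meta name="DC.language" content="fr" />
```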
DC extensions
Qualifiers add precision to an element
<meta name="DC.description" content="description" />
<meta name="DC.description.abstract" content="This article describes the work of the IFB Chaos Committee, including a summary of its major findings." />
<meta name="DC.description.tableOfContents" content="Introduction; Vertebrates; Invertebrates; Molluscs" />
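Reading such tags back is a matter of splitting the `name` attribute on dots: the second part is the element and an optional third part is the qualifier. The sketch below uses Python's standard `html.parser`; the class name `DublinCoreParser`, the tuple layout, and the sample input are assumptions.

```python
from html.parser import HTMLParser


class DublinCoreParser(HTMLParser):
    """Collects (element, qualifier, content) from DC <meta> tags."""

    def __init__(self):
        super().__init__()
        self.records = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name") or ""
        if not name.startswith("DC."):
            return
        parts = name.split(".")  # e.g. ["DC", "description", "abstract"]
        element = parts[1]
        qualifier = parts[2] if len(parts) > 2 else None
        self.records.append((element, qualifier, attributes.get("content", "")))


parser = DublinCoreParser()
parser.feed(
    '<meta name="DC.description" content="description" />'
    '<meta name="DC.description.abstract" content="A short summary." />'
)
print(parser.records)
# [('description', None, 'description'), ('description', 'abstract', 'A short summary.')]
```

An application that does not understand a qualifier can still fall back on the unqualified element, since the element name is always the second part.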
Taxonomy
Taxonomy
Originally, the classification of living species
Actions:
Naming, classifying
Results:
Classification, hierarchy
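Naming and classifying produce a hierarchy in which each named entry has exactly one parent. A minimal Python sketch, using the biological classification of the domestic cat as sample data; the `PARENT` mapping and the `lineage` function are illustrative names.

```python
# Each taxon points to its parent; None marks the root of the hierarchy.
PARENT = {
    "Animalia": None,
    "Chordata": "Animalia",
    "Mammalia": "Chordata",
    "Carnivora": "Mammalia",
    "Felidae": "Carnivora",
    "Felis": "Felidae",
    "Felis catus": "Felis",
}


def lineage(taxon: str) -> list:
    """Return the path from the root of the taxonomy down to the given taxon."""
    path = []
    while taxon is not None:
        path.append(taxon)
        taxon = PARENT[taxon]
    return list(reversed(path))


print(" > ".join(lineage("Felis catus")))
# Animalia > Chordata > Mammalia > Carnivora > Felidae > Felis > Felis catus
```

The same structure applies to documents: replacing the taxa with document classes gives the filing path of a document type.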
Taxonomy
Information-gathering process
Taxonomy: Example
Example: OHIM, Office for Harmonization in the Internal Market
Intellectual property domain (trade marks and designs, i.e. excluding patents)
Supports the classification of documents
Thesaurus
Thesaurus
Gives meaning to a controlled vocabulary
Generally specific to one domain
Link with metadata: vocabulary
Link with taxonomy: meaning + relations
Useful for searches
Different from spelling corrections (e.g. Google)
Synonyms
Corporate memory vs Organization memory
Thesaurus
Terms
Relations between terms:
Narrower term (TS, terme spécifique)
Broader term (TG, terme générique)
Related term (TA, terme associé)
Rejected, non-preferred term (TR, terme rejeté)
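The four relations can be sketched as a small Python data structure, together with one use in search: a rejected term is replaced by its preferred term, and the query is widened to the narrower terms. The class names, method names, and sample vocabulary are all assumptions made for the illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    label: str
    broader: list = field(default_factory=list)   # TG
    narrower: list = field(default_factory=list)  # TS
    related: list = field(default_factory=list)   # TA
    rejected: list = field(default_factory=list)  # TR


class Thesaurus:
    def __init__(self):
        self.terms = {}      # preferred label -> Term
        self.preferred = {}  # rejected label -> preferred label

    def term(self, label: str) -> Term:
        return self.terms.setdefault(label, Term(label))

    def add_hierarchy(self, broader: str, narrower: str) -> None:
        self.term(broader).narrower.append(narrower)
        self.term(narrower).broader.append(broader)

    def add_related(self, a: str, b: str) -> None:
        self.term(a).related.append(b)
        self.term(b).related.append(a)

    def add_rejected(self, preferred: str, rejected: str) -> None:
        self.term(preferred).rejected.append(rejected)
        self.preferred[rejected] = preferred

    def expand(self, word: str) -> list:
        """Search terms for a query word: the preferred term plus its narrower terms."""
        label = self.preferred.get(word, word)
        if label not in self.terms:
            return [word]
        return [label] + self.terms[label].narrower


thesaurus = Thesaurus()
thesaurus.add_hierarchy("seafood", "oyster")
thesaurus.add_hierarchy("seafood", "mussel")
thesaurus.add_related("seafood", "fishing")
thesaurus.add_rejected("seafood", "fruits of the sea")
print(thesaurus.expand("fruits of the sea"))  # ['seafood', 'oyster', 'mussel']
```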
Thesaurus
Root terms (top terms)
Hierarchical access
Thesaurus
TG + TS
Thesaurus
Rejected terms
Thesaurus
Annotations (notes)
Thesaurus
Alphabetical access
Thesaurus
Export according to specifications
Exercises
Exercise 1
Indexing a document
Commercial invoice
Discovering the metadata
Taxonomy (classification)?
Representing the processing (i.e. who must handle this document within the company): via a metadata field, or via another system?
Exercise 2
Document model
Translations
Multilingual model (undetermined number of languages)
The document to be translated has its own metadata
Notion of original document (i.e. written by the author, not by the translator)
Notions of versions
Thank you
Licence
http://creativecommons.org/licenses/by-nc-sa/2.0/be/