T. J. Taranto
C. R. Pitcher
Final Report
National Library of Australia Cataloguing-in-Publication data:
Bibliography.
Includes index.
ISBN 9781921232619 (pbk.)
ISBN 9781921232626 (pdf)
1. Marine sciences - Research - Queensland - Torres Strait Islands. 2. Torres Strait Islands (Qld.).
I. Pitcher, C. R. (Clifford Roland). II. Cooperative Research Centre for Torres Strait. III. CSIRO.
Marine and Atmospheric Research. IV. Title.
551.4609943
Citation:
Taranto, T. J. and C. R. Pitcher (2007). CRC Torres Strait Task 5.2: Data and Information
Management. Final Report for CRC Torres Strait. CSIRO Marine and Atmospheric Research,
Cleveland. pp.42.
DISCLAIMER
CSIRO has taken all reasonable steps to ensure that the information contained in this publication is
accurate at the time of publication. Readers should ensure that they make appropriate inquiries to
determine whether new information is available on the particular subject matter.
CRC Torres Data and Information Repository i
June 2007
Tom Taranto
Roland Pitcher
CSIRO Marine and Atmospheric Research
233 Middle St, Cleveland, Qld.
ACKNOWLEDGEMENTS
The compilation of data and research information into the Torres Strait Marine Research repository
was achieved through the collaboration of many research agencies. The contributions by AFMA,
AIMS, JCU, QDPI and the TSRA along with the funding by the CRC Torres Strait and the CSIRO
Division of Marine and Atmospheric Research are acknowledged. The continued support by the Reef
and Rainforest Research Centre (RRRC) and the CSIRO Marine and Atmospheric Research (CMAR)
Data Centre will ensure that marine research in the Torres Strait will have an extensive internet based
library of searchable information and data to draw upon into the future.
TABLE OF CONTENTS
ACKNOWLEDGEMENTS
TABLE OF CONTENTS
FIGURES
TABLES
NON-TECHNICAL SUMMARY
PROJECT: Task 5.2 Data and Information Management
PRINCIPAL INVESTIGATOR: Tom Taranto
CO-INVESTIGATOR: Roland Pitcher
ADDRESS:
OBJECTIVES:
NON-TECHNICAL SUMMARY:
Achievements and Outcomes against the objectives (2006 - 2007)
Utilisation and Application of the Research (2006 - 2007)
Publications (2006 - 2007)
Outcomes Achieved
1. INTRODUCTION
1.1. BACKGROUND
1.2. NEED
1.3. OBJECTIVES
2. METHODS
2.1. Facilitate the collation of CRC-TS Intellectual Property (IP)
2.2. Develop a searchable repository website
2.3. Coordinate the moderation and listing of sensitive data and publications
3. RESULTS
3.1. Facilitate the collation of CRC-TS Intellectual Property (IP)
3.2. Develop a searchable repository website
3.3. Coordinate the moderation and listing of sensitive data and publications
4. DISCUSSION
4.1. Facilitate the collation of CRC-TS Intellectual Property (IP)
4.2. Develop a searchable repository website
4.3. Coordinate the moderation and listing of sensitive data and publications
5. BENEFITS
6. FURTHER DEVELOPMENT
7. ACHIEVEMENT OF OUTCOMES
8. CONCLUSIONS
9. RECOMMENDATIONS
10. REFERENCES
11. ABBREVIATIONS & GLOSSARY
12. APPENDIX 1: INTELLECTUAL PROPERTY
13. APPENDIX 2: TASK MANAGEMENT LISTING
14. APPENDIX 3: STAFF
FIGURES
Figure 3-1. Index page showing custom repository search tool and Marlin Metadata Search tool
Figure 3-2. Repository custom Google search result page
Figure 3-3. Direct access to CMAR Data Centre
Figure 3-4. Additional page to search other websites
Figure 3-5. Direct access to other Australian Data Centres
Figure 3-6. Search libraries
Figure 3-7. Custom Google search on contributing web domains
TABLES
Table 3-1. Status of CRC Torres IP works lodged (as at 23 Mar 2007) by project
Table 3-2. Status of CRC Torres IP works lodged (as at 23 Mar 2007) by Task level
NON-TECHNICAL SUMMARY
PROJECT: Task 5.2 Data and Information Management
ADDRESS:
CSIRO Marine and Atmospheric Research
233 Middle St, Cleveland, 4163
Ph: 07 3826 7259 Fax: 07 3826 7222
Email: tom.taranto@csiro.au
OBJECTIVES:
1. To facilitate the collation of CRC-TS Reports, metadata and available associated data from
Principal Investigators in a standard format where possible.
2. To develop a searchable website on a secure repository containing linked Reports, metadata
and available associated data (for limited access where appropriate).
3. To coordinate the moderation and listing of sensitive data and publications for the project.
NON-TECHNICAL SUMMARY:
The CRC Torres Strait Data and Information Management Task was commissioned in June 2006, with
the involvement of a CRC TS Steering Committee, to capture publications and data produced by the
CRC Torres Strait. Preliminary work on the project began in June 2006. As the CRC Torres Strait
wound up in July 2006, the Contract and IP collected by this project have since been transferred to the
Reef and Rainforest Research Centre (RRRC).
A product of the project was to provide a website where all available CRC Torres Strait marine related
research information can be accessed by all stakeholders. The website has been established and
populated with publications and data as received from past CRC Torres Strait Task Leaders. The
website and associated holdings are hosted by the CSIRO Marine and Atmospheric Research Data
Centre, to be maintained in perpetuity.
During the collection phase of the Task, progress reports were provided by way of fortnightly emails
(up to 18th Dec 2006) to all CRC Task Leaders detailing the status of the collection of works and
requesting that outstanding works be lodged to the repository. Due to the slow response by Task
Leaders in lodging their research works, this Project was granted an extension to provide additional
opportunity for Task Leaders to lodge their CRC Torres Strait works. At the time of drafting this
Final Report (23 March 2007), only 91 of the 187 identified works had been received; 96 remained
outstanding (Table 3-1).
An important objective of this Task was to identify sensitive material. Effective procedures were
developed in association with Torres Strait Regional Authority (TSRA) staff to ensure appropriate
access constraints are enforced for all CRC works submitted to the repository.
All available literature and data are now online at http://www.cmar.csiro.au/DataCentre/torres,
accessible using customised search engines. Following final feedback, the website will be promoted
to other Torres Strait agencies and libraries for inclusion on their websites.
Outcomes Achieved
The Torres Strait Marine Research Repository provides stakeholders of the Torres Strait both a
secure repository of past research efforts and a utility that searches both this repository and other
information repositories related to the Torres Strait marine environment.
The collation of CRC-TS reports, metadata (and available associated data) from Principal
Investigators (PIs) is the foundation of the Torres Strait Marine Research Repository. All lodged
literature is in standard pdf format with all submitted metadata adhering to the Australian ANZLIC
standard. All CRC-TS literature and data has been vetted by the TSRA for sensitivity, and
appropriate internet website security options implemented. Both metadata and associated data are
maintained by the CSIRO Marine and Atmospheric Research Data Centre.
In addition to providing a customised searchable website that links to the Repository of CRC-TS
outputs (literature and data), the website provides a searchable interface to other non-CRC research
works and data sources related to Torres Strait marine resources.
By promoting the Repository and its search capabilities this project benefits not only future research
in the region but also the communities that depend on its resources.
1. INTRODUCTION
1.1. BACKGROUND
This CRC Torres Strait Task was initiated in response to a request from the CRC TS Board during
June 2005 regarding the issue of data archiving and management of information arising from CRC TS
research tasks. Following that request, CSIRO Marine and Atmospheric Research conducted a
preliminary project to scope the requirements to address the CRC TS Board's needs.
1.2. NEED
The scoping project identified that the CRC TS had contracted over 24 projects that were expected to
produce data and final reports/theses. There were also several AFMA contracted projects conducted
since the completion of the AFMA TS Reports and Data Archive (Taranto, 2004). A need was
identified to capture these reports, data and metadata before some CRC TS Task Leaders dispersed
and/or became difficult to contact.
The CRC Torres Strait managed a set of Research Tasks under a co-ordinated Research Plan due to
complete by mid 2006. It was identified that the principal tasks of a Data Management Task would be
to facilitate the entry of metadata and collection of each of the individual CRC TS Task datasets,
facilitate the production of reports in a standard PDF format, and work with each PI to get the actual
data and reports lodged into a central system.
It was also identified that a web site would need to be developed to maximise future distribution of
collected information and data — preferably linked under the Torres Strait Regional Authority web
site (www.tsra.gov.au) — containing searchable metadata, the PDF reports as a searchable document
library and access to the actual data for direct downloading where possible and appropriate. All
content was to be moderated for privacy and cultural sensitivities and published under the appropriate
restrictions as defined by the TSRA. A companion DVD was identified as an additional or alternative
product that could also be developed, but this was not progressed.
1.3. OBJECTIVES
The original objectives were:
1. To facilitate the collation of CRC-TS Reports, metadata and available associated data from
Principal Investigators (PIs) in a standard format where possible.
2. To develop a searchable website on a secure repository containing linked Reports, metadata and
available associated data (for limited access where appropriate).
3. To coordinate the moderation and listing of sensitive data and publications for the project.
2. METHODS
2.1. Facilitate the collation of CRC-TS Intellectual Property (IP)
To address the objective of collating CRC-TS Reports, metadata and available associated data from
Principal Investigators in a standard format where possible, a number of facilitation services were
provided to CRC TS Task Leaders.
Immediately after the CRC TS emailed all Task Leaders to advise them that this Data and Information
Management Task had begun and to request their response, the Task produced and distributed a CRC
Torres Strait Final Report template (on 13 June 2006) to encourage a common report format. The
template was designed in consultation with CMAR graphic designers and CRC TS staff. This Final
Report adheres to that design template.
At the commencement of the Task, extensive research and interviews were conducted with CRC Task
Leaders and other CRC TS project staff to establish the extent of IP works attributed to the CRC TS.
This IP inventory became the basis of coordinating the lodgment of Task outputs to the Data and
Information Repository. See APPENDIX 1: Intellectual Property.
To promote ongoing lodgment of CRC TS IP works, all Task Leaders were emailed status reports
fortnightly from mid September 2006 to end December 2006. Each status report highlighted the need
for Task Leaders to lodge IP works to the repository and included instructions on the agreed lodgment
process. In addition, all Task Leaders were advised of the preference to lodge reports in PDF format
based on the CRC report template that was specifically developed for the exercise.
Due to the lower than expected lodgment of CRC IP works, all Task Leaders were personally contacted
during December 2006 and an extension of this Task's Milestone Report (from end December 2006 to
February 2007) was sought to provide further time for Task Leaders to lodge their IP works. In
addition to the above status reports, and to further facilitate IP lodgment, each Task Leader was
personally informed that they could simply lodge their works by email directly to the Repository
Administrator. Though discussions with Task Leaders were positive, at the time of drafting this report
(23 March 2007) there were still a significant number of identified works yet to be submitted to the
repository.
Another addition to the publications and metadata being collected from the CRC Torres Strait Program
is the likely inclusion of publications and metadata as published on the AFMA Torres Strait Research
DVD (Taranto, 2004). During September 2006 the then AFMA Director (Richard McLoughlin) was
approached to enquire if AFMA was agreeable to releasing copyright to selected works of the AFMA
Torres Strait Research DVD. After receiving a positive response, CSIRO Legal services were
requested to draft the appropriate copyright permissions to present to AFMA. It was envisioned that
conditional on AFMA granting permission, the addition of the 96 publications dating back to 1980 and
approximately 30 metadata statements, from this DVD, to the repository website would provide
stakeholders ready access to an extensive library of information to facilitate planning, management,
and research within the Torres Strait region.
Table 3-1. Status of CRC Torres IP works lodged (as at 23 Mar 2007) by project. Note that the number of
Reports also includes the Final (Web) Reports lodged at the CRC Torres website in July 2006.
S = submitted, I = identified, for each of the eight types of work (listed alphabetically: Abstracts, Articles, Data, Metadata, Papers, Posters, Presentations, Reports). A dash marks a blank cell.
Project     S  I    S  I    S  I    S  I    S  I    S  I    S  I    S  I   SUBMITTED  IDENTIFIED  OUTSTANDING
Project 1  22 23    1 13    7 15    8 15    0  0    1  1    4  5    2  2          45          74           29
Project 2   8  9    0  8    0  7    0  7    5  5    0  0    0  0    0  2          13          38           25
Project 3  10 15    0  1    0  8    0  8    0  0    0  0    0  0    7  7          17          39           22
Project 4   7 10    0  5    1  3    2  3    0  0    0  0    0  0    0  0          10          21           11
Project 5   4  4    0  0    2  2    0  2    0  0    0  0    0  0    0  0           6           8            2
Unknown     -  -    0  7    -  -    -  -    -  -    -  -    -  -    -  -           0           7            7
TOTAL      51 61    1 34   10 35   10 35    5  5    1  1    4  5    9 11          91         187           96
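The totals in Table 3-1 can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it copies the three TOTAL columns (submitted, identified, outstanding) for each project from the table, and the names `TOTALS` and `summarise` are hypothetical, chosen for this example.

```python
# Per-project totals copied from Table 3-1: (submitted, identified, outstanding).
TOTALS = {
    "Project 1": (45, 74, 29),
    "Project 2": (13, 38, 25),
    "Project 3": (17, 39, 22),
    "Project 4": (10, 21, 11),
    "Project 5": (6, 8, 2),
    "Unknown": (0, 7, 7),
}


def summarise(totals):
    """Check each row, then return overall (submitted, identified, outstanding, rate)."""
    for name, (sub, ident, out) in totals.items():
        # Outstanding works should equal identified minus submitted.
        assert ident - sub == out, name
    submitted = sum(row[0] for row in totals.values())
    identified = sum(row[1] for row in totals.values())
    outstanding = sum(row[2] for row in totals.values())
    return submitted, identified, outstanding, submitted / identified


submitted, identified, outstanding, rate = summarise(TOTALS)
# Prints: 91 of 187 works lodged (48.7%); 96 outstanding
print(f"{submitted} of {identified} works lodged ({rate:.1%}); {outstanding} outstanding")
```

The per-project rows add up to the TOTAL row of the table, and the overall lodgement rate works out at just under half of the identified works.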
The lower than expected use of the Report template may also be associated with the late commencement
of the project. Although the template was provided to all Task Leaders (on 13/06/2006) in advance
of the milestone date, this closely coincided with the Final Reporting deadline for CRC Torres
Strait Tasks. Some Task Leaders had already completed their reports, or their reports were well
underway, so many of them had insufficient time to adopt the template. The majority of
Reports lodged were based on either existing agency templates or the MSWord normal template.
Table 3-2. Status of CRC Torres IP works lodged (as at 23 Mar 2007) by Task. Note that the number of Reports
also includes the Final (Web) Reports lodged at the CRC Torres website in July 2006.
Columns: Task; a submitted and an identified count for each of the eight types of work (listed alphabetically: Abstracts, Articles, Data, Metadata, Papers, Posters, Presentations, Reports); then TOTAL SUBMITTED, TOTAL IDENTIFIED and TOTAL OUTSTANDING. Rows show only the non-blank cells.
1.1 3 3 0 4 0 2 0 2 2 2 5 13 8
1.2 2 2 0 1 0 2 3 1
1.3 2 2 0 2 0 2 0 2 2 8 6
1.4 2 2 0 1 1 1 1 1 1 1 5 6 1
1.5 2 2 0 1 1 1 0 1 3 5 2
1.6a 2 2 1 3 5 5 4 5 1 1 2 2 15 18 3
1.7 1 1 1 1 0
1.8 2 2 0 1 1 1 0 1 3 5 2
1.11 2 2 0 2 2 2 4 6 2
1.13 2 2 1 1 3 3 0
1.14 1 1 0 1 0 1 0 1 1 4 3
1.15 0 1 0 1 1
1.16 1 1 1 1 0
Prj 1 22 23 1 13 7 15 8 15 0 0 1 1 4 5 2 2 45 74 29
2.1 1 2 0 1 0 0 1 3 2
2.2 5 5 0 5 0 5 0 5 5 5 0 2 10 27 17
2.3 2 2 0 2 0 2 0 2 2 8 6
Prj 2 8 9 0 8 0 7 0 7 5 5 0 0 0 0 0 2 13 38 25
3.1 2 2 0 2 0 2 2 6 4
3.2 2 2 2 2 0
3.3 1 2 0 1 0 2 0 2 1 7 6
3.4 3 5 0 3 0 3 3 11 8
3.5 1 1 1 1 0
3.6 1 2 7 7 8 9 1
3.7 0 1 0 1 0 1 0 3 3
Prj 3 10 15 0 1 0 8 0 8 0 0 0 0 0 0 7 7 17 39 22
4.1a 1 1 0 3 1 1 1 1 3 6 3
4.2 2 2 2 2 0
4.3 2 2 0 1 1 1 3 4 1
4.4 1 2 0 1 1 3 2
4.6 0 2 0 2 2
4.7 1 1 0 1 0 1 0 1 1 4 3
Prj 4 7 10 0 5 1 3 2 3 0 0 0 0 0 0 0 0 10 21 11
5.1 2 2 2 2 0 2 4 6 2
5.2 2 2 2 2 0
Prj 5 4 4 0 0 2 2 0 2 0 0 0 0 0 0 0 0 6 8 2
Unk 0 7 0 7 7
TOTAL 51 61 1 34 10 35 10 35 5 5 1 1 4 5 9 11 91 187 96
The additional inclusion of selected publications and metadata as published on the AFMA Torres
Strait Research DVD (Taranto, 2004) is still in progress at the time of
drafting this report. CSIRO Legal have sought permission from AFMA to load the entire contents of
the AFMA DVD onto the website, except for six publications that were copyright to agencies other than
AFMA or CSIRO. Upon receipt of AFMA copyright permission the selected works can be transferred
to the readied website.
Figure 3-1. Index page showing custom repository search tool and Marlin Metadata Search tool
6. FURTHER DEVELOPMENT
There will always remain a need to promote the repository to stakeholders of the Torres Strait. In
addition there will always be a need to facilitate either the lodgment of IP works to the repository or
hyperlinks to new and valued inventories of information managed by other research agencies. It is
only by the continued cooperation and participation of stakeholders that this search interface can
maximise research developments within the Torres Strait.
At present, the system is dependent on external applications and services being delivered by Google,
relying on the Google web trawler applications (Googlebots) to search and index individual web files.
Though prescribed commands and an efficient web design have been followed to maximise the
instance of hits by Google’s web trawlers, the Googlebots are self-managed and there is no guarantee
of a quick search of the complete repository. It has been observed that the current indexing of new
information within the repository takes between one and three weeks. This can be improved by
implementing an enterprise repository system such as DSpace.
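The dependence on Google's index can be illustrated with a site-restricted query. The sketch below is a minimal example, not the repository's actual search code: it assumes Google's public `site:` search operator and the repository address given in this report, and the function name `site_search_url` and the sample search terms are hypothetical.

```python
from urllib.parse import urlencode

# Repository address as given in this report.
REPOSITORY_SITE = "www.cmar.csiro.au/DataCentre/torres"


def site_search_url(terms, site=REPOSITORY_SITE):
    """Build a Google query URL limited to one site via the `site:` operator."""
    query = f"{terms} site:{site}"
    # urlencode percent-encodes the query so it is safe to place in a URL.
    return "https://www.google.com/search?" + urlencode({"q": query})


# Sample search terms, chosen only for illustration.
print(site_search_url("seagrass dieback"))
```

A query built this way can only return pages that Google has already indexed, which is why newly lodged works do not appear in search results until the Googlebots have visited them.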
7. ACHIEVEMENT OF OUTCOMES
The resultant Torres Strait Marine Research Repository has successfully published the CRC Torres
Strait IP works that have been lodged with Administrators. In addition, the customised search
interfaces available for future stakeholders significantly enhance the repository’s planned outcome to
provide a searchable repository of research IP works from the CRC Torres Strait.
With a defined repository now permanently maintained by the CSIRO Marine and Atmospheric
Research Data Centre, the maintenance and availability of any lodged IP - not just that of the CRC
Torres Strait - is assured, providing an enduring service to the Torres Strait research community.
The customised search interface has been developed so that it can be simply incorporated onto and/or
linked from other agencies' websites, seamlessly linking independent research efforts between
stakeholders and custodial agencies of the Torres Strait.
8. CONCLUSIONS
Overall, the outputs of the project have contributed to greater outcomes than initially anticipated. The
number of CRC IP works successfully lodged into the enduring and secure repository was 10%
higher than initially predicted and the customised search interface provides not only a search of the
repository holdings but also of other defined external repositories.
The disappointing response from CRC Principal Investigators (50%) could have been improved if this
project had commenced before the completion date of the CRC. Relating the lodgement of works to
contractual agreements and/or other pecuniary costs could also be considered.
9. RECOMMENDATIONS
A recommended priority is for the current owner of the Repository, the RRRC, to ensure all necessary
licensing requirements are in place before the repository is made available to the public at large.
Investigations have identified the ‘Creative Commons’ license — a simplified licensing regime
specifically designed for the transfer of Australian internet based information — as a possible solution
for the RRRC. The current draft website also contains a link to the CSIRO Disclaimer as an example
of current practices.
The late commencement and coincidence with Special Issue publication significantly limited
lodgement of intellectual property (IP) works into the repository; only 50% of identified IP works
were eventually lodged. It is recommended that any future research programs consider this
experience, provide timely services that help stakeholders adhere to protocols such as template
usage, and provide an effective protocol for researchers to lodge their works. It is also
suggested that any future contractual agreements include a clause that withholds final payment until all
works have been submitted.
The lodgement of outstanding IP to the repository remains the responsibility of current custodians.
The intellectual property (IP) listing (APPENDIX 1) identifies those works awaiting lodgement.
The now functional Torres Strait Marine Research Repository is a significant resource. It not only
incorporates and searches the IP works of the CRC Torres Strait, it includes search interfaces to many
other Torres Strait related repositories. Though the repository’s security and ongoing service is
guaranteed by its inclusion within the CSIRO Marine and Atmospheric Research Data Centre, it will
require periodic updates and maintenance to continue to act as a valuable resource. In addition, the
customised search interface, though currently searching many already known websites and external
repositories, will require ongoing maintenance.
It is recommended that a repository administrator be assigned with the responsibility to promote the
Torres Strait Marine Research Repository to stakeholders and other research agencies; to update
links to new inventories containing relevant marine research; and to ensure that sustainable protocols
are developed and followed by contributors to the Repository.
The implementation of an enterprise level repository also offers many benefits such as increasing the
rating by academic search engines and a guaranteed timely and comprehensive search. It is
recommended that to enhance the outcomes of the existing ‘hybrid’ search repository, consideration be
given to the implementation of the DSpace open source repository, a widely recognised research
repository.
10. REFERENCES
CatalystIT (2003). Technical Evaluation of selected Open Source Repository Solutions.
https://eduforge.org/docman/view.php/131/1062/Repository%20Evaluation%20Document.pdf (cited
January 2007).
Taranto, T.J. and C.R. Pitcher (2004). Torres Strait marine science: collected publications and data,
1980-2003. DVD. Cleveland, Qld., CSIRO Marine Research.
Contacts
CMAR Data Centre Manager – Tony Rees
TSRA Contact – Vic McGrath
CRC Torres Strait Contact – David Williams
RRRC Contact – Russell Reichelt