CODATA Task Group on Data Sources for Sustainable Development in SADC

CODATA WORKSHOP

Data Sources for Sustainable Development in SADC

14–15 May 2007

National Research Foundation, Pretoria, South Africa

Proceedings of a workshop of 14–15 May 2007 (version 5; last modified: 1 October 2007)

TABLE OF CONTENTS

EXECUTIVE SUMMARY: CODATA TASK GROUP ON DATA SOURCES FOR SUSTAINABLE DEVELOPMENT IN SADC

Day 1

SESSION 1: OPENING AND OBJECTIVES

Opening address (Dr Andrew Kaniki, Executive Director: Knowledge Management and Strategy, National Research Foundation)

Objectives of workshop (Prof. Steve Rossouw, Vice-President: CODATA and Chair: CODATA SANC)

SESSION 2: SCIENTIFIC AND TECHNICAL DATA AND INFORMATION POLICY

Panellist: Dr Sospeter Muhongo (ICSU)

Panellist: Robyn Glaser (Department of Science and Technology)

Panellist: Paul Uhlir (US National Academies, CODATA USNC)

Panellist: Dr Xola Mati (ASSAf)

Panellist: Nontando Guwa (ASSAf)

Panellist: Dr Wieland Gevers (ASSAf)

Panellist: Dr Andrew Kaniki (NRF)

Comments on presentations

Panellist: Prof. Steve Rossouw (CODATA SANC)

Questions and discussion

SESSION 3: DATA SHARING AND SPECIFIC DATA ISSUES

Panellist: Dr Antony Cooper (CSIR)

Panellists: Dr Martie van Deventer (CSIR) & Dr Heila Pienaar (University of Pretoria)

Comments on presentations

Panellist: Avinash Chuntharpursat (SAEON)

Comments on presentation

Questions and discussion

Day 2

SESSION 4: ENVIRONMENTAL AND GEOSPATIAL, HEALTH AND BIOMEDICAL, BIODIVERSITY, AND SOCIO-ECONOMIC DATA AND INFORMATION IN THE SADC REGION

Panellist: Dr XIAO Yun (Chinese Academy of Sciences and CODATA Chinese NC)

Comments on presentation

Panellist: Dr Lischen Haoses-Gorases (University of Namibia)

Comments on presentation

Panellist: Stanley Mukanganyama (Department of Biochemistry, University of Zimbabwe)

Comments on presentation

Panellist: Mika Odido (Intergovernmental Oceanographic Commission of UNESCO)

Comments on presentation

European Union Sixth Framework Programme (FP6): Dr Geoff Meese (CSIR)

Comments on presentation

Panellist: Raed Sharif (US National Academies of Sciences)

Comments on presentation

Discussion

Directory of data producers in South Africa (Henda van der Berg, NRF)

Comments on presentation

SESSION 5: ICSU-ROA AND CODATA TASK GROUPS: FOLLOW-UP ACTIONS

Panellist: Prof. LIU Chuang (Co-Chair, UN GAID e-SDDC Executive Committee, Leading Professor of Global Change Information and Research Center, Institute of Geography and Natural Resources, Chinese Academy of Sciences)

Panellist: Paul Uhlir (US National Academies of Sciences)

SESSION 6: WORKING SESSION

Preparation of working document on how the recommendations are to be implemented

Closure: Prof. Sospeter Muhongo (ICSU Regional Office for Africa)

COMMENTS RECEIVED IN RESPONSE TO THE INVITATION TO PARTICIPANTS TO SUBMIT COMMENTS AFTER THE WORKSHOP

Comment from Mr Henry Mloza-Banda, University of Malawi

APPENDIX: LIST OF ACRONYMS

EXECUTIVE SUMMARY: CODATA TASK GROUP ON DATA SOURCES FOR SUSTAINABLE DEVELOPMENT IN SADC

The CODATA Task Group on Data Sources for Sustainable Development in SADC Countries held its initial organising workshop in Pretoria, South Africa on 14–15 May 2007. The National Research Foundation hosted the workshop, which was attended by 31 delegates. The workshop was convened in collaboration with the ICSU Regional Office for Africa (ROA), the National Research Foundation (NRF) of South Africa, the CODATA Task Group on Preservation of and Access to Scientific and Technical Data in Developing Countries, the South African National Committee for CODATA, and the United States National Committee for CODATA.

The objectives of the workshop were to:

  • Consider the main recommendations from the September 2005 CODATA Workshop on Strategies for Permanent Access to Scientific Information in Southern Africa: Focus on Health and Environmental Information for Sustainable Development and discuss specific actions and initiatives that could be taken in response.
  • Designate appropriate offices/officials in the SADC countries to actively support the initiatives of the CODATA Task Group on Data Sources for Sustainable Development in SADC countries that are of regional interest.
  • Promote mutual reinforcement with the converging goals of the ICSU ROA, the SADC and NEPAD Secretariats, and other regional organisations, as appropriate, and coordinate actions in order to leverage resources.
  • Establish a work plan for the CODATA task group for 2007–2008.

The outcome of the workshop was the recommendation of the following set of actions for the period 2007–2008 for the CODATA Task Group on Data Sources for Sustainable Development in SADC:

WHAT / WHO / WHEN
1. Develop an inventory of scientific databases across SADC
  • Extension of SA Data Archives (SADA) directory of data producers in South Africa
  • With ICSU/CODATA ‘wrapper’/front end
  • Task group must determine sustainability strategy and data fields
  • Link to ICSU and CODATA outreach activities
/
  • The task group would be responsible for the workplan, proposal and raising resources with the guidance of the ICSU Regional Office for Africa
  • Funds would be needed for a full-time staff member
  • Seed funding available from CODATA must be used to source sustainable resources
  • Involve people with databases
  • Explore international and domestic sources of funding
  • Key role players:
  • ICSU Africa
  • SADC science desk
  • SA Department of Science & Technology
  • African Union
  • Collaborator per country as part of CODATA outreach and recruitment
/ Proposal: end June 2007
2. Develop an inventory of scientific researchers (data professionals) across SADC / Subset of projects 1 and 3 and related to an existing ASSAf initiative – Network of African Science Academies (NASAC)
3. Develop an inventory of online training and instructional materials on data management /
  • UN GAID e-SDDC has taken international responsibility for this initiative
  • The task group to liaise, take material on board and see it is followed up
  • Universities represented at the workshop are partners

4. Better understanding of IP and open access (cf. Chile initiative)
  • Strategy by a coalition of library and research organisations
/ Paul Uhlir to provide background information for the task group to consider and possibly set up a meeting in Washington of potential partner organisations
5. Establish open institutional repositories: tool kits, open source software, business plans, funding for institutions
  • Possibly link to NRF database of theses/dissertations
  • Link associated data on data sources and experts and possibly link with other players in a System of Systems approach
  • Develop capacity in secondary data analysis, particularly of collated data from SADC government sources
/ Potential coalition partners: Task group, IAP, ASSAf, e-IFL (Electronic Information for Libraries), higher education institutions, NRF, Association of Commonwealth Universities, Association of African Universities, science councils
Potential funders:
  • Open Society Institute
  • Shuttleworth Foundation
  • UNESCO
  • Partnership for Higher Education
  • International Development Research Centre (IDRC)
  • EU
  • Gates Foundation

6. Malaria research and environmental data and information network (pilot initiative) / The task group to liaise with other researchers regarding proposed network, to be formed through a bottom-up initiative of interested researchers/research groups
Potential funders:
  • Gates Foundation
  • Numerous others
/ SAMI to be asked to take the lead: Dr van Deventer (CSIR) and Dr Pienaar (University of Pretoria) to speak to SAMI (Prof Jane Morris)

The Task Group on Data Sources for Sustainable Development in SADC would be the lead task group for the initiative and would develop a proposal on the basis of the input received at the workshop. The task group would report on progress at the 2008 CODATA General Assembly, by which time the task group would have to be in a position to demonstrate deliverables if it wishes to continue for a further term.

Day 1

Present:

Surname / Name / Title / Affiliation
Arnold / Robyn / Ms / Write Connection (scribe and rapporteur)
Chantson / Janine / Dr / ICSU-ROA
Chuntharpursat / Avinash / Mr / SAEON
Cooper / Antony / Dr / CODATA SANC
Du Plessis / Tanya / Dr / University of Johannesburg
Dubi / Alfonse / Dr / University of Dar es Salaam
Gevers / Wieland / Prof. / Academy of Science of South Africa
Glaser / Robyn / Ms / Department of Science and Technology
Guwa / Nontando / Ms / Academy of Science of South Africa
Haoses-Gorases / Lischen / Dr / University of Namibia
Kaniki / Andrew / Dr / National Research Foundation
Liu / Chuang / Prof. / Chinese Academy of Sciences and e-SDDC, UN GAID
Mabaso / Refiloe / Ms / CODATA SANC
Mati / Xola / Dr / Academy of Science of South Africa
Mloza-Banda / Henry / Mr / University of Malawi
Mohoto / Themba / Mr / CODATA SANC
Muhongo / Sospeter / Dr / ICSU Regional Office for Africa
Mukanganyama / Stanley / Dr / University of Zimbabwe
Nxumalo / Michael / Mr / National Research Foundation
Odido / Mika / Dr / UNESCO Intergovernmental Oceanographic Commission
Pienaar / Heila / Dr / University of Pretoria
Roman / Henry / Dr / CSIR / WAYS Africa
Rossouw / Steve / Prof. / CODATA SANC
Selematsela / Daisy / Dr / CODATA SANC
Sentoo / Naresh / Dr / Durban University of Technology
Sharif / Raed MS / Mr / CODATA USNC and Syracuse University
Uhlir / Paul / Mr / CODATA USNC
Van der Berg / Henda / Ms / CODATA SANC
Van Deventer / Martie / Dr / CSIR
Xiao / Yun / Prof. / Chinese Academy of Sciences and CODATA Chinese NC

SESSION 1: OPENING AND OBJECTIVES

Opening address (Dr Andrew Kaniki, Executive Director: Knowledge Management and Strategy, National Research Foundation)

Data and information are essential building blocks for knowledge. Scientific research is one of the key processes for generating data, information and knowledge. It is acknowledged that indigenous knowledge systems are critical. The importance of Type 2 knowledge generation is also acknowledged, but scientific research remains the key. The outputs of individuals and research institutions are often published in peer-reviewed journals, reports and other documents. Researchers in developing countries are conversant with the processes, systems and infrastructure for managing the available information, although researchers in such countries have varying degrees of access to such information. Even when researchers know how to access information using tools such as bibliographies and other resources, these may be prohibitively costly. There are instruments, policies, procedures and legislation in place to facilitate access to information. The research community in South Africa, for instance, is fortunate that much work is being done with the support and direction of the Academy of Science of South Africa (ASSAf), apart from the work of knowledge professionals. One of the key publications on research publishing in South Africa was directed by ASSAf, namely ‘Report on a Strategic Approach to Research Publishing in South Africa’.

The value of research publications is well recognised, but raw data sets are also very important to the scientific community around the world in terms of knowledge production. There is growing emphasis on secondary data analysis, and in some fields this is the major form of knowledge production, for instance, in astronomy. However, in developing countries, there has not been much work of this nature, and in most fields, efforts are only just beginning. Information and communication technology (ICT) capability now enables the explosion in data collection and data distribution and has focused attention on the potential value of securing and sharing expensively created data sets. For instance, South Africa spends enormous resources on the periodic national census, but there is minimal processing of the data by social scientists in terms of secondary data analysis.

In the case of the Southern African Large Telescope, of which the NRF is custodian, data are generated 24 hours a day, every day, and shared with observatories in other time zones of the southern hemisphere. The data are thus very well utilised, compared with data from a field such as sustainable development, for instance.

The International Council for Science (ICSU) has identified scientific data and information as a priority area in developing its strategic plan for the coming years. An international panel of experts was appointed in 2003 by the Committee on Scientific Planning and Review (CSPR) to assess the strategic issues with respect to the use of data and scientific information. A report was produced, namely ‘Scientific data and information: A report of the CSPR assessment panel’. It highlights issues related to policies, operations, management, and data and information. The principal recommendation is that ICSU should assume an international leadership role in identifying and addressing critical policy management issues related to scientific data and information. ICSU is made up of scientific bodies and countries. South Africa has started looking at what the country could contribute with respect to data curation and data archiving. One of the projects operating in South Africa, with the involvement of the NRF, ASSAf, the University of Pretoria, the Medical Research Council (MRC), the Human Sciences Research Council (HSRC) and the Council for Scientific and Industrial Research (CSIR) (as observer) is the National Data Information and Curation Centre, which is a virtual centre to ensure that the various aspects of data archiving and management are taken care of. A steering committee representing the various partners and chaired by the NRF has been formed. The steering committee was formalised in January 2007, although much previous groundwork had been done under various auspices, for instance the South African Research Information Service (SARIS) project that investigated scientific access to research information services. One of the issues is to identify who produces and owns data that can be shared and what policies and procedures are in place. The South African Cabinet is discussing legislation on access to information relating to publicly funded research. 
The key issues to bear in mind in developing such policy include the recognition of critical resources. The research community in South Africa and Africa is concerned that there should be rules and regulations and not simply open access. The key factor is that data must be fully utilised wherever they are.

The NRF and South African colleagues hope that this workshop will facilitate discussion, sharing and advancement in accessing and utilising data for sustainable development, particularly with colleagues in the Southern African Development Community (SADC).

Objectives of workshop (Prof. Steve Rossouw, Vice-President: CODATA and Chair: CODATA SANC)

The objectives of the workshop are:

  • To consider the main recommendations from the September 2005 Committee on Data for Science and Technology (CODATA) workshop (Strategies for Permanent Access to Scientific Information in Southern Africa: Focus on Health and Environmental Information for Sustainable Development) and discuss specific actions and initiatives that can be taken in response
  • To designate appropriate officials or officers in partner countries to actively support the initiatives with respect to data sources for sustainable development in SADC countries of regional interest
  • To promote mutual reinforcement with the converging goals of the ICSU Regional Office for Africa, the SADC and NEPAD (New Partnership for Africa’s Development) secretariat and other regional organisations, as appropriate, and coordinate actions in order to leverage the respective resources. (Resources are scarce, particularly in this part of the world.)
  • To establish a workplan for 2007/2008 for the CODATA Task Group on Data Sources for Sustainable Development in SADC. (CODATA task groups are generally funded for two-year periods and may on conclusion apply for continuation; permission is usually granted on the basis of satisfactory outputs during the initial two-year period, up to a maximum of six years.) It is therefore important to establish a workplan so that a proposal for continuation can be tabled at the CODATA General Assembly in 2008. The task group could produce outputs that would be valuable to the SADC region.

A number of possible activities are listed, and the document concludes that these and other issues and activities may be promoted by establishing new coordination frameworks, programmes, partnerships, mutual initiatives, networks, online consultation and general awareness raising within the policy, research, education and development community. Specific follow-up actions and responses by the parties should be identified for each issue or activity. Unless individuals and officials are nominated to carry out the objectives, the chances of producing anything of lasting value are slim. Participants are urged to consider a workable plan for the task group to implement. It is essential that people take ownership and responsibility so that work of lasting value can be produced.

Background

There are some 53 countries in the African Union, but only 15 are ICSU members. Of even more concern is that only four African countries are members of CODATA. Outreach is one of the objectives of the workshop, to firm up the contact made in regional workshops. Most developing countries face a range of problems, with priorities in areas such as health, education, poverty alleviation and employment creation, and data conservation is not a high priority for governments in the region.

South Africa has had paper archives for a long time, since the colonial period. Most of the information in these archives is preserved and is still accessible, although not always in good condition. Digital archiving is a new concept in southern Africa. The South African Data Archive (SADA) was established at the NRF in the early 1990s as a national repository for scientific data. Holdings from the MRC, HSRC and CSIR were deposited with SADA. In 1999, CODATA was invited to address a SADA management meeting and discuss whether CODATA could make any contribution towards the operation of SADA. As a result, the head of SADA at the time was invited to become a member of the CODATA national committee.

In the meantime, CODATA was taking other initiatives in the area of data conservation, and a workshop was held in Senegal on ‘Scientific and technical data handling in exchange for development’. (At the time, sustainable development was not as high on the international agenda as at present.) A three-person delegation from South Africa attended the workshop, including representation of the SADA office, and a paper on ‘Scientific and technical data in southern Africa – the situation in SADC countries’ was delivered. At the time, there were very few activities in the area of data conservation in SADC countries, with the exception of South Africa. During informal discussions at the workshop, John Rumble, former president of CODATA, and Paul Uhlir discussed the possibility of convening workshops on the archiving of data. As the result of a presentation to the CODATA General Assembly of 2000, a workshop on ‘Archiving for scientific and technical data’ was held in Pretoria in 2002, attended by some 70 delegates from the region as well as international experts, to provide input on technical aspects of archiving. The meeting was co-chaired by Bill Anderson (who unfortunately could not attend the present workshop). The topics discussed included scientific, technical, management and policy issues. These topics are still pertinent today, even though the environment is constantly shifting as technology, priorities and governments change, and they are all on the agenda for the present meeting. The workshop recommended the establishment of a task group. A working group is normally funded by CODATA for only two years. If it can show a need to continue the efforts, a task group may be established. The recommendations of the task group are on the CODATA website. The Task Group on Preservation of and Access to Scientific and Technical Data in Developing Countries was created at the CODATA General Assembly in 2002, with Bill Anderson and LIU Chuang as co-chairs.
Only two African countries were included in the membership of the task group, namely, Senegal and South Africa. In the other two African CODATA member countries (Nigeria and Cameroon), the national committees were not very active and did not take up the offer to become members of the task group at the time.