
In today's high volume and data-rich environment it is worth investigating if the data you need for your research project already exists. However, finding data can present a few challenges:
Despite this, there are various strategies and techniques you can employ to ensure you get the best quality data available, in a timely and economical fashion. Read on to learn more...
There are a range of places to look for data, but you could start your search in Google or a similar search engine.
Tips on using Google to find data.
Data repositories
Whilst there is no universally accepted way of establishing data quality (ARDC, 2022) there are ways in which you can test and validate the data you find online and elsewhere.
Properties of High-Quality data
Australian Research Data Commons. (2022, January 27). Describing dataset quality — towards a global best practice..https://ardc.edu.au/article/describing-dataset-quality-towards-a-global-best-practice/
Properties of well managed data (the FAIR principles)
A quality repository will have the elements of TRUST, which stands for: Transparency, Responsibility, User focus, Sustainability and Technology
Data repositories that meet a high standard of TRUST can apply for a badge called the CoreTrustSeal. Repositories that have the CoreTrustSeal are linked below, along with more information about the TRUST framework.
Depending on the way you are receiving (or sharing) data, a data license or data sharing agreement (or both) may be required. For example, the data you want to use may be licensed with a Creative Commons Share-alike licence, which would mean that if you used that data you would have to license your work in the same way. If you acquire data from a corporate or industry source, you may also have to enter into a data-sharing agreement with the data providers.
Data sharing agreements
A data sharing agreement brings clarity to what can and cannot be done with the data you are sharing as the owner or accessing as the requester. For example, third-party data providers will often want to:
When created carefully, a data sharing agreement should lay out these expectations, and give you confidence that you are doing the right thing with the data you've been provided, or that your collaborators will be doing the right thing with yours.
To help you create a data-sharing agreement we've linked the ARDC's Data Sharing Agreement Development Guidelines below.
Data licences
A data licence is similar to a data sharing agreement, however it is offered unilaterally to a work placed in the public domain, unlike a data sharing agreement, which is negotiated directly between two parties. Thusly by applying a data license to your work, you can control how your data is going to be reused by others, regardless of whether or not they are in direct contact with you. We have linked a page on data licenses from the ARDC below.