Data organization is about working more efficiently with data.
The standard methods of data organisation are:
Researchers often need to share primary data and preliminary results with collaborators, or may wish to transfer data to another computer system (e.g. from university to home).
The most common method for transferring files is with email attachments, but there are limits to the size of files that can be transferred. While removable data storage media (USBs, CDs, etc) can transfer larger files, they require the researcher to physically transfer the data.
Large files are usually transferred using FTP (File Transfer Protocol), which allows the user to download as well as upload. An FTP client (e.g. FTP Explorer) is used to connect and transfer files, and access can be restricted by username and password.
To assist good data management, ANU provides local area network and Internet access to the Homedrive, a central file space on which each member of the University is allocated file space to store personal files (students/staff 4.5GB). Homedrive is accessible from any Information Commons computer or via https://myfiles.anu.edu.au/
File Synchronisation Programs:
Many research projects are carried out collaboratively.
When collaborating on simple work and with a small number of collaborators, data transfer is usually done by email, USB or a shared network drive.
For more complex tasks, or for projects with many collaborators - it is worth considering using collaborative software tools, such as the ANU-provided Alliance which provides a wide range of collaborative tools such as forums, chat rooms, calendars, and more.
When data is constantly being edited, especially by multiple users, it is a good idea to implement some form of version control to keep track of changes. This can be as simple as appending a number to the end of a file after each major edit. For example:
Such conventions are good for simple work but quickly become unmanageable when you have multiple authors or make lots of edits.
The alternative is to use revision (or version) control software. Version control software provides access control, a collaborative work environment, synchronization between home/office/laptop computers, and a degree of data safety.
Such programs offer several advantages:
While version control software is in some cases harder to set up, it provides more advanced version tracking. A distributed version control system like Bazaar can be used with Alliance to collaboratively manage documents and data.
TortoiseSVN uses the Subversion system of version control. It integrates with Windows Explorer making it one of the easiest version control systems to use.
Such tools make it easier for multiple users to access and edit the latest version of a document without conflicting with other people’s changes. The entire history is stored making it easier for users to see any changes made, and to revert to an older version if needed.
+61 2 6125 5111
The Australian National University, Canberra
CRICOS Provider : 00120C
ABN : 52 234 063 906