COMP5349: Cloud Computing
Week 1: Git Tutorial
A brief Intro to Git
Git is a version control system that allows people, mainly developers, to manage a project or a set of files by tracking their changes over time. A centralised hosting service is usually required to synchronise the changes made by different users or by the same user from different machines.
The most popular public web-based hosting services using Git are https://github.com/ and https://bitbucket.org/. Many organisations set up their own enterprise Git services to give members free access to the premium features. Our university has recently launched a hosting service based on GitHub Enterprise. You can find all the information and the login link at https://informatics.sydney.edu.au/code-repository/
The most basic concept in Git is the repository: a data structure that stores all the information about a project. A repository lives in the root directory of a project, in a hidden subdirectory called .git.
In the first exercise, we will explore the repository structure and familiarise ourselves with a few basic Git commands. In the second exercise, we will examine ways to synchronise the repository.
Please note that the tutorial only introduces some basic commands. They do not cover all possible scenarios you may encounter while using Git. When you encounter a unique scenario that you are unsure about, please consult the online resources. The main focus of this tutorial is to help you understand the internal structure of a Git repository. This is important as it helps you in understanding the Git commands and subsequently solving various Git problems.
You are encouraged to use a Linux-based system for this lab so that you can run all the exercises. However, you can run basic Git commands under Windows.
Question 1: Understanding the Local Repository
The aim of this exercise is to understand a Git local repository.
a) Reboot to Linux (optional)
In this tutorial, we will be using Linux. All the exercises can be done on your personal Mac without any changes; please ensure that Git is installed on it. If you do not have Git installed on your Mac, please follow the instructions at https://www.atlassian.com/git/tutorials/install-git#mac-os-x. If you prefer to do the lab exercises on Windows, make sure you use Windows PowerShell. At some point you may need to use Windows-specific commands. These commands are not covered in this tutorial; please consult the online resources.
School of Computer Science Dr. Ying Zhou
Sem. 1/2020
27.02.2020
If the lab machine is currently running Windows, reboot it into Linux. You will see the selection screen after restarting the machine. Once you are at the Linux login screen, log in with your unikey and password. The Linux version installed on the lab machines is Red Hat Enterprise Linux 7.
b) Initialise an empty repository
We will begin by initialising an empty local repository. Open a terminal window and change to your working directory using the cd command, such as cd /comp5349. Afterwards run the following commands:
mkdir week1
cd week1
echo comp5349 > enrol.txt
git init
The mkdir and cd commands create a directory named week1 and make it your current working directory. This directory will be used as your project’s root directory. The echo command creates a text file enrol.txt with the content “comp5349”. The git init command initialises an empty local Git repository inside the week1 directory.
In this directory, you should be able to see a hidden .git directory by using the command ls -a. This .git directory contains all the metadata and the actual project data, stored in various sub-directories. In general, Git repositories have the same structure, i.e. the same set of sub-directories. You can list the contents of the .git directory using the command ls .git. There should be entries with names like HEAD, objects, refs, etc. (HEAD is a plain file; objects and refs are directories.)
In the subsequent exercises, we will focus on the objects directory. Git stores every version of the project data in an “Object Database” residing in the objects directory. Currently, you will not see any files in it except for the two sub-directories. This is because our Git repository is still empty.
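As a rough check of the layout described above, the following sketch initialises a throwaway repository and lists its contents. The scratch directory created with mktemp is an assumption made for illustration, so that the commands do not interfere with your week1 directory.

```shell
# Work in a throwaway directory (an assumption for this sketch, not part of the lab).
demo=$(mktemp -d)
cd "$demo"

git init -q                  # create an empty repository quietly
ls -a                        # the hidden .git directory appears here
ls .git                      # top-level entries of the repository
find .git/objects -type d    # sub-directories of the object database
find .git/objects -type f    # prints nothing: no objects have been stored yet
```

Note that the sketch leaves your shell inside the scratch directory; change back to your week1 directory before continuing with the exercise.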
c) Adding files to your repository
Let’s add some files into our empty Git repository. Go back to the project’s root directory and issue the following commands:
git add enrol.txt
find .git/objects -type f
You should now see .git/objects/cd/bb2ce4ae5b3765b0d33a1acbff426c258b4bcd
The git add enrol.txt command adds the file enrol.txt into the repository. This creates a blob object in the .git/objects directory. The blob object is not given the same name as the file. Instead, it is named using the SHA1 hash of the file content (strictly, of the content preceded by a short header that Git adds). Git uses a two-level storage structure for all objects: the first two characters of the SHA1 hash become the directory name, and the remaining 38 characters become the object’s file name. In the above example, the SHA1 hash is cdbb2ce4ae5b3765b0d33a1acbff426c258b4bcd, so the first two characters, cd, are used as the directory name.
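The naming scheme can be reproduced by hand. The sketch below assumes the sha1sum utility from GNU coreutils is available (an assumption; macOS ships shasum instead). Git hashes a short header of the form "blob <size>", then a NUL byte, then the file content, so the header is included here. If enrol.txt was created exactly as in step b), the result should match the object name quoted above.

```shell
# Content written by `echo comp5349 > enrol.txt`: 8 characters plus a newline = 9 bytes.
# Git hashes the header "blob 9", a NUL byte, then the content.
hash=$(printf 'blob 9\0comp5349\n' | sha1sum | cut -d' ' -f1)

# Split the 40-character hash the way the object database does.
dir=$(printf '%s' "$hash" | cut -c1-2)    # first two characters: directory name
rest=$(printf '%s' "$hash" | cut -c3-)    # remaining 38 characters: file name

echo "hash: $hash"
echo "path: .git/objects/$dir/$rest"
```

Inside a working Git installation, git hash-object enrol.txt performs the same calculation without writing anything to the repository.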
Note that the file is not stored as it is in the repository. You cannot view the content of the blob object with a command like cat, because the file is compressed before it is stored. Git provides its own facility for viewing object content.
To view the content of object cdbb2ce4ae5b3765b0d33a1acbff426c258b4bcd, execute the following command. It works from anywhere inside the repository, so you can stay in the project’s root directory rather than changing into .git/objects:
git cat-file -p cdbb2ce4ae5b3765b0d33a1acbff426c258b4bcd
You should see “comp5349”, the content of the file.
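Besides -p, git cat-file can report an object's type (-t) and size (-s). The following sketch repeats the steps so far in a throwaway repository; the scratch directory is an assumption made so that the example is self-contained.

```shell
# Throwaway repository (hypothetical; used only so the sketch is self-contained).
demo=$(mktemp -d)
cd "$demo"
git init -q
echo comp5349 > enrol.txt
git add enrol.txt

id=$(git hash-object enrol.txt)   # object name Git assigns to this content
git cat-file -t "$id"             # object type
git cat-file -s "$id"             # size of the content in bytes
git cat-file -p "$id"             # the content itself
```

For this file the type is blob and the size is 9 bytes: the eight characters of "comp5349" plus the trailing newline written by echo.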
If you have two files with exactly the same content, e.g. one created by copying the other, there will only be one copy in Git’s object database. This is because they share the same SHA1 value. To test this out, create a backup of the enrol.txt file in the project’s root directory and add it to the Git repository:
cp enrol.txt enrol.bak
git add enrol.bak
You will notice that the object database remains unchanged: running find .git/objects -type f again still lists only the one blob object.
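One way to confirm this is to count the object files before and after adding the copy. The sketch below does so in a throwaway repository; the scratch directory and the variable names before and after are assumptions introduced for illustration.

```shell
# Throwaway repository (hypothetical; keeps the experiment away from week1).
demo=$(mktemp -d)
cd "$demo"
git init -q
echo comp5349 > enrol.txt
git add enrol.txt
before=$(find .git/objects -type f | wc -l)   # objects after the first add

cp enrol.txt enrol.bak
git add enrol.bak
after=$(find .git/objects -type f | wc -l)    # objects after adding the copy

echo "objects before: $before, after: $after"

# Identical content gives an identical object name, so nothing new is stored.
git hash-object enrol.txt
git hash-object enrol.bak
```

Both counts should be 1, and the two git hash-object lines should print the same name.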
d) Creating a commit
So far, the repository is only capturing the content of your project files i.e. no name is associated with any blob object. The name-content association only happens during commit. A commit is a snapshot of your project that can be retrieved when necessary.
Run the following command to create a commit:
git commit -m "First Commit"
If you have never run git commit before, the terminal will prompt you to set up the name and email address of the committer, that is, you. Set them up by running the following commands (replace <your name> with your actual name and <your email> with your own email address), then rerun git commit if the first attempt was rejected:
git config --global user.name "<your name>"
git config --global user.email "<your email>"
Run find .git/objects -type f to find all the objects in the object database. You will notice there are 3 objects there. If you use the same file names as the ones in the previous exercises, two of your objects will be:
.git/objects/cd/bb2ce4ae5b3765b0d33a1acbff426c258b4bcd
.git/objects/8d/935bd4ec8fad4bc451e9d26fe206bd0ebdd5e7
If you compare the names of these files with your classmates’, you will find that the two objects above (the blob and the tree object) have the same names, but not the other object. The third object, the commit object, will always have a name that is unique to each repository, because its content includes the committer’s name, email address and the commit time. Hence there is no way to predict it.
The commit command creates two objects: a tree object and a commit object. The tree object records the mapping between the blobs and the file names. As usual, the object name is the SHA1 hash of the content of the tree object. The commit object records the metadata of this commit (author, committer and message), a reference to its tree object, and a reference to its parent commit if there is one.
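The three objects can be told apart with git cat-file -t. The sketch below rebuilds the exercise in a throwaway repository; the scratch directory and the name and email passed with -c are placeholders chosen for this example, not values from the lab.

```shell
# Throwaway repository (hypothetical; mirrors the steps of this exercise).
demo=$(mktemp -d)
cd "$demo"
git init -q
echo comp5349 > enrol.txt
cp enrol.txt enrol.bak
git add enrol.txt enrol.bak

# The name and email below are placeholders chosen for this sketch.
git -c user.name="Example Student" -c user.email="student@example.com" \
    commit -q -m "First Commit"

count=$(find .git/objects -type f | wc -l)
echo "objects: $count"           # one blob, one tree, one commit

git cat-file -t HEAD             # type of the object HEAD resolves to
git cat-file -t 'HEAD^{tree}'    # type of the tree that commit points at
git cat-file -p HEAD             # tree reference, author, committer, message
```

Because this is the first commit in the repository, the output of git cat-file -p HEAD contains no parent line.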
Let us inspect the tree object starting with 8d – recall that git cat-file -p
