manage_large_binaries_with_git_lfs.md (committed by Marin Jankovski)
# Git LFS

Managing large files such as audio, video and graphics files has always been one
of the shortcomings of Git. The general recommendation is to not have Git repositories
larger than 1GB to preserve performance.

GitLab already supports [managing large files with git annex](http://docs.gitlab.com/ee/workflow/git_annex.html)
(EE only). However, in certain environments it is not always convenient to use
different commands for large files and regular ones.

Git LFS makes this simpler for the end user by removing the requirement to
learn new commands.

## How it works

The Git LFS client talks to the GitLab server over HTTPS. It uses HTTP Basic Authentication
to authorize client requests. Once a request is authorized, the Git LFS client receives
instructions on where to fetch the large file from or where to push it to.

## GitLab server configuration

Documentation for GitLab instance administrators is under [LFS administration doc](lfs_administration.md).

## Requirements

* Git LFS is supported in GitLab starting with version 8.2
* [Git LFS client](https://git-lfs.github.com) version 1.0.1 and up

## Known limitations

* The original Git LFS v1 API is not supported, since it was deprecated early in LFS
  development
* When SSH is set as a remote, Git LFS objects still go through HTTPS
* Any Git LFS request will ask for HTTPS credentials, so a good Git
  credentials store is recommended
* Git LFS always assumes HTTPS, so if your GitLab server is served over HTTP you will have
  to add the URL to the Git config manually (see [troubleshooting](#troubleshooting))
  
>**Note**: With 8.12 GitLab added LFS support to SSH. The Git LFS communication
 still goes over HTTP, but now the SSH client passes the correct credentials
 to the Git LFS client, so no action is required by the user.

## Using Git LFS

Let's take a look at the workflow when you need to check large files into your Git
repository with Git LFS. For example, if you want to upload a very large file and
check it into your Git repository:

```bash
git clone git@gitlab.example.com:group/project.git
git lfs install                       # initialize the Git LFS project
git lfs track "*.iso"                 # select the file extensions that you want to treat as large files
```
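
Tracking a pattern is recorded in a `.gitattributes` file in the repository. The sketch below does not call Git LFS at all: it writes by hand the line that `git lfs track "*.iso"` is expected to produce, inside a throwaway repository, and asks Git which filter applies. The temporary directory and the file names `debian.iso` and `notes.txt` are illustrative assumptions. Treat it as a way to inspect tracking rules, not as a replacement for `git lfs track`.

```bash
# Create a scratch repository so nothing in a real project is touched.
repo="$(mktemp -d)"
git init -q "$repo"

# The attribute line that `git lfs track "*.iso"` is expected to write.
printf '%s\n' '*.iso filter=lfs diff=lfs merge=lfs -text' > "$repo/.gitattributes"

# Ask Git which filter applies to two hypothetical file names.
iso_filter="$(git -C "$repo" check-attr filter -- debian.iso)"
txt_filter="$(git -C "$repo" check-attr filter -- notes.txt)"

echo "$iso_filter"
echo "$txt_filter"
```

Because the rule lives in `.gitattributes`, commit that file along with the large files so that other clones apply the same tracking.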

Once a certain file extension is marked for tracking as an LFS object you can use
Git as usual without having to redo the command to track a file with the same extension:

```bash
cp ~/tmp/debian.iso ./                # copy a large file into the current directory
git add .                             # add the large file to the project
git commit -am "Added Debian iso"     # commit the file metadata
git push origin master                # sync the git repo and large file to the GitLab server
```
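
What the commit records for an LFS-tracked file is a small text pointer, while the file content itself is uploaded separately. As a rough illustration, the function below rebuilds such a pointer by hand, following the Git LFS pointer format (a version line, a SHA-256 object ID and the size in bytes). `lfs_pointer` and the sample file are hypothetical names used only here; in a real repository Git LFS generates the pointer for you.

```bash
# Print a Git LFS style pointer for the file given as $1.
lfs_pointer() {
  local file="$1" oid size
  # SHA-256 of the content; fall back to `shasum` where `sha256sum` is missing.
  if command -v sha256sum >/dev/null 2>&1; then
    oid="$(sha256sum "$file" | cut -d ' ' -f 1)"
  else
    oid="$(shasum -a 256 "$file" | cut -d ' ' -f 1)"
  fi
  # Size in bytes; tr strips the padding some `wc` implementations add.
  size="$(wc -c < "$file" | tr -d '[:space:]')"
  printf 'version https://git-lfs.github.com/spec/v1\noid sha256:%s\nsize %s\n' "$oid" "$size"
}

# Illustrative input: a tiny stand-in for a large file.
sample="$(mktemp)"
printf 'hello\n' > "$sample"
pointer="$(lfs_pointer "$sample")"
echo "$pointer"
```

Comparing this output with `git show HEAD:debian.iso` in an LFS-enabled repository is one way to confirm that a file was stored as a pointer and not as a regular blob.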

Cloning the repository works the same as before. Git automatically detects the
LFS-tracked files and clones them via HTTP. If you performed the git clone
command with an SSH URL, you have to enter your GitLab credentials for HTTP
authentication.

```bash
git clone git@gitlab.example.com:group/project.git
```

If you already cloned the repository and you want to get the latest LFS objects
that are on the remote repository, e.g. from branch `master`:

```bash
git lfs fetch origin master
```

## Troubleshooting

### error: Repository or object not found

There are a couple of reasons why this error can occur:

* You don't have permissions to access a certain LFS object

Check if you have permissions to push to the project or fetch from the project.

* The project is not allowed to access the LFS object

The LFS object you are trying to push to the project or fetch from the project is not
available to the project anymore. Probably the object was removed from the server.

* The local Git repository is using a deprecated LFS API

### Invalid status for <url> : 501

Git LFS will log the failures into a log file.
To view this log file, while in the project directory:

```bash
git lfs logs last
```

If the status `error 501` is shown, it is because:

* Git LFS support is not enabled on the GitLab server. Check with your GitLab
  administrator why Git LFS is not enabled on the server. See
  [LFS administration documentation](lfs_administration.md) for instructions
  on how to enable LFS support.

* Git LFS client version is not supported by GitLab server. Check your Git LFS
  version with `git lfs version`. Check the Git config of the project for traces
  of deprecated API with `git lfs -l`. If `batch = false` is set in the config,
  remove the line and try to update your Git LFS client. Only version 1.0.1 and
  newer are supported.
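
The check for the deprecated setting can be scripted. This sketch assumes that the `batch = false` line sits in the `[lfs]` section of the repository config, that is, under the key `lfs.batch`. It runs against a throwaway repository in which the stale setting is planted on purpose.

```bash
# Scratch repository with the stale setting planted for the demonstration.
repo="$(mktemp -d)"
git init -q "$repo"
git -C "$repo" config lfs.batch false

# Read the setting; `git config --get` exits non-zero when the key is absent.
before="$(git -C "$repo" config --get lfs.batch || echo unset)"

# Remove the line if it disables the batch API.
if [ "$before" = "false" ]; then
  git -C "$repo" config --unset lfs.batch
fi

after="$(git -C "$repo" config --get lfs.batch || echo unset)"
echo "lfs.batch before: $before, after: $after"
```

In a real project, drop the scratch repository and run the `git config` commands from the project directory.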

### getsockopt: connection refused

If you push an LFS object to a project and you receive an error similar to:
`Post <URL>/info/lfs/objects/batch: dial tcp IP: getsockopt: connection refused`,
the LFS client is trying to reach GitLab through HTTPS. However, your GitLab
instance is being served on HTTP.

This behaviour is caused by Git LFS using HTTPS connections by default when
`lfs.url` is not set in the Git config.

To prevent this from happening, set the LFS URL in the project Git config:

```bash
git config --add lfs.url "http://gitlab.example.com/group/project.git/info/lfs"
```
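
The endpoint in the command above is the HTTP clone URL with `/info/lfs` appended. A small helper can derive it from the configured remote and confirm what ended up in the config. The helper name `lfs_endpoint`, the scratch repository and the example remote are assumptions for illustration.

```bash
# Derive the LFS endpoint from an HTTP(S) clone URL by appending /info/lfs.
lfs_endpoint() {
  # Strip a trailing slash, if any, before appending the path.
  printf '%s/info/lfs\n' "${1%/}"
}

# Scratch repository standing in for a project cloned over plain HTTP.
repo="$(mktemp -d)"
git init -q "$repo"
git -C "$repo" remote add origin "http://gitlab.example.com/group/project.git"

# Point Git LFS at the HTTP endpoint and read the value back.
remote_url="$(git -C "$repo" config --get remote.origin.url)"
git -C "$repo" config lfs.url "$(lfs_endpoint "$remote_url")"
configured="$(git -C "$repo" config --get lfs.url)"
echo "$configured"
```

Reading the value back with `git config --get lfs.url` is a quick way to verify the setting before retrying the push.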

### Credentials are always required when pushing an object

>**Note**: With 8.12 GitLab added LFS support to SSH. The Git LFS communication
 still goes over HTTP, but now the SSH client passes the correct credentials
 to the Git LFS client, so no action is required by the user.

Given that Git LFS uses HTTP Basic Authentication to authenticate the user pushing
the LFS object on every push for every object, user HTTPS credentials are required.

By default, Git has support for remembering the credentials for each repository
you use. This is described in [Git credentials man pages](https://git-scm.com/docs/gitcredentials).

For example, you can tell Git to remember the password for a period of time in
which you expect to push the objects:

```bash
git config --global credential.helper 'cache --timeout=3600'
```

This will remember the credentials for an hour after which Git operations will
require re-authentication.

If you are using OS X you can use `osxkeychain` to store and encrypt your credentials.
For Windows, you can use `wincred` or Microsoft's [Git Credential Manager for Windows](https://github.com/Microsoft/Git-Credential-Manager-for-Windows/releases).
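
Choosing a helper per platform can be scripted. The sketch below maps the output of `uname -s` to the helpers named above and falls back to the one-hour cache. The function name `pick_credential_helper` and the `MINGW`/`MSYS`/`CYGWIN` patterns used to detect Windows shells are assumptions. It only prints the command it would run, so your global config is left alone.

```bash
# Map an OS name (as printed by `uname -s`) to a credential helper.
pick_credential_helper() {
  case "$1" in
    Darwin)               echo "osxkeychain" ;;
    MINGW*|MSYS*|CYGWIN*) echo "wincred" ;;
    *)                    echo "cache --timeout=3600" ;;
  esac
}

helper="$(pick_credential_helper "$(uname -s)")"

# Print the command instead of running it.
echo "git config --global credential.helper '$helper'"
```

Printing the command first lets you review the choice before applying it to your global Git config.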

More details about various methods of storing the user credentials can be found
in the [Git Credential Storage documentation](https://git-scm.com/book/en/v2/Git-Tools-Credential-Storage).