提交 · f9565e303916ca194ef63b5fd3de541bf1c2a170 · 李少辉-开发者 / gitlab-foss

21 11月, 2017 2 次提交

Batchload blobs for diff generation · f9565e30

由 Zeger-Jan van de Weg 提交于 11月 03, 2017

After installing a new gem, batch-loader, a construct can be used to
queue data to be fetched in bulk. The gem was also introduced in both
gitlab-org/gitlab-ce!14680 and gitlab-org/gitlab-ce!14846, but those mrs
are not merged yet.

For the generation of diffs, both the old blob and the new blob need to
be loaded. This for every file in the diff, too. Now we collect all
these so we do 1 fetch. Three `.allow_n_plus_1_calls` have been removed,
which I expect to be valid, but this needs to be confirmed by a full CI
run.

Possibly closes:
- https://gitlab.com/gitlab-org/gitlab-ce/issues/37445
- https://gitlab.com/gitlab-org/gitlab-ce/issues/37599
- https://gitlab.com/gitlab-org/gitlab-ce/issues/37431

f9565e30

N

Fix bitbucket wiki import with hashed storage enabled · f2977c63
由 Nick Thomas 提交于 11月 20, 2017

f2977c63

20 11月, 2017 3 次提交

Don't move project repository/attachments when using hashed storage · fa39e8a0

由 Bob Van Landuyt 提交于 11月 20, 2017

When a project is using hashed storage, the repositories and
attachments wouldn't be saved on disk using the `full_path`. So the
migration would not do anything.

However: best to just skip moving when hashed storage is enabled.

fa39e8a0

Clean up schema of the "merge_requests" table · 936e9e89

由 Yorick Peterse 提交于 11月 14, 2017

This adds various foreign keys and indexes to the "merge_requests" table
as outlined in https://gitlab.com/gitlab-org/gitlab-ce/issues/31825.

Fixes https://gitlab.com/gitlab-org/gitlab-ce/issues/31825

936e9e89

A

Fix Gitlab::Git::Repository#remote_tags using unexisting variable · 3f0c9e97
由 Alejandro Rodríguez 提交于 11月 20, 2017

3f0c9e97

18 11月, 2017 2 次提交

A

Incorporate Gitaly's RefService.DeleteRefs RPC · 38730a2d
由 Alejandro Rodríguez 提交于 11月 17, 2017

38730a2d

Fix conflict highlighting · 64a9e53b

由 Sean McGivern 提交于 11月 17, 2017

Conflicts used to take a `Repository` and pass that to
`Gitlab::Highlight.highlight`, which would call `#gitattribute` on the
repository. Now they use a `Gitlab::Git::Repository`, which didn't have that
method defined - but defining it on `Gitlab::Git::Repository` does make it
available on `Repository` through `method_missing`, so we can do that and both
cases will work.

64a9e53b

17 11月, 2017 10 次提交
- F
  
  Changing OAuth lookup to be case insensitive · c7cf68bd
  由 Francisco Javier López 提交于 11月 17, 2017
  
  c7cf68bd
- F
  
  Renaming AuthenticationException to AuthenticationError · 4188c10c
  由 Francisco Lopez 提交于 11月 17, 2017
  
  4188c10c
- D
  
  Fix go-import meta data when enabled_git_access_protocol is a blank string · f767dd4a
  由 Douwe Maan 提交于 11月 17, 2017
  
  f767dd4a
- S
  Convert migration to populate latest merge request ID into a background migration · 5cecff89
  由 Stan Hu 提交于 11月 16, 2017
```
This is to smear updates over a few hours to avoid causing excessive
replication lag as seen in https://gitlab.com/gitlab-com/infrastructure/issues/3235.
```
  5cecff89
- F
  
  Moved Exceptions to Gitlab::Auth · 1436598e
  由 Francisco Lopez 提交于 11月 16, 2017
  
  1436598e
- F
  
  Moving exceptions to UserAuthFinders · aa84ef1e
  由 Francisco Lopez 提交于 11月 16, 2017
  
  aa84ef1e
- F
  
  Added some more comments · f1896575
  由 Francisco Lopez 提交于 11月 10, 2017
  
  f1896575
- F
  
  Added UserAuthFinders spec · 130a9933
  由 Francisco Lopez 提交于 11月 10, 2017
  
  130a9933
- F
  
  Added RequestAuthenticator spec · 8e57cc7e
  由 Francisco Lopez 提交于 11月 09, 2017
  
  8e57cc7e
- J
  Adds Rubocop rule for line break after guard clause · 181cd299
  由 Jacopo 提交于 11月 14, 2017
```
Adds a rubocop rule (with autocorrect) to ensure line break after guard clauses.
```
  181cd299
16 11月, 2017 1 次提交

Update container repository path reference · f4df4f9e

由 Grzegorz Bizon 提交于 11月 16, 2017

We should allow to use double underscore in the path, and it seems that
our container repository path regexp was outdated.

See https://github.com/docker/distribution/blob/master/reference/regexp.go

f4df4f9e

15 11月, 2017 2 次提交

R
Add total_time_spent to the `changes` hash in issuable Webhook payloads · 05c10c9b
由 Rémy Coutable 提交于 11月 14, 2017
```
Signed-off-by: NRémy Coutable <remy@rymai.me>
```
05c10c9b

Isolate the fork network background migrations · f961744b

由 Bob Van Landuyt 提交于 11月 15, 2017

Before the `PopulateForkNetworksRange` spec would also call the
`CreateForkNetworkMemberships` which we would count on in the spec.

With this, I'm isolating that, and counting only records created in
this particular migration instead.

f961744b

14 11月, 2017 3 次提交
- B
  Don't try to create fork network memberships for forks of forks · aaf18bb8
  由 Bob Van Landuyt 提交于 11月 14, 2017
```
In case the root project of a Fork-of-fork is deleted, the ForkNetwork
and the membership for that fork network is never created. In this
case we shouldn't try to create the membership, since the parent
membership will never be created.

This means that these fork networks will be lost.
```
  aaf18bb8
- A
  
  Incorporate Gitaly's WikiService.WikiGetAllPages RPC · 282e7f8e
  由 Alejandro Rodríguez 提交于 11月 09, 2017
  
  282e7f8e
- A
  
  Add spec examples for Gitlab::Gitaly::WikiService · 5a38a9d8
  由 Alejandro Rodríguez 提交于 11月 09, 2017
  
  5a38a9d8
13 11月, 2017 1 次提交
- L
  
  Add Gitlab::Utils::StrongMemoize · 258bf3e1
  由 Lin Jen-Shin (godfat) 提交于 11月 13, 2017
  
  258bf3e1
10 11月, 2017 2 次提交
- J
  
  Prepare Repository#fetch_source_branch for migration · de301d13
  由 Jacob Vosmaer (GitLab) 提交于 11月 10, 2017
  
  de301d13
- Y
  Clean up schema of the "issues" table · d825c9cb
  由 Yorick Peterse 提交于 11月 06, 2017
```
This adds various foreign key constraints, indexes, missing NOT NULL
constraints, and changes some column types from timestamp to
timestamptz.

Fixes https://gitlab.com/gitlab-org/gitlab-ce/issues/31811
```
  d825c9cb
09 11月, 2017 6 次提交

J

Handle forks in Gitlab::Checks::LfsIntegrity · ebd51744
由 James Edwards-Jones 提交于 11月 08, 2017

ebd51744

Merge branch 'ssrf-protections-round-2' into 'security-10-1' · 89bd7835

由 Douwe Maan 提交于 11月 07, 2017

Replace SSRF resolver with Addrinfo.getaddrinfo to include alternative localhost versions

See merge request gitlab/gitlabhq!2219

(cherry picked from commit 4a1e7378)

1bffa0c3 Replace SSRF resolver with Addrinfo.getaddrinfo to include alternative localhost versions

89bd7835

M
Revert "add metrics tagging to the sidekiq middleware" · 7fd3ce41
由 micael.bergeron 提交于 11月 08, 2017
```
This reverts commit d5859bb9.
This reverts commit 2b7e03cf.
This reverts commit 7799a9bc.
```
7fd3ce41

Support importing GH projects without rate limits · f37fe2ed

由 Yorick Peterse 提交于 11月 08, 2017

GitHub Enterprise disables rate limiting for the API, resulting in HTTP
404 errors when requesting rate limiting details. This changes
Gitlab::GithubImport::Client so it can deal with rate limiting being
disabled.

f37fe2ed

Restore Enterprise support in the GH importer · 2b886a78

由 Yorick Peterse 提交于 11月 08, 2017

This was removed by accident as the old GitHub importer handled this
deep down the codebase, making it easy to miss.

2b886a78

S

Fix Error 500 when pushing LFS objects with a write deploy key · 0232450c
由 Stan Hu 提交于 11月 08, 2017

0232450c

08 11月, 2017 8 次提交

J

Moved LfsIntegrity specs to own file · 78ea074f
由 James Edwards-Jones 提交于 11月 08, 2017

78ea074f
Y

Replace old GH importer with the parallel importer · 6e242e82
由 Yorick Peterse 提交于 10月 18, 2017

6e242e82

Rewrite the GitHub importer from scratch · 4dfe26cd

由 Yorick Peterse 提交于 10月 13, 2017

Prior to this MR there were two GitHub related importers:

* Github::Import: the main importer used for GitHub projects
* Gitlab::GithubImport: importer that's somewhat confusingly used for
  importing Gitea projects (apparently they have a compatible API)

This MR renames the Gitea importer to Gitlab::LegacyGithubImport and
introduces a new GitHub importer in the Gitlab::GithubImport namespace.
This new GitHub importer uses Sidekiq for importing multiple resources
in parallel, though it also has the ability to import data sequentially
should this be necessary.

The new code is spread across the following directories:

* lib/gitlab/github_import: this directory contains most of the importer
  code such as the classes used for importing resources.
* app/workers/gitlab/github_import: this directory contains the Sidekiq
  workers, most of which simply use the code from the directory above.
* app/workers/concerns/gitlab/github_import: this directory provides a
  few modules that are included in every GitHub importer worker.

== Stages

The import work is divided into separate stages, with each stage
importing a specific set of data. Stages will schedule the work that
needs to be performed, followed by scheduling a job for the
"AdvanceStageWorker" worker. This worker will periodically check if all
work is completed and schedule the next stage if this is the case. If
work is not yet completed this worker will reschedule itself.

Using this approach we don't have to block threads by calling `sleep()`,
as doing so for large projects could block the thread from doing any
work for many hours.

== Retrying Work

Workers will reschedule themselves whenever necessary. For example,
hitting the GitHub API's rate limit will result in jobs rescheduling
themselves. These jobs are not processed until the rate limit has been
reset.

== User Lookups

Part of the importing process involves looking up user details in the
GitHub API so we can map them to GitLab users. The old importer used
an in-memory cache, but this obviously doesn't work when the work is
spread across different threads.

The new importer uses a Redis cache and makes sure we only perform
API/database calls if absolutely necessary.  Frequently used keys are
refreshed, and lookup misses are also cached; removing the need for
performing API/database calls if we know we don't have the data we're
looking for.

== Performance & Models

The new importer in various places uses raw INSERT statements (as
generated by `Gitlab::Database.bulk_insert`) instead of using Rails
models. This allows us to bypass any validations and callbacks,
drastically reducing the number of SQL queries and Gitaly RPC calls
necessary to import projects.

To ensure the code produces valid data the corresponding tests check if
the produced rows are valid according to the model validation rules.

4dfe26cd

Cache feature names in RequestStore · 90be53c5

由 Yorick Peterse 提交于 10月 30, 2017

The GitHub importer (and probably other parts of our code) ends up
calling Feature.persisted? many times (via Gitaly). By storing this data
in RequestStore we can save ourselves _a lot_ of database queries.

Fixes https://gitlab.com/gitlab-org/gitlab-ce/issues/39361

90be53c5

Add returning IDs to Gitlab::Database.bulk_insert · bda30182

由 Yorick Peterse 提交于 10月 22, 2017

This adds the keyword argument "return_ids" to
Gitlab::Database.bulk_insert. When set to `true` (and PostgreSQL is
used) this method will return an Array of the IDs of the inserted rows,
otherwise it will return an empty Array.

bda30182

J

Improve GitLab Import rake task to work with Hashed Storage and Subgroups · 1c8af321
由 James Lopez 提交于 11月 07, 2017

1c8af321
B

Remove EE-specific group paths · 9b0899cb
由 Bob Van Landuyt 提交于 11月 07, 2017

9b0899cb
B

Update failure message when finding new routes in `PathRegex` · e070e216
由 Bob Van Landuyt 提交于 10月 27, 2017

e070e216