1. 01 5月, 2019 8 次提交
  2. 30 4月, 2019 3 次提交
    • P
      Fix flaky gpcloud S3KeyReaderTest unit test (#7576) · 4ff071cf
      Peifeng Qiu 提交于
      S3KeyReaderTest.MTReadWithUnexpectedFetchDataAtSecondRound is a
      flaky case, related to multithread timing.
      
      The case setup S3KeyReader and try to download in parallel with 2
      chunks(threads). When any of them encounters an error, all thread
      will abort with the shared error.
      
      The case assumed that the first created thread will call fetchData()
      twice before another thread fetch with error. But if the first
      thread is never scheduled to run, the second thread will call
      fetchData() first and sets the shared error. Then the first thread
      continues and will exit at the first call to fetchData(), reporting
      shared error.
      
      Modify the second call to fetchData() to be at most once.
      4ff071cf
    • P
      Recursively create tablespace directories if they do not exist but we need... · 7a09e80d
      Paul Guo 提交于
      Recursively create tablespace directories if they do not exist but we need them when re-redoing some tablespace related xlogs (e.g. database create with a tablespace) on mirror.
      
      It is observed many time that gp_replica_check test fails because some mirror nodes
      can not be brought up before testing recently. The related log looks like this:
      
      2019-04-17 14:52:14.951 CST [23030] FATAL:  could not create directory "pg_tblspc/65546/PG_12_201904072/65547": No such file or directory
      2019-04-17 14:52:14.951 CST [23030] CONTEXT:  WAL redo at 0/3011650 for Database/CREATE: copy dir 1663/1 to 65546/65547
      
      That is because some mirror nodes can not be recovered after previous testing,
      not due to gp_replica_check itself. The root cause is that tablespace recovery
      related. Pengzhou Tang and Hao Wu digged that intially and kindly found a mini
      repro as below.
      
      run on shell:
        rm -rf /tmp/some_isolation2_pg_basebackup_tablespace
        mkdir -p /tmp/some_isolation2_pg_basebackup_tablespace
      
      copy and run the below sql on psql client:
        drop tablespace if exists some_isolation2_pg_basebackup_tablespace;
        create tablespace some_isolation2_pg_basebackup_tablespace location '/tmp/some_isolation2_pg_basebackup_tablespace';
        \!gpstop -ra -M fast;
        drop database if exists some_database_with_tablespace;
        create database some_database_with_tablespace tablespace some_isolation2_pg_basebackup_tablespace;
        drop database some_database_with_tablespace;
        drop tablespace some_isolation2_pg_basebackup_tablespace;
        \!gpstop -ra -M immediate;
      
      The root cause is on mirror after drop database & drop tablespace, 'immediate'
      stop causes the pg_control file not up-to-date with latest redo start lsn (this
      is allowed), when the node restarts, it re-redoes 'create database
      some_database_with_tablespace tablespace
      some_isolation2_pg_basebackup_tablespace' but the tablespace directories have
      been deleted in previous redoing.
      
      The 'could not create directory' error could happen on re-redoing create table
      in a tablespace also. We've seen this case on the ci environment, but that is
      because missing of a get_parent_directory() call in the 'create two parents'
      code block in TablespaceCreateDbspace(). Changing it to a simpler call
      pg_mkdir_p() instead.
      
      Also it seems that the src_path could be missing also in dbase_redo() for the
      example below. For example re-redoing at the alter step since tbs1 directory is
      deleted in later 'drop tablespace tbs1'.
        alter database db1 set tablespace tbs2;
        drop tablespace tbs1;
      
      There is discussion on upstream about this,
        https://www.postgresql.org/message-id/flat/CAEET0ZGx9AvioViLf7nbR_8tH9-%3D27DN5xWJ2P9-ROH16e4JUA%40mail.gmail.com
      
      In this patch I recreate those directories to avoid this error. Other solutions
      include ignoring the directory-not-existing error or forcing a flush when
      redoing those kind of checkpoint xlogs which are added normally in drop
      database, etc.
      
      Let's revert or update the code change after the solution is finalized on
      upstream.
      7a09e80d
    • L
      docs - pxf init/sync support to master standby (#7540) · 433a6ebb
      Lisa Owen 提交于
      * docs - pxf init/sync support to master standby
      
      * edits requested by david
      
      * edits requested by francisco and oliver
      
      * pxf sync from master TO standby or seg host
      
      * identify sync run on master in pxf sync option description
      433a6ebb
  3. 29 4月, 2019 8 次提交
  4. 28 4月, 2019 3 次提交
  5. 27 4月, 2019 2 次提交
  6. 26 4月, 2019 6 次提交
  7. 25 4月, 2019 10 次提交