1. 17 Nov 2020, 1 commit
    • Avoid checking distributed snapshot for visibility checks on QD · 48b13271
      Authored by Ashwin Agrawal
      This is a partial cherry-pick of commit
      b3f300b9.  On the QD, distributed
      transactions become visible at the same time as the corresponding
      local ones, so we can rely on the local XIDs. This holds because the
      modifications of the local procarray and globalXactArray are protected
      by a lock and hence form an atomic operation during transaction commit.
      
      We have seen many situations where catalog queries run very slowly on
      the QD, and a likely reason is checking the distributed logs. The
      process-local distributed log cache falls short for this use case
      because most XIDs are unique and hence miss the cache frequently. The
      shared-memory cache also falls short because it holds only 8 pages,
      while many more pages would often need to be cached to be effective.
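      As a rough illustration of the cache-miss argument above, here is a toy
      Python simulation (not Greenplum code; the page and cache sizes are
      illustrative): mostly-unique XIDs scatter across far more pages than an
      8-page cache can hold, so nearly every lookup misses.

```python
from collections import OrderedDict

ENTRIES_PER_PAGE = 32768   # illustrative: log entries per cached page
CACHE_PAGES = 8            # the shared-memory cache holds only 8 pages

def hit_rate(xids):
    """Simulate an LRU page cache; return the fraction of lookups it serves."""
    cache = OrderedDict()
    hits = 0
    for xid in xids:
        page = xid // ENTRIES_PER_PAGE
        if page in cache:
            cache.move_to_end(page)
            hits += 1
        else:
            cache[page] = True
            if len(cache) > CACHE_PAGES:
                cache.popitem(last=False)   # evict least-recently-used page
    return hits / len(xids)

# XIDs clustered on a few pages hit the cache almost always ...
print(hit_rate([x % (ENTRIES_PER_PAGE * 4) for x in range(100_000)]))
# ... but widely scattered (mostly unique) XIDs thrash the 8-page cache.
print(hit_rate([x * ENTRIES_PER_PAGE for x in range(100_000)]))
```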
      Co-authored-by: Hubert Zhang <hzhang@pivotal.io>
      Co-authored-by: Gang Xiong <gangx@vmware.com>
  2. 16 Nov 2020, 1 commit
  3. 13 Nov 2020, 1 commit
  4. 12 Nov 2020, 1 commit
  5. 06 Nov 2020, 3 commits
    • Harden analyzedb against concurrently dropped and recreated tables · 22448ff7
      Authored by Abhijit Subramanya
      Commit 4bbbb381 introduced some hardening around concurrent drops and
      recreations of tables while analyzedb is running, but it failed to take
      into account the code that updates the last operation performed. This
      commit fixes that.
      
      (cherry picked from commit 6949ecfb1a90c92db63c7c26463f3a9110d3fa12)
    • Bump ORCA version to 3.113 (#11124) · 7980a57a
      Authored by Hans Zeller
      The corresponding ORCA PR is https://github.com/greenplum-db/gporca/pull/607
    • Faithfully parse inheritance-recursion for external tables · 0387161b
      Authored by Jesse Zhang
      When SQL standard table inheritance was added upstream (by commit
      2fb6cc90 in Postgres 7.1), mentioning a table in the FROM clause of a
      query would necessarily mean traversing through the inheritance
      hierarchy. The need to distinguish the (legacy, less common, but
      legitimate nonetheless) intent of not recursing into child tables gave
      rise to two things: the GUC `sql_inheritance`, which toggles the
      default semantics of parent tables, and the `ONLY` keyword used in
      front of parent table names to explicitly skip descendant tables.
      
      ORCA doesn't like queries that skip descendant tables: it falls back to
      the legacy planner as soon as it detects that intent.
      
      Way way back in Greenplum-land, when external tables were given a
      separate designation in relstorage (RELSTORAGE_EXTERNAL), we seem to
      have added code in the parser (parse analysis) so that queries on
      external tables *never* recurse into their child tables, regardless of
      what the user specifies -- either via `ONLY` or `*` in the query, or
      via the GUC `sql_inheritance`. Technically, that process scrubs the
      range table entries to hard-code "do not recurse".
      
      The combination of those two things -- hard-coding "do not recurse" in
      the RTE for the analyzed parse tree, and ORCA detecting the intent of
      `ONLY` through the RTE -- led ORCA to *always* fall back to the planner
      when an external table is mentioned in the FROM clause. Commit 013a6e9d
      tried fixing this by *detecting harder* whether there's an external
      table.
      
      The behavior of the parse-analyzer hard coding a "do not recurse" in the
      RTE for an external table seems wrong for several reasons:
      
        1. It seems unnecessarily defensive
      
        2. It doesn't seem to belong in the parser.
      
           a. While code that changes "recurse" back to "do not recurse"
           abounds, all other occurrences happen in the planner as an
           optimization for childless tables.
      
           b. It deprives an optimizer of the actual intent expressed by the
           user: because of this hardcoding, neither ORCA nor planner would
           have a way of knowing whether the user specified `ONLY` in the
           query.
      
           c. It deprives the user of the ability to use child tables with an
           external table, either deliberately or coincidentally.
      
           d. A corollary is that any old views created as `SELECT a,b FROM
           ext_table` will be perpetuated as `SELECT a,b FROM ONLY ext_table`.
      
      This commit removes this defensive setting in the parse analyzer. As a
      consequence, we're able to reinstate the simpler RTE check before commit
      013a6e9d. Queries and new views will include child tables as expected.
      
      Note that this commit will introduce a behavior change:
      (taken from https://github.com/greenplum-db/gpdb/pull/5455#issuecomment-412247709)
      
      1. a (non-external) table by default means "me and my descendants", even if it's childless
      2. an external table with child tables previously would never recurse into child tables
      3. after this patch, you need to use ONLY to exclude descendant tables
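      The behavior change can be modeled with a toy Python sketch (purely
      illustrative, not Greenplum code; the table names and the
      `hardcode_no_recurse` flag are made up to mirror the old parser
      behavior described above):

```python
# Which tables a FROM-clause reference expands to, before and after the fix.
children = {"ext_table": ["ext_child"], "plain": []}

def expand(table, only=False, is_external=False, hardcode_no_recurse=True):
    # Before the fix, parse analysis hard-coded "do not recurse" for
    # external tables, overriding whatever the user wrote.
    if only or (is_external and hardcode_no_recurse):
        return [table]
    return [table] + children.get(table, [])

# Old behavior: external tables never recursed, even without ONLY.
print(expand("ext_table", is_external=True, hardcode_no_recurse=True))
# New behavior: like any other table, recurse unless the user writes ONLY.
print(expand("ext_table", is_external=True, hardcode_no_recurse=False))
print(expand("ext_table", only=True, is_external=True, hardcode_no_recurse=False))
```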
      
      (cherry picked from commit 2371cb3b)
  6. 03 Nov 2020, 1 commit
    • Fix resgroup being unusable if dropping it failed · b7c42625
      Authored by xiong-gang
      In function DropResourceGroup(), group->lockedForDrop is set to true
      by calling ResGroupCheckForDrop(); however, it can only be set back to
      false inside dropResgroupCallback. This callback is registered at the
      end of DropResourceGroup(), so if an error occurred between the two,
      group->lockedForDrop would remain true forever.
      
      Fix it by registering the callback ahead of the lock call. To prevent
      Assert(group->nRunning* > 0) from firing if ResGroupCheckForDrop()
      throws an error, return directly if group->lockedForDrop did not
      change.
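      The callback-ordering fix can be sketched with a hypothetical Python
      toy (not the actual C code; all names here are illustrative): cleanup
      must be registered *before* the state change it undoes, or an error
      thrown in between leaves the group locked forever.

```python
class Group:
    def __init__(self):
        self.locked_for_drop = False

callbacks = []

def register_abort_callback(cb):
    callbacks.append(cb)

def run_abort_callbacks():
    while callbacks:
        callbacks.pop()()

def check_for_drop(group, fail):
    group.locked_for_drop = True      # takes the "drop lock"
    if fail:
        raise RuntimeError("error while checking for drop")

def drop_resource_group_fixed(group, fail=False):
    # Fixed ordering: register the unlock callback first, ...
    register_abort_callback(lambda: setattr(group, "locked_for_drop", False))
    try:
        check_for_drop(group, fail)   # ... then take the lock.
    except RuntimeError:
        run_abort_callbacks()         # the abort path now reliably unlocks
        raise

g = Group()
try:
    drop_resource_group_fixed(g, fail=True)
except RuntimeError:
    pass
print(g.locked_for_drop)  # the group is usable again after a failed drop
```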
      
      See:
      
      ```
      gpconfig -c gp_resource_manager -v group
      gpstop -r -a
      
      psql
                      CPU_RATE_LIMIT=20,
                      MEMORY_LIMIT=20,
                      CONCURRENCY=50,
                      MEMORY_SHARED_QUOTA=80,
                      MEMORY_SPILL_RATIO=20,
                      MEMORY_AUDITOR=vmtracker
              );
      
      psql -U user_test
      > \d -- hang
      ```
      Co-authored-by: dh-cloud <60729713+dh-cloud@users.noreply.github.com>
  7. 02 Nov 2020, 1 commit
  8. 28 Oct 2020, 2 commits
  9. 26 Oct 2020, 1 commit
    • Skip setting PT information in WAL during recovery · b8ac48ad
      Authored by xiong-gang
      If the system crashes while splitting a btree page and the WAL of the
      downlink to the parent is not flushed, crash recovery will complete
      this step in 'btree_xlog_cleanup', which may lead to splitting the
      parent and writing a new WAL record in '_bt_split'. Because we skip
      populating the PT (persistent table) information during recovery, in
      function 'RelationNeedToFetchGpRelationNodeForXLog', this causes a
      PANIC. This is fixed in 6X and later versions by commit '40dae7ec';
      this commit only sets invalid PT information in the new WAL record.
      Co-authored-by: Gang Xiong <gangx@vmware.com>
  10. 22 Oct 2020, 1 commit
  11. 21 Oct 2020, 1 commit
    • The inner relation of LASJ_NOTIN should not have partitioned locus · 343f8826
      Authored by Jinbao Chen
      The result of NULL NOT IN a non-empty set is false; the result of
      NULL NOT IN an empty set is true. But if a non-empty set has
      partitioned locus, the set will be divided into several subsets, and
      some of those subsets may be empty. Because NULL NOT IN an empty set
      evaluates to true, tuples that shouldn't exist can appear in the
      result set.
      
      The patch disables the partitioned locus of the inner table by
      removing the join clause from the redistribution_clauses.
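      The NULL semantics behind this bug can be shown with a small Python
      sketch (illustrative only, not Greenplum code):

```python
def null_not_in(values):
    """SQL semantics of `NULL NOT IN (values)`: TRUE for an empty set,
    unknown (which does not qualify a row, i.e. False here) otherwise."""
    return len(values) == 0

inner = [1, 2, 3, 4]

# Against the whole (non-empty) inner set, a NULL outer value never qualifies:
print(null_not_in(inner))          # False: the NULL row is filtered out

# If the inner side is split by a partitioned locus, some segments may see
# an empty subset and wrongly emit the NULL row:
partitions = [[1, 3], [2, 4], []]  # e.g. 3 segments, one gets no tuples
print(any(null_not_in(p) for p in partitions))  # True: a spurious result
```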
      
      Cherry-picked from 6X_STABLE commit 8c93db54f3d93a890493f6a6d532f841779a9188
      Co-authored-by: Hubert Zhang <hubertzhang@apache.org>
      Co-authored-by: Richard Guo <riguo@pivotal.io>
  12. 16 Oct 2020, 1 commit
    • Increment ExternalScan::scancounter across queries in ORCA · 78024fbc
      Authored by Shreedhar Hardikar
      gpfdist uses the global xid & timestamp to distinguish whether each
      connection belongs to the same external scan or not.
      
      ORCA generates a unique scan number for each ExternalScan within the
      same plan, but not across plans. So, within a transaction, we may issue
      multiple external scans that do not get differentiated properly,
      producing different results.
      
      This commit patches that by using a different scan number across plans,
      just like the planner does. Ideally, gpfdist should also take the
      command-id of the query into account to prevent this problem in other
      cases, such as prepared statements.
  13. 14 Oct 2020, 1 commit
    • Bump ORCA version to 3.112.0 · 395179c8
      Authored by Chris Hajas
      Corresponding ORCA commit: "Disallow physical index scans in Orca to
      have predicate condition with subquery"
  14. 08 Oct 2020, 1 commit
  15. 03 Oct 2020, 2 commits
  16. 02 Oct 2020, 4 commits
    • Add format checking in PR pipeline · 713981ed
      Authored by Jesse Zhang
      In the same vein as commit 868fd2d7, this adds a very quick
      clang-format job that checks for conformance, similar to what the
      Travis CI job did in commit 54273fdd (greenplum-db/gpdb#10357).
      
      (cherry picked from commit 6aac45f3)
    • Check ORCA and GPOPT formatting in Concourse. · 26e8c12e
      Authored by Jesse Zhang
      This patch adds a simple job that checks for clang-format
      conformance, similar to what the Travis CI job did in commit 54273fdd
      (greenplum-db/gpdb#10357).
      
      I was not amused when I realized that gen_pipeline.py doesn't do the
      due diligence of finding transitive dependencies (so the new job
      actually already blocks release, but the script insists on stopping at
      immediate blockers). But that's left for another day.
      
      (cherry picked from commit 868fd2d7)
    • Replace docker-image with registry-image. · c8993c2c
      Authored by Jesse Zhang
      For context, the registry-image resource was introduced more than two
      years ago as a drop-in replacement for the docker-image resource
      (especially when you're only fetching, not building). It's leaner,
      less resource-intensive, faster, and doesn't rely on spawning a Docker
      daemon.
      
      Also swept up in this patch are four unused files that are left behind
      by previous changes.
      
      See https://github.com/concourse/registry-image-resource for more.
      
      This is spiritually a cherry-pick of be6ff30f, but I'm only
      touching the 5X_STABLE pipeline and the PR pipeline because they are in
      my way of backporting 868fd2d7 and 6aac45f3
      (clang-format check).
    • Use in_parallel in lieu of aggregate · 472e263c
      Authored by Jesse Zhang
      In the same vein as commits a5fb66d3 and d1f74955.
      
      fly validate-pipeline (and set-pipeline) warns about it being
      deprecated. This is also slightly in my way of backporting
      868fd2d7 and 6aac45f3.
  17. 01 Oct 2020, 1 commit
    • Skip segments which are down when updating filespace files on the host · 1fe16e29
      Authored by Bhuvnesh Chaudhary
      If a segment host was down and the temporary or transaction filespace
      was moved to a different filespace (i.e., other than the default), the
      utility used to SSH to the down host to update the filespace content
      in a file there, and would fail because the host is down. This caused
      utilities like gprecoverseg to fail even if the segments on the down
      host were excluded from the gprecoverseg input file.
  18. 30 Sep 2020, 1 commit
  19. 29 Sep 2020, 6 commits
    • Format ORCA and GPOPT. · 219fe0c4
      Authored by Jesse Zhang
      The canonical config file is in src/backend/gpopt/.clang-format
      (instead of under the non-existent src/backend/gporca), and I've
      created one symlink (instead of two) for the GPOPT headers. Care has
      been taken to repoint the symlink to the canonical config under gpopt,
      instead of gporca as it is under HEAD.
      
      This is spiritually a cherry-pick of commit 2f7dd76c.
      (cherry picked from commit 2f7dd76c)
    • [Travis] Checks formatting in Travis. · 8b7421b2
      Authored by Jesse Zhang
      Add a stage called "check-format" that runs before the fan-out of
      installcheck-small. While we're at it, also enforce that the full config
      file is generated from the intent file. This should be fairly quick. On
      my laptop, the `fmt chk` step takes 3 seconds. (On this backbranch, it
      actually takes 1 second on my laptop).
      
      Care has been taken to add back the matrix syntax in Travis CI config so
      it's still valid. Also subtle is the explicit mention of the "test"
      stage in the matrix so as not to lose it in the expansion, as this
      backbranch effectively doesn't have a build matrix (every "matrix key"
      has at most one value).
      
      Enforcement in Concourse is forthcoming.
      
      (cherry picked from commit 54273fdd)
    • Adds a script to format and check formatting. · 310c3674
      Authored by Jesse Zhang
      This is intended for both local developer use and for CI.
      
      This depends on GNU parallel. One-time install:
      
      macOS: brew install parallel clang-format
      Debian: apt install parallel clang-format-10
      
      To format all ORCA / GPOPT code:
      
      $ src/tools/fmt fmt
      
      To check for formatting conformance:
      
      $ src/tools/fmt chk
      
      To modify the configuration, you'll need two steps:
      1. Edit clang-format.intent.yml
      2. Generate the expanded configuration file:
      
      $ src/tools/fmt gen
      
      This commit also adds a formatting README to document some of the
      rationale behind the tooling choice. The new `README.format.md` is
      also mentioned from both the style guide and ORCA's main README.
      
      (cherry picked from commit 57b744c1)
    • Initial .clang-format. · 4390927b
      Authored by Jesse Zhang
      Generated using clang-format-10
      
      This is spiritually a cherry-pick of commit 16b48d24, but I had to
      tweak things a bit by moving the .clang-format file from
      src/backend/gporca to under src/backend/gpopt.
      
      (cherry picked from commit 16b48d24)
    • Project certain outer refs in the targetlist of subqueries · 82fa6ef7
      Authored by Shreedhar Hardikar
      In a previous ORCA version (3.311) we added code to fall back gracefully
      when a subquery select list contains a single outer ref that is not part
      of an expression, such as in
      
      select * from foo where a is null or a = (select foo.b from bar)
      
      This commit adds a fix that allows us to handle such queries in ORCA
      by adding a project in the translator that will echo the outer ref
      from within the subquery, and using that projected value in the
      select list of the subquery. This ensures that we use a NULL value for
      the scalar subquery in the expression for the outer ref when the
      subquery returns no rows.
      
      Also note that this is still skipped for grouping cols in the target
      list. This was done to avoid regression for certain queries, such as:
      
      select *
      from A
      where not exists (select sum(C.i)
                        from C
                        where C.i = A.i
                        group by a.i);
      
      ORCA is currently unable to decorrelate subqueries that contain
      Project nodes, so a `SELECT 1` in the subquery would also cause this
      regression. In the above query, the parser adds `a.i` to the target
      list of the subquery, which would get an echo projection (as described
      above) and thus prevent decorrelation by ORCA. For this reason, we
      decided to maintain the existing behavior until ORCA can handle
      projections in subqueries better.
      
      Also add ICG tests.
      Co-authored-by: Hans Zeller <hzeller@pivotal.io>
      Co-authored-by: Shreedhar Hardikar <shardikar@pivotal.io>
    • Bump ORCA version to 3.111.0, add test cases · 24049865
      Authored by Hans Zeller
      The corresponding ORCA PR is https://github.com/greenplum-db/gporca/pull/605
  20. 26 Sep 2020, 1 commit
  21. 22 Sep 2020, 1 commit
  22. 21 Sep 2020, 1 commit
    • Fix interconnect hang issue (#10757) · 3bef5530
      Authored by Jinbao Chen
      We have hit this interconnect hang many times in many cases, all with
      the same pattern: the downstream interconnect motion senders keep
      sending tuples, blind to the fact that the upstream nodes have
      finished and quit execution earlier; the QD then gets enough tuples
      and waits for all QEs to quit, which causes a deadlock.
      
      Many nodes may quit execution early, e.g., LIMIT, Hash Join, Nested
      Loop. To resolve the hang, they need to stop the interconnect stream
      explicitly by calling ExecSquelchNode(); however, we cannot do that
      for rescan cases, in which data might be lost, e.g., commit 2c011ce4.
      For rescan cases, we tried using QueryFinishPending to stop the
      senders in commit 02213a73 and let the senders check this flag and
      quit. That commit has its own problems: first, QueryFinishPending can
      only be set by the QD, so it doesn't work for INSERT or UPDATE cases;
      second, that commit only lets the senders detect the flag and quit the
      loop in a rude way (without sending EOS to their receivers), so the
      receivers may still be stuck receiving tuples.
      
      This commit first reverts the QueryFinishPending method.
      
      To resolve the hang, we move TeardownInterconnect ahead of
      cdbdisp_checkDispatchResult, which guarantees that the interconnect
      stream is stopped before waiting for and checking the status of the
      QEs.
      
      For UDPIFC, TeardownInterconnect() removes the ic entries; any packets
      for this interconnect context will be treated as 'past' packets and
      acked with the STOP flag.
      
      For TCP, TeardownInterconnect() closes all connections with its
      children; the children will treat any readable data in the connection,
      including the closure itself, as a STOP message.
      
      This commit is backported from master commit ec1d9a70.
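      The shutdown-ordering argument can be sketched with a toy Python model
      (purely illustrative; the `Sender` class and step loop are made up and
      stand in for the real motion senders and dispatcher wait):

```python
# If the dispatcher waits for QEs before tearing down the interconnect, a
# sender still streaming to a vanished receiver never finishes -- deadlock.
# Tearing down first turns pending packets into 'past' packets acked with STOP.

class Sender:
    def __init__(self):
        self.done = False

    def step(self, interconnect_up):
        if interconnect_up:
            return          # receiver is gone; packets are never acked
        self.done = True    # teardown acks with STOP, so the sender quits

def shutdown(teardown_first, max_steps=10):
    sender = Sender()
    interconnect_up = True
    if teardown_first:
        interconnect_up = False     # TeardownInterconnect() before waiting
    for _ in range(max_steps):      # "wait for QEs" (bounded for the demo)
        sender.step(interconnect_up)
        if sender.done:
            return "clean shutdown"
    return "deadlock"

print(shutdown(teardown_first=False))  # old order: senders never quit
print(shutdown(teardown_first=True))   # fixed order: stream stopped first
```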
  23. 19 Sep 2020, 2 commits
    • Refactor query string truncation on top of 889ba39e · e393c88b
      Authored by Asim R P
      Commit 889ba39e fixed the query string truncation in dispatcher to
      make it locale-aware.  This patch refactors that change so as to avoid
      accessing a string beyond its length.
      
      Reviewed by: Heikki, Ning Yu and Polina Bungina
      
      (cherry picked from commit abf6b330)
    • Fix query string truncation while dispatching to QE · b76d049b
      Authored by Polina Bungina
      Executing a long enough query containing multi-byte characters can
      cause incorrect truncation of the query string. Incorrect truncation
      implies an occasional cut through a multi-byte character and (with
      log_min_duration_statement set to 0) a subsequent write of an invalid
      symbol to the segment logs. Such a broken character in the logs causes
      problems when fetching log info from the
      gp_toolkit.__gp_log_segment_ext table: queries fail with «ERROR:
      invalid byte sequence for encoding…».
      
      This is caused by the buildGpQueryString function in
      `cdbdisp_query.c`, which prepares the query text for dispatch to the
      QEs. It does not take character length into account when truncation is
      necessary (i.e., when the text is longer than
      QUERY_STRING_TRUNCATE_SIZE).
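      The character-boundary problem can be illustrated with a Python sketch
      (not the actual C fix; the `TRUNCATE_SIZE` value and function names are
      made up): a byte-level cut can land inside a multi-byte UTF-8
      character, while an encoding-aware cut backs up to a character
      boundary.

```python
TRUNCATE_SIZE = 9   # illustrative limit that lands mid-character below

def truncate_bytes_naive(query, limit=TRUNCATE_SIZE):
    return query.encode("utf-8")[:limit]            # may split a character

def truncate_bytes_safe(query, limit=TRUNCATE_SIZE):
    raw = query.encode("utf-8")[:limit]
    # Drop trailing UTF-8 continuation bytes (0b10xxxxxx), then a cut
    # lead byte, so the result is always a valid encoding.
    while raw and (raw[-1] & 0xC0) == 0x80:
        raw = raw[:-1]
    if raw and raw[-1] >= 0xC0:                     # cut lead byte
        raw = raw[:-1]
    return raw

q = "SELECT 'привет'"   # Cyrillic characters take 2 bytes each in UTF-8
try:
    truncate_bytes_naive(q).decode("utf-8")
except UnicodeDecodeError:
    print("naive truncation produced an invalid byte sequence")
print(truncate_bytes_safe(q).decode("utf-8"))  # always decodes cleanly
```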
      
      (cherry picked from commit f31600e9)
  24. 18 Sep 2020, 2 commits
    • Don't dispatch client_encoding to QE · 9a6cd1ee
      Authored by xiong-gang
      When client_encoding is dispatched to the QEs, error messages
      generated on the QEs are converted to client_encoding, but the QD
      assumes they are in the server encoding, which leads to corruption.
      
      This was fixed in 6X in a6c9b4, but this backport skips the gpcopy
      changes since 5X doesn't support the 'COPY...ENCODING' syntax.
      
      Fixes issue: https://github.com/greenplum-db/gpdb/issues/10815
    • Align Orca relhasindex behavior with Planner (#10788) · 8083a046
      Authored by David Kimura
      Function `RelationGetIndexList()` does not filter out invalid indexes;
      that responsibility is left to the caller (e.g. `get_relation_info()`).
      The issue is that Orca was not checking index validity.
      
      This commit also introduces an optimization to Orca that is already
      used in the Planner, whereby we first check relhasindex before
      checking pg_index.
      
      (cherry picked from commit b011c351)
  25. 17 Sep 2020, 2 commits
    • Do not read a persistent tuple after it is freed · 5f765a8e
      Authored by Asim R P
      This bug was found in a production environment where vacuum on
      gp_persistent_relation was concurrently running with a backend
      performing end-of-xact filesystem operations.  And the GUC
      debug_persistent_print was enabled.
      
      The *_ReadTuple() function was called on a persistent TID after the
      corresponding tuple was deleted with frozen transaction ID.  The
      concurrent vacuum recycled the tuple and it led to a SIGSEGV when the
      backend tried to access values from the tuple.
      
      Fix it by avoiding the debug log message in the case when the
      persistent tuple is freed (i.e., transitioning to the FREE state). All
      other state transitions are logged.
      
      In the absence of a concurrent vacuum, things worked just fine because
      the *_ReadTuple() interface reads tuples from persistent tables
      directly using the TID.
    • Skip FK check when truncating a relation · b50c134b
      Authored by Weinan WANG
      GPDB does not support foreign keys, but it keeps the FK grammar in DDL
      since that reduces the manual workload of migrating databases from
      other systems. Hence, we do not need the FK check for the TRUNCATE
      command; get rid of it.