PostgreSQL中Review PG的Optimizer机制如何优化函数

网友投稿 254 2024-01-05

PostgreSQL中Review PG的Optimizer机制如何优化函数

小编给大家分享一下PostgreSQL中Review PG的Optimizer机制如何优化函数，相信大部分人都还不怎么了解，因此分享这篇文章给大家参考一下，希望大家阅读完这篇文章后大有收获，下面让我们一起去了解一下吧！

一、Optimizer Functions

Optimizer Functions-查询优化函数

The primary entry point is planner().

planner() //主入口

set up for recursive handling of subqueries

-subquery_planner()//planner->subquery_planner

pull up sublinks and subqueries from rangetable, if possible

canonicalize qual

Attempt to simplify WHERE clause to the most useful form; this includes

flattening nested AND/ORs and detecting clauses that are duplicated in

different branches of an OR.

simplify constant expressions

process sublinks

convert Vars of outer query levels into Params

--grouping_planner()//planner->subquery_planner->grouping_planner

preprocess target list for non-SELECT queries

handle UNION/INTERSECT/EXCEPT, GROUP BY, HAVING, aggregates,

ORDER BY, DISTINCT, LIMIT

---query_planner()//subquery_planner->grouping_planner->query_planner

make list of base relations used in query

split up the qual into restrictions (a=1) and joins (b=c)

find qual clauses that enable merge and hash joins

----make_one_rel()//...grouping_planner->query_planner->make_one_rel

set_base_rel_pathlists() //为每一个RelOptInfo生成访问路径

find seqscan and all index paths for each base relation

find selectivity of columns used in joins

make_rel_from_joinlist() //使用遗传算法或动态规划算法构造连接路径

hand off join subproblems to a plugin, GEQO, or standard_join_search()

-----standard_join_search()//这是动态规划算法

call join_search_one_level() for each level of join tree needed

join_search_one_level():

For each joinrel of the prior level, do make_rels_by_clause_joins()

if it has join clauses, or make_rels_by_clauseless_joins() if not.

Also generate "bushy plan" joins between joinrels of lower levels.

Back at standard_join_search(), generate gather paths if needed for

each newly constructed joinrel, then apply set_cheapest() to extract

the cheapest path for it.

Loop back if this wasnt the top join level.

Back at grouping_planner:

do grouping (GROUP BY) and aggregation//在最高层处理分组/聚集/唯一过滤/排序/控制输出元组数目等

do window functions

make unique (DISTINCT)

do sorting (ORDER BY)

do limit (LIMIT/OFFSET)

Back at planner():

convert finished Path tree into a Plan tree

do final cleanup after planning二、Optimizer Data Structures

Optimizer Data Structures

数据结构

PlannerGlobal - global information for a single planner invocation

PlannerInfo - information for planning a particular Query (we make

a separate PlannerInfo node for each sub-Query)

RelOptInfo - a relation or joined relations

RestrictInfo - WHERE clauses, like "x = 3" or "y = z"

(note the same structure is used for restriction and

join clauses)

Path - every way to generate a RelOptInfo(sequential,index,joins)

SeqScan - represents a sequential scan plan //顺序扫描

IndexPath - index scan //索引扫描

BitmapHeapPath - top of a bitmapped index scan //位图索引扫描

TidPath - scan by CTID //CTID扫描

SubqueryScanPath - scan a subquery-in-FROM //FROM子句中的子查询扫描

ForeignPath - scan a foreign table, foreign join or foreign upper-relation //FDW

CustomPath - for custom scan providers //定制化扫描

AppendPath - append multiple subpaths together //多个子路径APPEND,常见于集合操作

MergeAppendPath - merge multiple subpaths, preserving their common sort order //保持顺序的APPEND

ResultPath - a childless Result plan node (used for FROM-less SELECT)//结果路径(如SELECT 2+2)

MaterialPath - a Material plan node //物化路径

UniquePath - remove duplicate rows (either by hashing or sorting) //去除重复行路径

GatherPath - collect the results of parallel workers //并行

GatherMergePath - collect parallel results, preserving their common sort order //并行,保持顺序

ProjectionPath - a Result plan node with child (used for projection) //投影

ProjectSetPath - a ProjectSet plan node applied to some sub-path //投影(应用于子路径上)

SortPath - a Sort plan node applied to some sub-path //排序

GroupPath - a Group plan node applied to some sub-path //分组

UpperUniquePath - a Unique plan node applied to some sub-path //应用于子路径的Unique Plan

AggPath - an Agg plan node applied to some sub-path //应用于子路径的聚集

GroupingSetsPath - an Agg plan node used to implement GROUPING SETS //分组集合

MinMaxAggPath - a Result plan node with subplans performing MIN/MAX //最大最小

WindowAggPath - a WindowAgg plan node applied to some sub-path //应用于子路径的窗口函数

SetOpPath - a SetOp plan node applied to some sub-path //应用于子路径的集合操作

RecursiveUnionPath - a RecursiveUnion plan node applied to two sub-paths //递归UNION

LockRowsPath - a LockRows plan node applied to some sub-path //应用于子路径的的LockRows

ModifyTablePath - a ModifyTable plan node applied to some sub-path(s) //应用于子路径的数据表更新(如INSERT/UPDATE操作等)

LimitPath - a Limit plan node applied to some sub-path//应用于子路径的LIMIT

NestPath - nested-loop joins//嵌套循环连接

MergePath - merge joins//Merge Join

HashPath - hash joins//Hash Join

EquivalenceClass - a data structure representing a set of values known equal

PathKey - a data structure representing the sort ordering of a path

The optimizer spends a good deal of its time worrying about the ordering

of the tuples returned by a path. The reason this is useful is that by

knowing the sort ordering of a path, we may be able to use that path as

the left or right input of a mergejoin and avoid an explicit sort step.

Nestloops and hash joins dont really care what the order of their inputs

is, but mergejoin needs suitably ordered inputs. Therefore, all paths

generated during the optimization process are marked with their sort order

(to the extent that it is known) for possible use by a higher-level merge.

优化器在元组的排序上面花费了不少时间,原因是为了在Merge Join时避免专门的排序步骤.

It is also possible to avoid an explicit sort step to implement a users

ORDER BY clause if the final path has the right ordering already, so the

sort ordering is of interest even at the top level. grouping_planner() will

look for the cheapest path with a sort order matching the desired order,

then compare its cost to the cost of using the cheapest-overall path and

doing an explicit sort on that.

When we are generating paths for a particular RelOptInfo, we discard a path

if it is more expensive than another known path that has the same or better

sort order. We will never discard a path that is the only known way to

achieve a given sort order (without an explicit sort, that is). In this

way, the next level up will have the maximum freedom to build mergejoins

without sorting, since it can pick from any of the paths retained for its

inputs.

以上是“PostgreSQL中Review PG的Optimizer机制如何优化函数”这篇文章的所有内容，感谢各位的阅读！相信大家都有了一定的了解，希望分享的内容对大家有所帮助，如果还想学习更多知识，欢迎关注行业资讯频道！

标签：优化输出学习数据

暂时没有评论，来抢沙发吧~

PostgreSQL中Review PG的Optimizer机制如何优化函数

PostgreSQL中Review PG的Optimizer机制如何优化函数

微前端架构如何改变企业的开发模式与效率提升

前端开源框架如何提升开发效率与用户体验的关键因素

前端移动端框架如何推动数字化转型与创新发展

最近发表

更多内容

小程序SDK

Finclip技术文档

小程序开发

小程序容器

小程序框架

Finclip小程序平台

Finclip用户投稿

车联网

推荐文章

小程序SDK是什么意思？小程序sdk和插件有什么区别？

小程序支付功能怎么实现？

企业app开发流程是什么？

app运营模式有哪些？

小程序多端引流怎么做？

小程序生态分析的机会和威胁

Flutter入门这一篇效率文章就够了

原生与跨平台解决方案分析,跨平台软件开发技术方案

热更新技术：让软件更新变得更加轻松快速

解决方案

银行解决方案

证券解决方案

互联网解决方案

政企OA解决方案

科技解决方案

loT解决方案

信任解决方案

热评文章

AppCan:基于混合模式的移动应用开发,移动混合模

Hybrid App混合模式开发的了解

小程序容器技术助力券商数字营销突围，小程序容器化的意

用mpvue开发微信小程序基础知识（vue.js开发

小程序多端框架全面测评对比，强烈推荐！

券商app架构 - 解析券商应用程序的构建与设计