使用ORDER和LIMIT子句极慢的PostgreSQL查询_sql

概述我有一个表，让我们称之为“foos”，有近600万条记录。我运行以下查询： SELECT "foos".*FROM "foos"INNER JOIN "bars" ON "foos".bar_id = "bars".idWHERE (("bars".baz_id = 13266))ORDER BY "foos"."id" DESCLIMIT 5 OFFSET 0; 此查询需要很长时间才能我有一个表，让我们称之为“foos”，有近600万条记录。我运行以下查询：

SELECT "foos".*FROM "foos"INNER JOIN "bars" ON "foos".bar_ID = "bars".IDWHERE (("bars".baz_ID = 13266))ORDER BY "foos"."ID" DESCliMIT 5 OFFSET 0;

此查询需要很长时间才能运行(Rails在运行时超时)。有问题的所有ID都有索引。好奇的部分是，如果我删除ORDER BY子句或liMIT子句，它几乎立即运行。

我假设，ORDER BY和liMIT的存在使得Postgresql在查询计划中做出一些不好的选择。任何人都有任何想法如何解决这个问题？

如果它有帮助，这里是所有3种情况的EXPLAIN：

//////// Both ORDER and liMITSELECT "foos".*FROM "foos"INNER JOIN "bars" ON "foos".bar_ID = "bars".IDWHERE (("bars".baz_ID = 13266))ORDER BY "foos"."ID" DESCliMIT 5 OFFSET 0;                                                     query PLAN                                                     -------------------------------------------------------------------------------------------------------------------- limit  (cost=0.00..16663.44 rows=5 wIDth=663)   ->  nested Loop  (cost=0.00..25355084.05 rows=7608 wIDth=663)         Join Filter: (foos.bar_ID = bars.ID)         ->  Index Scan Backward using foos_pkey on foos  (cost=0.00..11804133.33 rows=4963477 wIDth=663)               Filter: (((NOT privacy_protected) OR (user_ID = 67962)) AND ((status)::text = 'DONE'::text))         ->  Materialize  (cost=0.00..658.96 rows=182 wIDth=4)               ->  Index Scan using index_bars_on_baz_ID on bars  (cost=0.00..658.05 rows=182 wIDth=4)                     Index Cond: (baz_ID = 13266)(8 rows)//////// Just liMITSELECT "foos".*FROM "foos"INNER JOIN "bars" ON "foos".bar_ID = "bars".IDWHERE (("bars".baz_ID = 13266))liMIT 5 OFFSET 0;                                                              query PLAN                                                               --------------------------------------------------------------------------------------------------------------------------------------- limit  (cost=0.00..22.21 rows=5 wIDth=663)   ->  nested Loop  (cost=0.00..33788.21 rows=7608 wIDth=663)         ->  Index Scan using index_bars_on_baz_ID on bars  (cost=0.00..658.05 rows=182 wIDth=4)               Index Cond: (baz_ID = 13266)         ->  Index Scan using index_foos_on_bar_ID on foos  (cost=0.00..181.51 rows=42 wIDth=663)               Index Cond: (foos.bar_ID = bars.ID)               Filter: (((NOT foos.privacy_protected) OR (foos.user_ID = 67962)) AND ((foos.status)::text = 'DONE'::text))(7 rows)//////// Just ORDERSELECT "foos".*FROM "foos"INNER JOIN "bars" ON "foos".bar_ID = "bars".IDWHERE (("bars".baz_ID = 13266))ORDER BY "foos"."ID" DESC;                                                              query PLAN                                                               --------------------------------------------------------------------------------------------------------------------------------------- Sort  (cost=36515.17..36534.19 rows=7608 wIDth=663)   Sort Key: foos.ID   ->  nested Loop  (cost=0.00..33788.21 rows=7608 wIDth=663)         ->  Index Scan using index_bars_on_baz_ID on bars  (cost=0.00..658.05 rows=182 wIDth=4)               Index Cond: (baz_ID = 13266)         ->  Index Scan using index_foos_on_bar_ID on foos  (cost=0.00..181.51 rows=42 wIDth=663)               Index Cond: (foos.bar_ID = bars.ID)               Filter: (((NOT foos.privacy_protected) OR (foos.user_ID = 67962)) AND ((foos.status)::text = 'DONE'::text))(8 rows)

当您同时具有liMIT和ORDER BY时，优化程序已决定更快地通过键递减来遍历foo上未过滤的记录，直到它获得其余标准的五个匹配项。在其他情况下，它只是将查询作为嵌套循环运行，并返回所有记录。

Offhand，我想说的问题是，PG不会调查各种ID的联合分布，这就是为什么计划是这样次优的。

对于可能的解决方案：我假设你最近运行了ANALYZE。如果没有，请这样做。这可能解释了为什么你的估计时间很高，即使在快速返回的版本。如果问题仍然存在，可以运行ORDER BY作为子查询，并在外部查询中敲击liMIT。

总结

以上是内存溢出为你收集整理的使用ORDER和LIMIT子句极慢的PostgreSQL查询全部内容，希望文章能够帮你解决使用ORDER和LIMIT子句极慢的PostgreSQL查询所遇到的程序开发问题。

如果觉得内存溢出网站内容还不错，欢迎将内存溢出网站推荐给程序员好友。

欢迎分享，转载请注明来源：内存溢出

原文地址: http://outofmemory.cn/sjk/1171637.html