MongoDB 性能优化方案及实战：解决查询、读写慢问题-JavaScript中文网-JavaScript教程资源分享门户

引言

MongoDB 作为被广泛使用的 NoSQL 数据库，性能是其优点之一。然而，在处理大数据时，MongoDB 也容易遇到查询、读写慢等性能问题。本文将介绍 MongoDB 性能优化方案，并给出具体实例代码，帮助读者解决实际问题。

数据结构优化

MongoDB 是文档型数据库，结构灵活，支持非规范化的数据存储。但过多的嵌套和数组使用可能引起查询效率下降。因此，建议使用扁平化数据结构，减少嵌套和数组使用，尤其是大数组的使用。

例如，下列文档：

-- -------------------- ---- -------
-
    ----- - -------------------------------------
    ------ - --------
    ----- - ---
    --------- - -
        ----------
        ----------
        -----------
        ----------
        ------------
        ----------
        ---------
    --
    --------- - -
        ------ - ----------
        ---------- - ----------
        -------- - ------- -----------
    -
-展开代码

可改为扁平化结构：

-- -------------------- ---- -------
-
    ----- - -------------------------------------
    ------ - --------
    ----- - ---
    --------------- - -----
    --------------- - -----
    ---------------- - -----
    --------------- - -----
    ----------------- - -----
    --------------- - -----
    --------------- - -----
    -------------- - ----------
    ------------------ - ----------
    ---------------- - ------- -----------
-展开代码

这样可以减少数组嵌套和深度，同时提高查询效率。

索引优化

索引是 MongoDB 查询的关键，如何合理使用索引，能够极大提高查询效率。

1.固定前缀索引

固定前缀索引（Prefix Index）是指只对键名前固定长度的部分建立索引。例如，对以下文档建立索引：

{
    "_id" : 1,
    "person" : {
        "name" : "Alice",
        "age" : 25
    }
}

在 person 中查询 name：

db.collection.find({"person.name": "Alice"})

只需在 person 前固定长度 7（即 person. 的长度）建立索引，即可使用固定前缀索引优化查询：

db.collection.createIndex({"person.name": 1}, {partialFilterExpressions: {"person": {"$exists": true}}})

2.复合索引

当多个键会同时用于查询时，应使用复合索引（Compound Index）来提高查询效率。例如，对以下文档建立索引：

{
    "_id" : 2,
    "person" : {
        "name" : "Bob",
        "age" : 30
    }
}

在 person 中对 name 和 age 同时查询：

db.collection.find({"person.name": "Bob", "person.age": {$gte: 25}})

建立 person.name 和 person.age 的复合索引：

db.collection.createIndex({"person.name": 1, "person.age": 1}, {partialFilterExpressions: {"person": {"$exists": true}}})

3.覆盖索引

覆盖索引（Covered Index）是指查询可以直接使用索引返回所需数据，无需访问文档。建立覆盖索引可减少查询时间和磁盘 I/O。例如，对以下文档建立索引：

-- -------------------- ---- -------
-
    ----- - --
    -------- - -
        ------ - ----------
        ----- - --
    --
    --------- - -
        ----------
        ----------
        -----------
        ----------
        ------------
        ----------
        ---------
    -
-展开代码

对 name 和 age 进行查询：

db.collection.find({"person.name": "Charlie", "person.age": 35}, {_id: 0, "person.name": 1, "person.age": 1})

建立 person.name 和 person.age 的覆盖索引：

db.collection.createIndex({"person.name": 1, "person.age": 1, "_id": 0}, {partialFilterExpressions: {"person": {"$exists": true}}})

需要注意的是，在规划索引时，应权衡索引的数量、大小和维护成本，避免过度使用索引。

查询优化

1.限制返回字段

限制返回字段是优化查询最为简单有效的方法之一。一般情况下，不需要返回文档中的所有字段，只需返回需要的字段即可减少网络传输和解析时间，提高查询效率。例如，对以下文档：

-- -------------------- ---- -------
-
    ----- - --
    -------- - -
        ------ - --------
        ----- - --
    --
    --------- - -
        ----------
        ----------
        -----------
        ----------
        ------------
        ----------
        ---------
    --
    --------- - -
        ------ - ----------
        ---------- - ----------
        -------- - ------- -----------
    -
-展开代码

查询 person 中的所有信息：

db.collection.find({"person.name": "Alice"})

限制返回 name 和 age：

db.collection.find({"person.name": "Alice"}, {_id: 0, "person.name": 1, "person.age": 1})

2.使用聚合管道

聚合管道（Aggregation Pipeline）是一种将多个操作组合到一起的数据处理框架。聚合过程逐步将文档批量处理，逐步生成最终的结果。聚合管道与查询的区别在于：查询的处理方式是逐一从文档中挑选符合条件的文档并返回，而聚合管道则是依次将文档放入管道中处理，最后生成结果。因此，聚合管道可以更直观方便地进行数据处理和分析，并且可以使用各种丰富的管道操作符进行处理。例如，对以下文档：

-- -------------------- ---- -------
-
    ----- - --
    -------- - -
        ------ - --------
        ----- - --
    --
    --------- - -
        -
            ------- ----------
            ------- --------- ------------- ------ -------------
        --
        -
            ------- ----------
            ------- --------- ------------- ------ -------------
        --
        -
            ------- -----------
            ------- --------- ------------- ------ -------------
        --
        -
            ------- ----------
            ------- --------- ------------- ------ -------------
        --
        -
            ------- ------------
            ------- --------- ------------- ------ -------------
        --
        -
            ------- ----------
            ------- --------- ------------- ------ -------------
        --
        -
            ------- ----------
            ------- --------- ------------- ------ -------------
        -
    --
    --------- - -
        ------ - ----------
        ---------- - ----------
        -------- - ------- -----------
    -
-展开代码

统计 hobbies 中名称为 reading 和 cooking 的兴趣的数量：

db.collection.aggregate([
    {$match: {"person.name": "Alice"}},
    {$unwind: "$hobbies"},
    {$match: {"hobbies.name": {$in: ["reading", "cooking"]}}},
    {$group: {_id: "$person.name", count: {$sum: 1}}}
])

以上是常用的查询优化方法，下面我们来看看如何将这些方法应用于实际项目中。

实战

需求

我们有一个存储博客文章的 MongoDB 集合，结构如下：

-- -------------------- ---- -------
-
    ------ -------------------------------------
    ------- -
        ------- --------
        -------- -----------------
    --
    -------- --- ----- ------
    ---------- ------- --------
    -------- --
    -------- -
        -------------------------------------
        ------------------------------------
    --
    ----------- -
        -
            ------- -
                ------- --------
                -------- -----------------
            --
            ---------- ----- ------
            ------- -----------------------------------
        --
        -
            ------- -
                ------- --------
                -------- -----------------
            --
            ---------- ----- -------
            ------- -----------------------------------
        -
    --
    ------- -----------------------------------
-展开代码

现在需要实现以下功能：

根据文章标题查询文章
统计每篇文章的浏览量、点赞量和评论数量
统计每位用户发表的文章数、浏览量、点赞量和评论数量

方案

为 title 字段建立单键索引，并限制只返回 _id 和 title 字段
使用聚合管道对数据进行统计
使用聚合管道对数据进行统计

实现

首先，建立单键索引：

db.articles.createIndex({"title": 1})

查询 title 中包含 MongoDB 的文章：

db.articles.find({"title": /MongoDB/}, {"_id": 1, "title": 1})

结果示例：

{ "_id" : ObjectId("61e4bb7a24587f8d77e88e25"), "title" : "Introduction to MongoDB" }
{ "_id" : ObjectId("61e4bb7a24587f8d77e88e26"), "title" : "MongoDB Performance Optimization" }
{ "_id" : ObjectId("61e4bb7a24587f8d77e88e27"), "title" : "MongoDB Data Modeling Techniques" }

接下来，使用聚合管道进行统计。统计每篇文章的浏览量、点赞量和评论数量：

-- -------------------- ---- -------
-----------------------
    -------- ----
    ---------- -
        ------ --
        -------- --
        -------- --
        -------- ------- ----------
        ----------- ------- ------------
    --
--展开代码

其中，$match 筛选条件为空，表示统计集合中所有文章。$project 用于筛选字段，使用 $size 统计数组长度。

结果示例：

{ "_id" : ObjectId("61e4bb7a24587f8d77e88e25"), "title" : "Introduction to MongoDB", "views" : 10, "likes" : 2, "comments" : 3 }
{ "_id" : ObjectId("61e4bb7a24587f8d77e88e26"), "title" : "MongoDB Performance Optimization", "views" : 5, "likes" : 1, "comments" : 1 }
{ "_id" : ObjectId("61e4bb7a24587f8d77e88e27"), "title" : "MongoDB Data Modeling Techniques", "views" : 3, "likes" : 0, "comments" : 0 }

最后，统计每位用户发表的文章数、浏览量、点赞量和评论数量：

-- -------------------- ---- -------
-----------------------
    -------- -
        ---- --------------
        ----------------- ------ ---
        -------------- ------ ----------
        -------------- ------ ------- -----------
        ----------------- ------ ------- -------------
    --
--展开代码

其中，$group 根据 user.email 字段进行分组，使用 $sum 统计数量。

结果示例：

{ "_id" : "userA@gmail.com", "total_articles" : 3, "total_views" : 18, "total_likes" : 4, "total_comments" : 5 }
{ "_id" : "userB@gmail.com", "total_articles" : 1, "total_views" : 5, "total_likes" : 1, "total_comments" : 1 }
{ "_id" : "userC@gmail.com", "total_articles" : 1, "total_views" : 3, "total_likes" : 0, "total_comments" : 0 }

结论

综上所述，MongoDB 的性能优化方案包括数据结构优化、索引优化和查询优化。在实际项目中，应根据需求合理使用这些方法，才能提高 MongoDB 的性能，为用户提供更优质的服务。

来源：JavaScript中文网，转载请注明来源 https://www.javascriptcn.com/post/672f35bfeedcc8a97c8d50ff

MongoDB 性能优化方案及实战：解决查询、读写慢问题

引言

数据结构优化

索引优化

1.固定前缀索引

2.复合索引

3.覆盖索引

查询优化

1.限制返回字段

2.使用聚合管道

实战

需求

方案

实现

结论

纠错反馈

程序员教程

程序员面试题库