C++复习（入门）#

EE538, Instructor: Arash Saifhashemi

USC, Viterbi school of Engineering, Electrical and Computer Engineering

暂不涉及继承，多态，虚函数等。

除C++外，包含一些基础的数据结构（以及STL的用法），基础的算法思想。

Instruction#

C++基础

语法

软件工程基础

测试、Source Control, Shell scripts
动态编程
OOP

算法与数据结构基础

树、链表、哈希、堆…
时间复杂度分析
算法：贪心，recursion递归，动态规划…

Programmer Basic Tools#

注册Stackoverflow 账号

注册Github账号

VS Code
Git
Linux-compatible terminal
C++ Toolchain
- Mac: Xcode + Bazel
- Windows: Cywin or MinGW
- Linux: GCC + Bazel

C++的设计理念和特性#

快，不像python或java一样在运行代码的时候，编译器像个保姆，内置了很多检查机制。而C++更像“程序员知道自己在做什么”。

“You only pay for what you use”

C++的灵魂，是如果你不需要某个功能（比如安全检查），你就不用为它付出性能代价。

例如：如果一个数组的大小是10，我们试图访问第100个元素，Python会抛出IndexError并停止运行。C++会直接去内存中偏移量为100的位置读写数据。这样会导致：

那块内存正好没有被使用，程序看起来正常
那块内存存折系统关键指令，修改后导致程序崩溃
其他变量的值被修改，导致出现难以排查的逻辑错误

但是现在比较新的C++，很大程度上规避了这些风险，虽然仍然存在，但是可以让C++具有高性能的同时，获得了更高的安全性。

编译器#

.h file: Only the declarations

.cc/cpp file: Implementations —compile—> .o file —> executable

关于写码#

Find corner cases 边界问题 Pseudo-Code 伪代码

规范的步骤：

仔细分析问题，细节考虑
边界问题的考虑
写出算法的outline（可能有很多种思路）
1. 对比性能，谁更快，更高效
实现
测试
验证代码的正确性
1. Induction 数学归纳法
2. Contradiction 反证法
3. Case Analysis 分类讨论
4. Other techniques

Induction#

数学归纳法：k = 1时成立，如果 k 时成立，推导对于k+1也成立

Unit Tests#

A unit test is a piece of code that tests a function or a class
Unit tests are written by the developer

Version Control#

Who, what, where, when, why

Git#

Modified: 只是修改，还没有提交到仓库
staged: 已经在当前版本中标记了已修改的文件，以便将其放入下次提交快照(snapshot)中
Committed: 数据已经安全保存到数据库

Runtime Analysis#

Count the number of elementary operations instead of timing them.
Consider the Worst Case. Make the runtime a function of the size of the input. It is just an estimate.

时间复杂度的计算：

C++语法#

overloaded重载

操作符#

其中，Bitwise operators的用法是：

数据类型#

Primary

int
char
bool
float
double
void

Derived

array
pointer
References

User Defined

struct
class
union
enum
typedef

Variables and Modifiers

signed
unsigned
long
short

各种数据类型的bytes，bits长度

enum#

enum class更规范

1
// Enum type in C:
2
enum ColorPallet1 { Red, Green, Blue };
3
enum ColorPallet2 { Yellow, Orange, Red };
4

5
// Enum Class in C++
6
// Declaration
7
enum class ColorPalletClass1 { Red, Green, Blue };
8
enum class ColorPalletClass2 { Yellow, Orange, Red };
9

10
// Assignment
11
ColorPalletClass1 col1 = ColorPalletClass1::Red;
12
ColorPalletClass2 col2 = ColorPalletClass2::Red;

auto#

1
for (auto n : my_vector){
2
  std::cout<< n;
3
}

Pointers#

1
int a = 1;
2
int* pp = &a;
3
int** r = &pp;
4
std::cout << "r: " << r << std::endl;
5
std::cout << "*r: " << *r << std::endl;
6
std::cout << "**r: " << **r << std::endl;
7
// 如果写
8
int* r = &pp;
9
//会报错，因为&pp的类型是int**，而定义的r是int*
10
//int* r 和 int *r 的两种写法是等价的，最好固定一种写法
11
//另外，在定义时不要写成
12
int* p,q; //这实际上定义的是一个指针p和一个整型q, 另外，指针定义的时候，要初始化：int* p = nullptr

Pointer : new and delete

使用的是堆heap，动态内存分配

1
int i = 5;
2
int* p = new int;
3
p = &i; //上面定义new int的地址就已经丢失了
4
delete p;// 忘记delete会导致内存泄漏

一般来讲，引用比指针更加的安全

References#

1
int i = 10;
2
int& j = i;
3
j++;

Passing parameters

参数传递要用指针或引用，用变量本身只是临时复制了一个函数内的变量，不会改变外部的值

1
int main() {
2
  std::vector<int> my_vector = {1, 2, 3, 4, 5, 6, 7, 8};
3
  for (auto n : my_vector) {
4
    n++;
5
  }
6
  // What is the value of my_vector? 不变
7
  for (auto &n : my_vector) {
8
    n *= 10;
9
  }
10
  //{10, 20, 30, 40, 50, 60, 70, 80}
11
  return 0;
12
}

Pointers VS Reference#

指针变量需要额外占用一块内存，存的是原变量的地址。

引用是原变量的“绰号”，在例如函数中，可以直接操作原有的那块内存，加上const表示”只读”

Array#

Size is fixed at compile time
Cannot be resized
There is not a reliable way to find the array size
Can be misused (out of bound)

arr[i] → *(arr + i) 本质上依旧是指针

Dynamic Array#

==Size doesn’t have to be known at compile time==
Cannot be resized
There is no way to find the size.
Can be misused (out of bound)

1
arr = new int[size];
2
delete [] arr;
3
push_back(int*& arr, int& size, int new_item)//和动态数组相关的函数，也需要加入size这个参数

Dynamic arrays should really carry their size with them, that means we often use two variables for them:

A pointer to the beginning of the array
The size of the array

对于现代C++，会尽量避免使用裸指针的用法，使用vector这个STL用法的函数

std::vector#

An enhanced version of Arrays

● ==Size doesn’t have to be known at compile time==

● ==Can be resized automatically== ● ==The size is always kept updated==

● Can be misused (out of bound) with [ ] ○ ==Misuse can be controlled using at() method==

Important methods to know:

v.push_back(1). Compare this to push_back(my_dyn_array, size, 1)
v.pop_back()
v.insert()
v.erase()
size(): v.size()
What is the runtime complexity of v[i] ?

std::string#

push_back()
pop_back()
getline()
concat
insert()

Flow Control#

If, switch

while, for, do-while

Namespaces#

1
namespace ns1 {
2
int x = 1;
3
void Print() { std::cout << "Printing example 1." << std::endl; }
4
} // namespace ns1
5
namespace ns2 {
6
int x = 2;
7
void Print() { std::cout << "Printing example 2." << std::endl; }
8
} // namespace ns2
9
// Global namespace
10
int x = 3; // ::x
11

12
int main() {
13
std::cout << "ns1::x: " << ns1::x << std::endl;
14
std::cout << "ns2::x: " << ns2::x << std::endl;
15
ns1::Print();
16
ns2::Print();

Variable scope#

Global全局变量
Local局部变量

Const#

将一个变量定义为常量

可以用到函数里，也可以用到类(Class)上

STL#

Standard Template Library of C++

std::vector#

std::set< Type >#

有序的去重集合

Important methods to know

size()
insert()
count()
find()
erase()

Things to know about std::set: ○ Internally it is sorted based on keys ○ Access, Insert, and find complexity is O(log(n)) ○ Reinserting the same key will just update the data, there is no duplicate keys

std::pair<T1, T2>#

○ Couples together a pair of things (of type T1 to T2) ○ For a pair of items p:

Access the first item by p.first
Access the second item by p.second.

1
std::pair<std::string, int> p1("Ari", 3);
2
std::pair<std::string, int> p2("Ted", 4);
3
std::pair<std::string, int> p3 = p1;
4

5
std::cout << "p1.first: " << p1.first << std::endl;
6
std::cout << "p1.second: " << p1.second << std::endl;

std::map<Key, Value>#

也叫字典Dict，底层用红黑树实现

Important methods to know ○ size() ○ insert() ○ count()

Things to know about std::map: ○ Internally it is sorted based on keys ○ Access, Insert, and find complexity is O(log(n)) ○ Map is really a collection of pairs ○ Accessing a non-existent key using [ ], creates that key ○ No duplicate keys

关于拉链法和如何设计哈希函数，可以参考之前专门讲hashmap的博客

std::unordered_map#

底层用哈希表实现

map / set = 有序、稳定、慢一点 unordered_map / unordered_set = 无序、快、但不稳定

Iterators#

Used for iteration of STL objects

STL: A set of C++ template classes and functions

Four components ○ Algorithms ○ Containers: vector, set, map ○ Functions ○ Iterators

1
std::vector<int> v = {1, 2, 3, 4, 5};
2
// An easy way of iteration
3
for (int n : v) {
4
std::cout << "n: " << n << std::endl;
5
}
6
// General way of iteration
7
std::vector<int>::iterator it;
8
for (it = v.begin(); it != v.end(); ++it) {
9
int n = *it;//可以理解为一个指针
10
std::cout << "n: " << n << std::endl;
11
}

v.begin(): address of the first item

v.end(): address AFTER the last item or NULL

A More Modern Way of Using Iterators

1
// using auto
2
std::set<int> s = {1, 2, 3, 4, 5};
3
for (auto it = s.begin(); it != s.end(); ++it) {
4
const int &n = *it;//const防止修改，引用防止复制。想要修改就把const去掉
5
std::cout << "n: " << n << std::endl;
6
}

Why Iterators?

迭代器 = STL的泛化“指针”，可以兼容所有STL基本容器的迭代方法，统一接口，切不暴露底层的指针，支持不同类型的访问，代码复用性好

1
// Insert a number before 3
2
std::vector<int> v = {1, 2, 3, 4, 5};
3
for (auto it = v.begin(); it != v.end(); ++it) {
4
  const int &n = *it;
5
  if (n == 3) {
6
    it = v.insert(it, 12);//这里会返回一个迭代器对象，必须重新对it赋值，不能只调用，因为插入后，原来的it失效了，下一次使用*it时，会报UB的错(Undefined Behavior!)
7
    ++it;
8
  }
9
}

STL Algorithm#

Passing Functions As Parameters

1
int BinaryOperation(int a, int b, std::function<int(int, int)> func) {
2
return func(a, b);
3
}
4
int Add(int a, int b) { return a + b; }
5
int Mult(int a, int b) { return a * b; }
6

7
int main() {
8
int result1 = BinaryOperation(10, 20, Add);
9
int result2 = BinaryOperation(10, 20, Mult);
10
}

一些用法：

1
int IncrementByTen(int a) { return a + 10; }
2

3
int main() {
4
std::vector<int> inputs = {0, 1, 2, 3, 4, 5, 6, 7, 8};
5
std::vector<int> outputs(inputs.size());
6
// Increment all of them
7
std::transform(inputs.begin(), inputs.end(),
8
outputs.begin(), IncrementByTen);
9
}
10

11
bool IsOdd(int i) { return ((i % 2) == 1); }
12

13
int main() {
14
std::vector<int> inputs = {0, 1, 2, 3, 4, 5, 6, 7, 8};
15
std::vector<int> outputs(inputs.size()); // {0, 0, ...,0}
16
// Increment all of them
17
std::copy_if(inputs.begin(), inputs.end(),
18
outputs.begin(), IsOdd);
19
// Output = {1, 3, 5, 7, 0,...,0}
20
}

Struct & Class#

类的组成

成员变量
构造函数 How do u build an instance
方法（成员函数）
析构函数 How do u clean up after an instance
公有/私有

当没有方法（成员函数）时，会用结构体

Constructor#

构造函数

Deep & shallow copy#

复制构造函数

实现：

1
Point(const Point &rhs){//引用传递，另外const表示让数据来源“只读”，不会被意外修改
2
  //非指针的属性，正常赋值
3
  //动态内存的属性，要新开一块内存，然后把老对象的值复制给新的
4
}

==构造函数，不返回类型==

在构造函数中，传参对象时的三种方式：

值传递：❌，构造函数会一直被调用，陷入死循环
指针传递：✅，但是容易出错
引用传递：✅，简洁，调用复制构造函数

deepcopy这里写错了，应该是*z_ = *rhs.z_;

在使用动态内存分配时，需要使用deep copy，也就是不能使用默认的复制构造函数（这样实现的是浅拷贝），要自己定义：

1
...
2
int main(){
3
  MyString str("I love C++, yeah!");
4
  {
5
    MyString str2(str);//调用默认的复制构造函数实现浅复制
6
    cout<<str2.get_string()<<endl;
7
  }//超出对象str2的生存期，str2被析构
8
  cout<<str.get_string()<<endl;//因为是浅复制，数据已被删除，会输出乱码
9
  return 0;//在析构str时，由于属性已经在str2时被回收，发生运行错误
10
}

由于浅复制，str2的m_pbuf和str的m_pbuf指向同一块堆内存空间

复制构造函数的使用场景

用类的一个对象去初始化该类的另一个对象时（显式）
函数的形参是类的对象时（可以理解为隐式）
函数的返回值时类的对象时（可以理解为隐式）

本质上都是在用类的对象去初始化另一个对象

1
void F(Point local_p) {
2
  //会调用拷贝构造函数和析构函数
3
  std::cout << "Inside F. Pass by value" << std::endl;
4
  std::cout << "local_p.i_: " <<local_p.i_ << ", local_p.j_: " << local_p.j_<< std::endl;
5
}

但是，如果在某函数传参时，使用的是引用传递，就不会调用拷贝构造函数和析构函数：

1
void F_reference(Point &p) {//只传递地址，没有新对象产生，本质是一个别名
2
  std::cout << "Inside F. Pass by reference" << std::endl;
3
}//

练习：

Write Deep Copy for the SinglyLinkedList class

1
class SinglyLinkedList {
2
public:
3
  SinglyLinkedList() { }
4
  ~SinglyLinkedList() { }
5
  ListNode *head_;
6
  // ...
7
};
8

9
//错完的版本
10
SinglyLinkedList(const SinglyLinkedList& head_){
11
  if ()
12
  ListNode* newNode = new ListNode();
13
  newNode->value = head_->value;
14
  newNode->next = head_->next;
15
}
16

17
//正确的
18
// 1. 这是一个拷贝构造函数，没有返回类型，参数是常量引用以保证性能和安全
19
SinglyLinkedList(const SinglyLinkedList& other) {
20
    if (other.head_ == nullptr) {
21
        head_ = nullptr;
22
        return;
23
    }
24

25
    // 2. 首先复制头节点，申请新内存
26
    head_ = new ListNode(other.head_->value);//由于ListNode的构造函数的默认实现，next指向nullptr
27

28
    ListNode* currentNew = head_;
29
    ListNode* currentOld = other.head_->next;
30

31
    // 3. 循环遍历旧链表，逐个“克隆”
32
    while (currentOld != nullptr) {
33
        // 为新节点申请独立内存，而不是直接指向旧地址
34
        currentNew->next = new ListNode(currentOld->value);
35

36
        // 移动指针继续往后走
37
        currentNew = currentNew->next;//此时为nullptr
38
        currentOld = currentOld->next;
39
    }
40
}

初始化列表#

1
class Point {
2
public:
3
  // PARAMETERIZED Constructor Version 1
4
  Point(int i, int j) {
5
    std::cout << "**PARAMETERIZED constructor." << std::endl;
6
    i_ = i;
7
    j_ = j;
8
    z_ = new int;
9
  }
10
  // PARAMETERIZED Constructor Version 2 初始化列表
11
  Point(int i, int j) : j_(j), i_(i) { //先i后j
12
    std::cout << "**PARAMETERIZED constructor." << std::endl;
13
    z_ = new int;
14
  }
15
private:
16
  int i_;
17
  int j_;
18
};

==关于初始化列表使用时，成员初始化的顺序：==

==由成员定义的顺序决定，而不是初始化列表中的属性==

Destructor#

In most cases you only need a destructor only if you are using dynamic memory allocation.

程序会自己隐式地调用析构函数，因此在使用指针（动态内存分配）时，只需要实现一个析构函数，而不需要显示调用它

操作符重载#

Pre and Postfix unary operators

1
// Prefix overload
2
// ++p;
3
Point& operator++() {
4
  i_++;//直接修改原对象
5
  j_++;
6
  return *this;//返回对象本身
7
}
8

9
// Postfix overload
10
// p++;
11
Point operator++(int) {
12
Point temp = *this;//先备份
13
  i_++; //修改原对象
14
  j_++;
15
  return temp;//返回备份
16
}
17

18
class Point {
19
public:
20
  // Copy assignment
21
  Point &operator=(const Point &rhs) {//这里必须使用引用，如果返回值（对象），会调用拷贝构造函数和析构函数（之前提到的）
22
    i_ = rhs.i_;
23
    j_ = rhs.j_;
24
    std::cout << "Copy assignment" << std::endl;
25
    return *this; // 赋值，改变自身，返回自身引用。注意：*this表示的是对象，this是指针。
26
  }
27
  Point operator+(const Point &rhs) const {
28
    Point res; //是函数内部的局部变量，函数执行完成会被销毁，
29
    res.i_ = i_ + rhs.i_;
30
    res.j_ = j_ + rhs.j_;
31
    return res; // 加法，创建新对象，返回新对象。不能使用引用，返回一个已被销毁的内存会崩溃
32
  }
33
  // Prefix overload, Point p2 = ++p1;
34
  Point &operator++() {
35
    i_++;
36
    j_++;
37
    return *this;
38
  }
39
  // Postfix overload, Point p2 = p1++;
40
  Point operator++(int) {
41
    Point temp = *this; // 1.
42
    ++*this; // 2
43
    return temp; // 3.
44
  }

++p VS p++#

++p: 先自增，再参与表达式运算（返回新值）

p++: 先参与表达式运算（返回旧值），再自增

各有各的用途

场景	推荐写法	原因
基础类型 (`int`, `char`)	`++i` 或 `i++`	编译器会优化，两者基本无区别
自定义类对象	`++p`	避免调用拷贝构造函数，效率更高
STL 迭代器 (如 `list`)	`++it`	遵循标准 C++ 最佳实践，保证性能最优

对于for循环，在 for(init; condition; increment) 中，最后一项 increment 是一个独立表达式。

当你写 ++i 时，它返回新值的引用，但循环逻辑并不使用这个返回值。
当你写 i++ 时，它确实创建了一个旧值的副本并返回了，但 for 循环的控制结构直接忽略了这个返回值。

循环关心的只是 i 变量本身是否被修改了。 无论前置还是后置，i 最终都加了 1，所以循环能正常跑下去。

for中，二者都可，但代价不同。

重载代码：

++i 的逻辑：

C++

1
i = i + 1;
2
return i; // 直接给引用，没开销

i++ 的逻辑：

C++

1
int temp = i; // 额外开销：必须备份旧值
2
i = i + 1;
3
return temp;  // 返回备份，然后备份销毁

Summary#

时间/空间复杂度计算#

It’s called runtime complexity, but it’s really number of operations!

number of elementary operations in the worst case as a function of the input size.

在对比时，忽略到最大的一项 (n —> ∞)

忽略乘在前面的系数，加在后面的系数

时间复杂度取决于：input size 和 Algorithm

Limitations of Big O#

图二中，使用Vector要比Linked List快5-20倍，因为cache要不停的在内存中找，vector是连续内存存储，可以一次把一整块cache line读进cache。而list的节点分散在内存中，每次next都是一次指针跳转，cache line存在浪费。

Data Structure#

数据结构

std::stack#

栈，注意在top()和pop()之前，先判空

1
std::stack<int> s;
2
int r = s.top(); // Seriously?
3
s.pop(); // Don't do this either!
4

5
// Do this instead
6
if(!s.empty()){
7
  s.pop();
8
}

std::queue#

队列，FIFO

empty() 判断是否为空
size()
front() //read front
push()
pop()

std::list#

链表，是一个双向链表

优缺点：

内存不连续，查找很慢
插入/删除快
对缓存（Cache）不友好，现代CPU处理连续内存很快，但List散落在各处，导致CPU频繁去内存里“抓取”，命中率低（Cache Miss）。所以最好不要使用

std::list indexing#

可以使用iteration迭代器

Using iterators, we only have access to the front and back of the list

Front: begin()
One after back: end()
So index i would be:
- it=begin(); it++ (i times)
We can’t directly add to iterators
STL provides these functions:
- std::advance
- std::next

1
std::list<int> myList = {10, 20, 30, 40};
2
auto it = myList.begin();
3
std::advance(it, 2); // 手动移动到索引 2 的位置（即第三个元素 30）
4
std::cout << *it;

链表的实现#

1
struct ListNode {
2
  int val;
3
  ListNode *next;
4
  ListNode(int x) : val(x), next(nullptr) {} //initialization list
5
};

How to find the last node?#

1
p = head;
2
while (p != nullptr) {
3
  p_prev = p;
4
  p = p->next;
5
}// p_prev就是

进行优化：

1
p = head;
2
while (p->next != nullpte){
3
  p = p->next;
4
}

push_back(22)#

1
p = getBackPointer();
2
ListNode* newNode = new ListNode;
3
p->next = newNode;

pop_back()#

1
p = GetOneBeforeBackPointer(); // p->next->next != nullptr;
2
...

insert_after(p, 22)#

1
ListNode* newNode = new ListNode;
2
newNode->next = p->next;
3
p->next = newNode;

erase() 不用p_prev.#

1
p->value = p->next->value; //复制后一个的值给自己
2
p->next = p->next->next; //跳过下一个节点
3
delete p;

If there are a lot of erase and insert, it seems that a linked list is faster. Despite this, you should almost never use a linked list!

Trees#

Tree Data Structure#

Tree#

判断是否为树的定义就略过了，具有递归性Recursive

Binary Tree#

二叉树

Full Binary Tree 满二叉树
- Each node has 0 or 2 children (no single child node)
Complete Binary Tree 完全二叉树
- All levels are filled, except the last one
- Last level is filled from left to right
Perfect Binary Tree 完美二叉树
- All interior nodes have two children and all leaves have the same height.
- Last level is completely full
Balanced Binary Tree 平衡二叉树
- The left and right subtrees of every node differ in height by no more than 1左右子树高度差不超过1
Binary Search Tree 二叉搜索树
- 左边比右边小
- 没有重复的节点

Tree Height#

log2(n+1) <= height <= n，对于有N个节点的二叉树
log2(n) appears in runtime of a lot of tree-based algorithms
k层最多有 $2^{k-1}$ 个节点
共k层的树最多有 $2^k-1$ 个节点

Tree Algorithms (Recursion)#

Traversal#

遍历

Depth First (口诀：Node的位置)
- Preorder : Node, left, right
- Inorder : left, Node, right
- Postorder : left, right, Node
Breadth first
- Level Order 层序遍历

1
print(root){
2
  if (root == nullptr)  return;
3
  cout << root->val;
4
  print(root->left);
5
  print(root->right);
6
}
7

8
print(root){
9
  if (root == nullptr)  return;
10
  print(root->left);
11
  cout << root->val;
12
  print(root->right);
13
}
14

15
print(root){
16
  if (root == nullptr)  return;
17
  print(root->left);
18
  print(root->right);
19
  cout << root->val;
20
}

Storing Binary Trees#

in Vectors：

完全二叉树和完美二叉树非常适合存入vector/array中，因为效率高，浪费少，这也是为什么我们喜欢它们

Using Pointers：

1
struct TreeNode {
2
  int val;
3
  TreeNode *left;
4
  TreeNode *right;
5
  TreeNode(int x) : val(x), left(nullptr), right(nullptr) {}
6
};

Storing General Trees#

Using Pointers

1
struct TreeNode {
2
  int val;
3
  vector<TreeNode *> children;
4
  TreeNode(int x) : val(x) {}
5
};

Binary Search Tree#

14 2 18 43 41 15 0

Convert this vector to BST

Insert in BST#

1
void BST::insert(TreeNode *root, int v) {
2
  if (root == nullptr) {
3
    root = new TreeNode(v);
4
  } else if (v < root->val) {
5
    insert(root->left, v);
6
  } else if (v > root->val) {
7
    insert(root->right, v);
8
  }
9
}
10
t.insert(t.root_, 8);
11
t.insert(t.root_, 3);
12
t.insert(t.root_, 10);
13
t.insert(t.root_, 2);

![截屏2026-01-06 23.54.15](C++/截屏2026-01-06 23.54.15.png)

Search in BST#

![截屏2026-01-06 23.56.14](C++/截屏2026-01-06 23.56.14.png)

Visiting a BST in LNR order will give us nodes in Ascending order:

2, 3, 4, 5, 7, 8, 10, 12, 14

Heaps#

Heap Data Structure#

Min Heap#

最小堆

complete binary tree
parent is less than or equal to both children. Root is the smallest
Functions
- Push(key)
- Pop()
- Key = top() //就是data_[0]

因为是完全二叉树的缘故，堆很适合使用vector存储。另外min heap找到min的时间复杂度是O(1)

1
class Heap {
2
public:
3
  int GetParentIndex(int i);
4
  int GetLeftIndex(int i);
5
  int GetRightIndex(int i);
6
  int GetParent(int i)
7
  int GetSmallestChildIndex(int i);
8
  int top();
9
  void push(int v);
10
  void pop();
11
  void TrickleUp(int i);
12
  void TrickleDown(int i);
13
private:
14
  std::vector<int> data_;
15
};
16
//因为是转换成vector的缘故，用i的乘法计算
17
int GetLeftIndex(int i) {
18
  if ((2 * i) + 1 >= data_.size()) {
19
    return -1;
20
  }
21
  return (2 * i) + 1;
22
}
23
int GetRightIndex(int i) {
24
  if ((2 * i) + 2 >= data_.size()) {
25
    return -1;
26
  }
27
  return (2 * i) + 2;
28
}
29

30
int GetParentIndex(int i) {
31
  if (i == 0) {
32
    return -1;
33
  }
34
  return (i - 1) / 2;
35
}
36

37
int GetSmallestChildIndex(int i) {
38
  auto left_index = GetLeftIndex(i);//迭代器
39
  auto right_index = GetRightIndex(i);
40
  return data_[left_index] < data_[right_index] ?
41
  left_index :
42
  right_index;
43

44
  // What if left_index or right_index is -1?
45
}
46

47
int Heap::top() {
48
  if (data_.empty()) {
49
    return INT_MAX;
50
  } else {
51
    return data_[0];
52
  }
53
}
54

55
//push() 加到末尾，向上交换
56
void Heap::TrickleUp(int i) {
57
  // Fix the min heap property if it is violated
58
  while (i != 0 && GetParent(i) > data_[i]) {
59
    Swap(data_[i], data_[GetParentIndex(i)]);
60
    i = GetParentIndex(i);
61
  }
62
}
63

64
//pop() 去掉root，和末尾的交换，然后向下交换
65
void Heap::TrickleDown(int i) {
66
  // If it's a leaf node
67
  if (GetLeftIndex(i) > data_.size() && GetRightIndex(i) > data_.size()) {
68
    return;
69
  }
70
  int smallest_child_index = GetSmallestChildIndex(i);
71
  if (data_[i] > data_[smallest_child_index]) {
72
    Swap(data_[i], data_[smallest_child_index]);
73
    TrickleDown(smallest_child_index);
74
  }
75
}

Array —> heap#

O(nlogn)

1
Heap ConvertToHeap(const std::vector<int> &input) {
2
  Heap result_heap;
3

4
  for (int e : input) {
5
    result_heap.push(e);
6
  }
7
  return result_heap;
8
}

O(n)

1
void Heap::ConvertToHeap(const std::vector<int> &input) {
2
  data_ = input;
3
  for (int i = data_.size() / 2; i >= 0; i--) {
4
    TrickleDown(i);
5
  }
6
}

Max Heap#

最大堆

Parent bigger than or equal to both children
max is root
Functions
- Push(key)
- Pop()
- key = top()

STL Implementing#

1
// max priority queue
2
std::priority_queue<int> q;
3
q.push(5);
4
q.push(15);
5
q.push(1);
6
q.push(42);
7
q.push(3);
8
q.push(2);
9
// One by one extract items from max heap
10
while (q.empty() == false) {
11
  std::cout << q.top() << std::endl;
12
  q.pop();
13
}
14

15
// min priority queue
16
std::priority_queue<int, std::vector<int>, std::greater<int> > q;
17
q.push(5);
18
q.push(15);
19
q.push(1);
20
q.push(42);
21
q.push(3);
22
q.push(2);
23
// One by one extract items from min heap
24
while (q.empty() == false) {
25
  std::cout << q.top() << std::endl;
26
  q.pop();
27
}

Heap Sort#

By using pop() from a min heap

Graph#

“一笔画”：欧拉路径，通过图中所有边恰好一次的路径

1
bool Graph::IsEulerWalkable() {
2
  std::vector<int> degrees(v_.size());
3
  for (auto e : e_) {//遍历所有边
4
    degrees[e.first]++;
5
    degrees[e.second]++;
6
  }
7
  int countOdds = 0;
8
  for (auto d : degrees) {//遍历vector
9
    if (d % 2 == 1) {
10
      countOdds++;
11
    }
12
  }
13
  return (countOdds == 0 || countOdds == 2);
14
}

Definition#

G = (V, E)

Vertex / Vertices
Edges
- undirected
- directed

Size:

|V|, n
|E|, m
m_max = n(n-1)

adjacent: two nodes are neighbors

predecessor(parent) —> successor(child)

Connectivity:

Undirected

Connected
Unconnected

Directed

strongly connected 能回到起点
Weakly connected 不能
Unilaterally Connected

Representation#

图的三种表示方法

Direct Translation of the Definition
Adjacency List
Adjacency Matrix

Direct Translation of the Definition#

有set实现，也有vector实现

1
class Graph {
2
public:
3
  Graph(std::vector<int> &v, std::vector<std::pair<int, int>> &e): v_(v), e_(e) {}
4
  bool IsEulerWalkable();
5
  std::vector<int> v_;
6
  std::vector<std::pair<int, int>> e_;
7
};
8

9
//V={0, 1, 2, 3}
10
//E={(0,1), (1,2), (2,3), (3,0)}
11

12
int main() {
13
  std::vector<int> v = {0, 1, 2, 3};
14
  std::vector<std::pair<int, int>> e = {{0, 1}, {1, 2}, {2, 3}, {3, 0}};
15
  Graph g(v, e);
16
  std::cout << g.IsEulerWalkable() << std::endl;
17
}

(Assuming every node is connected to at least one edge. So we don’t need V anymore)

Adjacency List#

邻接链表（map+set）

Adjacency Matrix#

邻接矩阵 (Vector of vector)

![截屏2026-01-07 01.30.09](C++/截屏2026-01-07 01.30.09.png)

![截屏2026-01-07 01.30.47](C++/截屏2026-01-07 01.30.47.png)

Algorithms#

DFS#

Depth First Search

选择任意一个节点作为root，访问过的做标记，递归选取一个未访问过的孩子节点。每个节点和边只访问一次。

递归算法

1
class MapSetGraph {
2
Public:
3
  void DFS(int root){
4
    std::map<int, int> &marks;
5
    DFS_helper(root, marks);
6
  }
7
  void DFS_helper(int root, std::map<int, int> &marks);
8
  std::map<int, std::set<int>> edge_map_;
9
};
10

11
void DFS_helper(int root, std::map<int, int> &marks) {
12
  marks[root] = 1;
13
  std::cout << "visited: " << root << std::endl;
14
  for (const int child : edge_map_[root]) {
15
    if (marks[child] != 1) {
16
      DFS_helper(child, marks);
17
    }
18
  }
19
}

How to Solve Recursive Algorithms ?

You should answer the following questions:

What is the base case?
What are my recursion calls (e.g. F(n-1) and F(n-2))
How do I combine the solutions to the recursive calls? (e.g. we use ADD in Fibonacci)

What Can Help You?

The recursion tree is your best friend on which you run DFS
- The root is the recursive function, e.g. F(n)
- Children of root: your recursive calls
- The height of the tree gives me the memory complexity
- The total nodes in the tree gives me runtime complexity
  - Assuming the combining function can be done in O(1)
    - E.g. Sum in Fibonacci can be done in O(1)
    - E.g. in MergeSort, the combining function, MERGE is longer than O(1)

In solving a recursive function, we are really running DFS on a recursion Tree.

DFS的非递归实现：

用stack栈

BFS#

Breadth First Search

任意选一个节点作为root，访问过的做标记，visit all children of v before moving to v’s grandchildren, 在同一深度访问所有节点。每个节点和边只访问一次。

使用Queue实现，送进队列后的结果在上面（从根节点0开始）

1
void MapSetGraph::BFS(int root) {
2
  std::map<int, bool> visited;
3
  std::queue<int> q;
4
  q.push(root);
5
  visited[root] = true;
6
  while (!q.empty()) {
7
    int cur = q.front();
8
    q.pop();
9
    for (auto &n : edge_map_[cur]) {
10
      if (!visited[n]) {
11
        visited[n] = true;
12
        q.push(n);
13
      }
14
    }
15
  }
16
}

![截屏2026-01-07 08.11.42](C++/截屏2026-01-07 08.11.42.png)

BFS当每个边的权重相等时，可以用来找到最短路径

Grid Representation#

BFS 解决思路：

起点入队：将 Start 坐标 $(0,0)$ 放入队列。
标记访问：使用一个二维数组 visited[4][4] 来记录哪些格子走过，防止转圈。
探索邻居：从队列取出当前格，检查它的上下左右四个邻居：
- 如果邻居坐标在界内、值是 $1$ 、且没被访问过。
- 将邻居标记为已访问并入队。
终点判定：如果在探索过程中到达了 Finish 坐标 $(3,3)$ ，说明路径存在。

SDA#

Shortest Distance Algorithms

SDP: The Shortest Distance problem

SPP: The Shortest Path Problem

Weighted Graph#

Dijkstra#

Bellman-Ford#

Summary#

![截屏2026-01-07 01.40.01](C++/截屏2026-01-07 01.40.01.png)

Algorithm#

算法

Dynamic Programming#

动态规划属于一种算法优化，核心思路是通过“空间换时间”，解决递归中严重的重复计算问题

Memoization#

记忆化搜索 (AKA Caching AKA Look Up Table)

Dynamic Programming ≈ Recursive function + Memoization

1
// without memoization
2
int Fibonacci(int n) {
3
  if (n == 0 || n == 1) {
4
    return 1;
5
  }
6
  int l = Fibonacci(n - 2);
7
  int r = Fibonacci(n - 1);
8
  return r + l;
9
}//存在多次重复计算
10

11
//With memoization
12
std::unordered_map<int, int> memo;
13

14
int Fibonacci(int n) {
15
  if (n == 0 || n == 1) {
16
    return 1;
17
  }
18

19
  if (memo.count(n) > 0) {
20
    return memo[n];
21
  } else {
22
    memo[n] = Fibonacci(n - 1) + Fibonacci(n - 2);
23
  }
24
  return memo[n];
25
}//计算出来的数存起来，下次用时候直接拿，避免重复计算

Bottom-up(Tabulation)#

1
//Tabulation
2
int Fibonacci(int n) {
3
  // Allocate for 0 to n included
4
  std::vector<int> d(n + 1);
5
  d[0] = d[1] = 1;
6

7
  for (int i = 2; i <= n; i++) {
8
    d[i] = d[i - 1] + d[i - 2];
9
  }
10
  return d[n];
11
}
12

13
//Memoization
14
std::unordered_map<int, int> memo;
15
int Fibonacci(int n) {
16
  if (n == 0 || n == 1) {
17
    return 1;
18
  }
19

20
  if (memo.count(n) > 0) {
21
    return memo[n];
22
  } else {
23
    memo[n] = Fibonacci(n - 1) + Fibonacci(n - 2);
24
  }
25
  return memo[n];
26
}

Approach a DP Problem#

Coin Changing Problem#

转化成递归树的形式

1
//(Non-Optimized) 递归暴力搜索
2
int CoinChange(std::vector<int> &coins, int amount) {
3
  if (amount == 0)
4
    return 0;
5
  if (amount < 0)
6
    return -1; // Indicates invalid
7
  int min = INT_MAX;
8
  for (auto coin : coins) {
9
    int n = CoinChange(coins, amount - coin);
10
    if (n >= 0) {
11
      min = std::min(n, min);
12
    }
13
  }
14
  if (min >= 0 && min < INT_MAX) {
15
    return 1 + min;
16
  else
17
    return -1;
18
}
19

20
//Memoization is your friend! 记忆化搜索
21
std::unordered_map<int, int> memo;
22
int coinChange(std::vector<int> &coins, int amount) {
23
  if (amount == 0)
24
    return 0;
25
  if (amount < 0)
26
    return -1;
27
  if (memo.count(amount) > 0)
28
    return memo[amount];
29
  int min = INT_MAX;
30
  for (auto coin : coins) {
31
    int n = coinChange(coins, amount - coin);
32
    if (n >= 0) {
33
      min = std::min(n, min);
34
    }
35
  }
36
  if (min >= 0 && min < INT_MAX)
37
    memo[amount] = 1 + min;
38
  else
39
    memo[amount] = -1;
40
  return memo[amount];
41
}
42

43
//Tabulation 制表法
44
int coinChange(std::vector<int> &coins, int amount) {
45
  std::vector<int> d(amount + 1);
46
  d[0] = 0;
47

48
  for (int i = 1; i <= amount; i++) {
49
    d[i] = INT_MAX-1; // -1 because we add +1 later
50

51
    for (auto coin : coins) {
52
      if (i - coin >= 0) {
53
        d[i] = std::min(d[i - coin] + 1, d[i]);
54
      }
55
    }
56
  }
57

58
  return d[amount] > amount ? -1 : d[amount];
59
}