diff --git a/.github/FUNDING.yml b/.github/FUNDING.yml
deleted file mode 100644
index 4b16f597..00000000
--- a/.github/FUNDING.yml
+++ /dev/null
@@ -1,12 +0,0 @@
-# These are supported funding model platforms
-
-github: # Replace with up to 4 GitHub Sponsors-enabled usernames e.g., [user1, user2]
-patreon: # Replace with a single Patreon username
-open_collective: # Replace with a single Open Collective username
-ko_fi: # Replace with a single Ko-fi username
-tidelift: # Replace with a single Tidelift platform-name/package-name e.g., npm/babel
-community_bridge: # Replace with a single Community Bridge project-name e.g., cloud-foundry
-liberapay: # Replace with a single Liberapay username
-issuehunt: # Replace with a single IssueHunt username
-otechie: # Replace with a single Otechie username
-custom: # Replace with up to 4 custom sponsorship URLs e.g., ['link1', 'link2']
diff --git a/.gitignore b/.gitignore
index 2bac022e..714fa7e9 100644
--- a/.gitignore
+++ b/.gitignore
@@ -1,4 +1,6 @@
-﻿.vscode
+﻿.idea
+.vscode
 .github
 img/*.html
-
+notebook
+venv
diff --git a/README.md b/README.md
index a0bf3a03..91ab7d1f 100644
--- a/README.md
+++ b/README.md
@@ -1,3 +1,4 @@
+
 <div align="center">
 <img src="https://img.shields.io/badge/-Python-brightgreen">
 <img src="https://img.shields.io/badge/-%E6%95%B0%E6%8D%AE%E5%88%86%E6%9E%90-yellowgreen">
@@ -7,1707 +8,295 @@
 </div>
 <br>
 
-<div align="center">
-<img src="https://static01.imgkr.com/temp/9785963f175849d7bbc8a49ea02a54ea.png" width="20%" />
+<!-- <div align="center">
 <img src="https://static01.imgkr.com/temp/f379139a2c5d463799c35c1aa68911d7.png" width="18%"/>
-</div>
-
-<br>
-
-<br>
+</div> -->
 </div>
 
 ## 介绍
 
-告别枯燥，告别枯燥，致力于打造 Python 经典小例子、小案例。 如果转载本库小例子、小案例，请备注下方链接：Python小例子 https://github.com/jackzhenguo/python-small-examples
-
-查看完整的旧版本：[改版前V3.0](V3.md)
-
-## 贡献
-
-欢迎贡献小例子到此库
+告别枯燥，告别枯燥，致力于打造 Python 经典小例子、小案例。 
 
 ## License
 
-允许按照要求转载，但禁止用于任何商用目的。
-
-## Python 原创教程
-这是经过很久打磨的一个Python教程，全部是个人原创，已首发在公众号，并且托管在我的[个人网站](http://www.zglg.work/?page_id=535)。想系统入门Python的欢迎学习：
-
-[Python技术栈完整教程](http://www.zglg.work/?page_id=535)
-
-![](./img/大纲.png)
-
-适合小白的 Python 系统入门课程
-
-- [1 数字专题](http://www.zglg.work/?page_id=530)
-- [2 字符串专题](http://www.zglg.work/?page_id=540)
-- 3 列表专题
-  - [3.1 列表基础](http://www.zglg.work/?page_id=563)
-  - [3.2 列表进阶](http://www.zglg.work/?page_id=575)
-- 4 流程控制专题
-  - [4.1 流程控制基础](http://www.zglg.work/?page_id=618)
-  - [4.2 流程控制进阶](http://www.zglg.work/?page_id=621)
-- 5 编程风格
-  - [5.1 基本编程习惯](http://www.zglg.work/?page_id=654)
-  - [5.2 EAFP 防御编程风格](http://www.zglg.work/?page_id=657)
-  - [5.3 LBYL 防御编程风格](http://www.zglg.work/?page_id=659)
-- 6 Python 函数
-  - [6.1 函数组成](http://www.zglg.work/?page_id=883)
-  - [6.2 引用传参](http://www.zglg.work/?page_id=885)
-  - [6.3 默认参数与关键字参数](http://www.zglg.work/?page_id=887)
-  - [6.4 可变参数](http://www.zglg.work/?page_id=889)
-  - [6.6 偏函数](http://www.zglg.work/?page_id=892)
-  - [6.7 递归函数](http://www.zglg.work/?page_id=894)
-  - [6.8 匿名函数](http://www.zglg.work/?page_id=896)
-  - [6.9 高阶函数](http://www.zglg.work/?page_id=898)
-  - [6.10 嵌套函数](http://www.zglg.work/?page_id=900)
-- 7 面向对象编程基础
-  - [7.1 类定义](http://www.zglg.work/?page_id=904)
-  - [7.2 对象或实例](http://www.zglg.work/?page_id=906)
-  - [7.3 打印对象](http://www.zglg.work/?page_id=908)
-  - [7.4 属性](http://www.zglg.work/?page_id=910)
-  - [7.5 private,protected,public](http://www.zglg.work/?page_id=913)
-  - [7.6 继承](http://www.zglg.work/?page_id=916)
-  - [7.7 多态](http://www.zglg.work/?page_id=918)
-
-后续章节正在整理推送中。
-
-
-
-## Python 小例子
-
-| 小例子 | 链接                    | 标签              | 版本          | 难度 |
-| ---- | ---------------------------------- | ---- | ---- | ---- |
-| 1    | [实现 relu](md/1.md) | max   | V4.0        | ⭐️⭐️ |
-| 2    | [进制转化](md/2.md)                | bin,oct,hex | V4.0            |  ⭐️⭐️    |
-| 3    | [整数和ASCII互转](md/3.md)         | chr,ord | V1.0     | ⭐️⭐️ |
-| 4    | [元素都为真检查](md/4.md)          | all   | V2.0      | ⭐️⭐️⭐️ |
-| 5    | [至少一个为真检查](md/5.md)        | any | V2.0    | ⭐️⭐️⭐️ |
-| 6    | [判断是真是假](md/6.md)            | bool        | V2.0        | ⭐️⭐️⭐️ |
-| 7    | [创建复数](md/7.md)                | complex         | V1.0            | ⭐️⭐️⭐️ |
-| 8    | [取商和余数](md/8.md)              | divmod        | V1.0          | ⭐️⭐️ |
-| 9    | [转为浮点类型](md/9.md)            | float       | V1.0        | ⭐️⭐️ |
-| 10   | [转为整型](md/10.md)               | int            | V1.0           | ⭐️ |
-| 11   | [次幂](md/11.md)                   | pow                | V1.0               | ⭐️ |
-| 12   | [四舍五入](md/12.md)               | round          | V1.0           | ⭐️ |
-| 13   | [链式比较](md/13.md)               | compare       | V1.0           | ⭐️⭐️ |
-| 14   | [字符串转字节](md/14.md)           | bytes,utf-8 | V1.0       | ⭐️⭐️ |
-| 15   | [任意对象转为字符串](md/15.md)     | str  | V1.0 | ⭐️⭐️ |
-| 16   | [执行字符串表示的代码](md/16.md)   | compile | V1.0 | ⭐️⭐️⭐️ |
-| 17   | [计算表达式](md/17.md)             | eval         | V1.0         | ⭐️⭐️⭐️⭐️ |
-| 18   | [字符串格式化](md/18.md)           | format     | V1.0       | ⭐️⭐️⭐️⭐️ |
-| 19   | [拿来就用的排序函数](md/19.md)     | sorted | V1.0 | ⭐️⭐️⭐️ |
-| 20   | [求和函数](md/20.md)               | sum            | V1.0           | ⭐️⭐️ |
-| 21   | [nonlocal用于内嵌函数中](md/21.md) | nonlocal | V2.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 22   | [global 声明全局变量](md/22.md)    | global | V2.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 23   | [交换两元素](md/23.md)             | pack,unpack | V1.0         | ⭐️⭐️ |
-| 24   | [操作函数对象](md/24.md)           | operator   | V2.0       | ⭐️⭐️⭐️⭐️ |
-| 25   | [生成逆序序列](md/25.md)           | range | V1.0       | ⭐️⭐️ |
-| 26   | [函数的五类参数使用例子](md/26.md) | variable parameter | V2.0 | ⭐️⭐️⭐️⭐️ |
-| 27   | [使用slice对象](md/27.md)          | slice     | V2.0      | ⭐️⭐️⭐️⭐️⭐️ |
-| 28   | [lambda 函数](md/28.md)  | lambda | V3.0 | ⭐️⭐️⭐️⭐️ |
-| 29   | [转为字典](md/29.md)               | dict           | V1.0           | ⭐️⭐️ |
-| 30   | [冻结集合](md/30.md)               | frozenset | V1.0           | ⭐️⭐️ |
-| 31   | [转为集合类型](md/31.md)           | set        | V1.0       | ⭐️⭐️ |
-| 32   | [转元组](md/32.md)                 | tuple            | V1.0             | ⭐️⭐️ |
-| 33   | [对象是否可调用](md/33.md)         | callable | V2.5   | ⭐️⭐️⭐️⭐️ |
-| 34   | [ascii 展示对象](md/34.md)         | `__repr__` | V2.5     | ⭐️⭐️⭐️ |
-| 35   | [类方法](md/35.md)                 | classmethod      | V1.5             | ⭐️⭐️⭐️ |
-| 36   | [动态删除属性](md/36.md)           | delattr,hasattr | V1.5       | ⭐️⭐️ |
-| 37   | [一键查看对象所有方法](md/37.md)   | dir | V1.5 | ⭐️⭐️ |
-| 38   | [动态获取对象属性](md/38.md)       | getattr | V1.5   | ⭐️⭐️ |
-| 39   | [对象是否有某个属性](md/39.md)     | hasattr | V1.5 | ⭐️⭐️⭐️ |
-| 40   | [对象门牌号](md/40.md)             | id           | V1.0         | ⭐️ |
-| 41   | [实例和对象关系判断](md/41.md)    | isinstance   | V1.5         | ⭐️⭐️⭐️ |
-| 42   | [issubclass父子关系鉴定](md/42.md) | issubclass | V1.5 | ⭐️⭐️⭐️ |
-| 43   | [所有对象之根](md/43.md)           | object     | V1.0       | ⭐️ |
-| 44   | [创建属性的两种方法](md/44.md)     | property | V2.5 | ⭐️⭐️⭐️⭐️⭐️ |
-| 45   | [查看对象类型](md/45.md)           | type      | V1.0       | ⭐️ |
-| 46   | [元类使用介绍](md/46.md)     | type,`__class__` | V2.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 47   | [枚举对象](md/47.md)       | enumerate | V1.0   | ⭐️⭐️⭐️ |
-| 48   | [查看变量所占字节数](md/48.md)  | getsizeof | V1.0 | ⭐️⭐️⭐️ |
-| 49   | [过滤器filter](md/49.md)  | filter | V1.5 | ⭐️⭐️⭐️ |
-| 50   | [返回对象哈希值](md/50.md)    | hash | V1.0 | ⭐️⭐️ |
-| 51   | [help 一键帮助](md/51.md)  | help | V1.0 | ⭐️ |
-| 52   | [获取用户输入](md/52.md)     | input | V1.0 | ⭐️ |
-| 53   | [创建迭代器](md/53.md)      | iter,`__iter__` | V1.5  | ⭐️⭐️⭐️ |
-| 54   | [文件读写和mode 取值表](md/54.md) | open,read,write,with,mode | V2.0 | ⭐️⭐️⭐️ |
-| 55   | [创建range序列](md/55.md) | range | V1.0 | ⭐️⭐️ |
-| 56   | [反向迭代器reversed](md/56.md) | reversed | V1.0 | ⭐️⭐️ |
-| 57   | [zip迭代器](md/57.md)     | zip  | V1.5 | ⭐️⭐️⭐️ |
-| 58   | [operator使用举例](md/58.md) | operator | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 59   | [传输json对象](md/59.md)  | json | V2.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 60   | [不用else和if实现计算器](md/60.md) | operator | V1.0 | ⭐️⭐️⭐️ |
-| 61   | [去最求平均](md/61.md)      | list,sort,round | V1.0  | ⭐️⭐️⭐️⭐️ |
-| 62   | [打印99乘法表](md/62.md)    | for,range,format | V1.0 | ⭐️⭐️⭐️ |
-| 63   | [递归版flatten函数](md/63.md) | recursion,list,isinstance | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 64   | [列表等分为n份](md/64.md)  | list,ceil | V1.0 | ⭐️⭐️⭐️ |
-| 65   | [压缩列表](md/65.md)       | list,filter | V1.0   | ⭐️⭐️⭐️⭐️ |
-| 66   | [求更长的列表](md/66.md)     | max,lambda | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 67   | [求列表众数](md/67.md)     | max,lambda,count | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 68   | [所有多个列表的最大值](md/68.md) | max,lambda | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 69   | [列表检查重复](md/69.md)     | set  | V1.0 | ⭐️⭐️⭐️ |
-| 70   | [一行代码实现列表反转](md/70.md) | reverse | V1.0 | ⭐️⭐️ |
-| 71   | [浮点数等差数列](md/71.md)    | range,float | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 72   | [按条件分组](md/72.md)      | lambda | V1.0  | ⭐️⭐️⭐️⭐️ |
-| 73   | [map实现向量运算](md/73.md)  | map,lambda | V1.0 | ⭐️⭐️⭐️ |
-| 74   | [值最大的字典](md/74.md)     | max,lambda | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 75   | [合并两个字典](md/75.md)     | **   | V1.0 | ⭐️⭐️⭐️ |
-| 76   | [Topn 字典](md/76.md)    | heapq,nlargest | V1.0 | ⭐️⭐️⭐️ |
-| 77   | [判断是否为异位词](md/77.md) | collections,Counter | V1.0 | ⭐️⭐️⭐️ |
-| 78   | [逻辑上合并字典](md/78.md) | ChainMap | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 79   | [带名字的元组](md/79.md) | namedtuple | V1.0 | ⭐️⭐️⭐️ |
-| 80   | [sample 样本抽样](md/80.md) | random,sample | V1.0 | ⭐️⭐️⭐️ |
-| 81   | [重洗数据集](md/81.md) | shuffle | V1.0 | ⭐️⭐️⭐️ |
-| 82   | [10个均匀分布的坐标点](md/82.md) | random,uniform | V1.0 | ⭐️⭐️⭐️ |
-| 83   | [10个高斯分布的坐标点](md/83.md) | random,gauss | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 84   | [chain串联小容器为大容器](md/84.md) | itertools,chain | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 85   | [product 使用案例](md/85.md) | product | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 86   | [反转字符串的两个方法](md/86.md) | reversed | V1.0 | ⭐️⭐️ |
-| 87   | [join 串联字符串](md/87.md) | join | V1.0 | ⭐️⭐️ |
-| 88   | [字符串字节长度](md/88.md) | encode | V1.0 | ⭐️⭐️ |
-| 89   | [正则中字符 `r`作用](md/89.md) | re,r | V3.0 | ⭐️⭐️⭐️ |
-| 90   | [正则原子操作](md/90.md) | re | V3.0 | ⭐️⭐️⭐️ |
-| 91   | [正则中的转义](md/91.md) | re,\ | V3.0 | ⭐️⭐️⭐️ |
-| 92   | [正则最普通查找](md/92.md) | re,findall | V3.0 | ⭐️⭐️⭐️ |
-| 93   | [使用通用字符查找](md/93.md) | re,\s,\w,\d | V3.0 | ⭐️⭐️⭐️ |
-| 94   | [使用元字符查找](md/94.md) | re,+,* | V3.0 | ⭐️⭐️⭐️ |
-| 95   | [捕获子串](md/95.md) | () | V3.0 | ⭐️⭐️⭐️⭐️ |
-| 96   | [贪心捕获和非贪心捕获](md/96.md) | re | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 97   | [使用正则做密码安全检查](md/97.md) | re | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 98   | [爬取百度首页标题](md/98.md) | re | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 99   | [批量转化为驼峰格式(Camel)](md/99.md) | re | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 100   | [是否互为排序词](md/100.md) | collections,defaultdict | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 101   | [str1是否由str2旋转而来](md/101.md) | str | V1.0 | ⭐️⭐️⭐️ |
-| 102   | [使用正则判断是否为正浮点数](md/102.md) | str,re,float | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 103   | [获取文件后缀名](md/103.md) | os,splitext | V1.0 | ⭐️⭐️ |
-| 104   | [获取路径中的文件名](md/104.md) | os,split | V1.0 | ⭐️⭐️ |
-| 105   | [批量修改文件后缀](md/105.md) | argparse,listdir | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 106   | [xls批量转换成xlsx](md/106.md) | os,listdir,splitext | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 107   | [获取指定后缀名的文件](md/107.md) | os,listdir,splitext | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 108   | [批量压缩文件](md/108.md) | zipfile | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 109   | [32位加密](md/109.md) | hashlib | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 110   | [年的日历图](md/110.md) | calendar | V1.0 | ⭐️⭐️ |
-| 111   | [判断是否为闰年](md/111.md) | calendar | V1.0 | ⭐️⭐️⭐️ |
-| 112   | [判断月有几天](md/112.md) | calendar,datetime | V1.0 | ⭐️⭐️⭐️ |
-| 113   | [月的第一天](md/113.md) | datetime | V1.0 | ⭐️⭐️ |
-| 114 | [月的最后一天](md/114.md) | calendar,datetime | V1.0 | ⭐️⭐️ |
-| 115 | [获取当前时间](md/115.md) | time,datetime | V1.0 | ⭐️⭐️ |
-| 116 | [字符时间转时间](md/116.md) | time,datetime | V1.0 | ⭐️⭐️ |
-| 117 | [时间转字符时间](md/117.md) | time,datetime | V1.0 | ⭐️⭐️ |
-| 118 | [默认启动主线程](md/118.md) | threading | V1.0 | ⭐️⭐️ |
-| 119 | [创建线程](md/119.md) | threading | V1.0 | ⭐️⭐️ |
-| 120 | [交替获得CPU时间片](md/120.md) | threading | V1.0 | ⭐️⭐️⭐️ |
-| 121 | [多线程抢夺同一个变量](md/121.md) | threading | V1.0 | ⭐️⭐️⭐️ |
-| 122 | [多线程变量竞争引起的问题](md/122.md) | threading | V1.0 | ⭐️⭐️⭐️ |
-| 123 | [多线程锁](md/123.md) | threading,lock | V1.0 | ⭐️⭐️⭐️ |
-| 124 | [时间转数组及常用格式](md/124.md) | time,datetime,format | V1.0 | ⭐️⭐️⭐️ |
-| 125 | [寻找第n次出现位置](md/125.md) | enumerator | V1.0 | ⭐️⭐️⭐️ |
-| 126 | [斐波那契数列前n项](md/126.md) | yield,range | V1.0 | ⭐️⭐️⭐️ |
-| 127 | [找出所有重复元素](md/127.md) | calendar,datetime | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 128 | [联合统计次数](md/128.md) | Counter | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 129 | [groupby单字段分组](md/129.md) | itertools, groupby,lambda | V1.0 | ⭐️⭐️⭐️ |
-| 130 | [groupby多字段分组](md/130.md) | itemgetter,itertools,groupby | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 131 | [itemgetter和key函数](md/131.md) | operator,itemgetter,itertools | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 132 | [sum函数计算和聚合同时做](md/132.md) | sum,generator | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
-| 133 | [获得某天后的1~n天](md/133.md) | Calendar,monthrange | V4.0 | ⭐️⭐️⭐️ |
-| 134 | [list分组(生成器版)](md/134.md) | yield,generator | V1.0 | ⭐️⭐️⭐️ |
-| 135 | [列表全展开(生成器版)](md/135.md) | list,yield,generator | V1.0 | ⭐️⭐️⭐️ |
-| 136 | [测试函数运行时间的装饰器](md/136.md) | decorator | V1.0 | ⭐️⭐️⭐️⭐️ |
-| 137 | [统计异常次数装饰器](md/137.md) | decorator,nonlocal | V1.5 | ⭐️⭐️⭐️⭐️ |
-| 138 | [通俗理解装饰器](md/138.md) | decorator | V1.0 | ⭐️⭐️⭐️ |
-| 139 | [定制递减迭代器](md/139.md) | Iterator | V3.0 | ⭐️⭐️⭐️⭐️ |
-| 140 | [turtle绘制奥运五环图](md/140.md) | turtle | V1.0 | ⭐️⭐️⭐️ |
-| 141 | [turtle绘制漫天雪花](md/141.md) | turtle | V1.0 | ⭐️⭐️⭐️ |
-| 142 | [Python词云图](md/142.md) | WordCloud | V1.0 | ⭐️⭐️⭐ |
-| 143 | [Plotly柱状图和折线图](md/143.md) | plotly | V1.0 | ⭐️⭐ |
-| 144 | [seaborn热力图](md/144.md) | seaborn | V1.0 | ⭐️⭐ |
-| 145 | [Pyecharts仪表盘](md/145.md) | pyecharts | V1.0 | ⭐️⭐ |
-| 146 | [Pyecharts漏斗图](md/146.md) | pyecharts | V1.0 | ⭐️⭐ |
-| 147 | [Pyecharts水球图](md/147.md) | pyecharts | V1.0 | ⭐️⭐ |
-| 148 | [Pyecharts饼图](md/148.md) | pyecharts | V1.0 | ⭐️⭐ |
-| 149 | [Pyecharts极坐标图](md/149.md) | pyecharts | V1.0 | ⭐️⭐ |
-| 150 | [Pyecharts词云图](md/150.md) | pyecharts | V1.0 | ⭐️⭐ |
-| 151 | [Pyecharts热力图](md/151.md) | pyecharts | V1.0 | ⭐️⭐ |
-| 152 | [matplotlib绘制动图](md/152.md) | matplotlib | V1.0 | ⭐️⭐ |
-| 153 | [seaborn pairplot图](md/153.md) | seaborn | V1.0 | ⭐️⭐⭐⭐ |
-| 154 | [含单个元素的元组](md/154.md) | tuple | V1.0 | ⭐️⭐ |
-| 155 | [默认参数设为空](md/155.md) | function | V1.0 | ⭐️⭐⭐ |
-| 156 | [共享变量未绑定之坑](md/156.md) | global | V1.0 | ⭐️⭐⭐ |
-| 157 | [lambda自由参数之坑](md/157.md) | lambda | V1.0 | ⭐️⭐⭐ |
-| 158 | [各种参数使用之坑](md/158.md) | function paremeter | V1.0 | ⭐️⭐⭐ |
-| 159 | [列表删除之坑](md/159.md) | list | V1.0 | ⭐️⭐ |
-| 160 | [列表快速复制之坑](md/160.md) | list  | V1.0 | ⭐️⭐⭐ |
-| 161 | [字符串驻留](md/161.md) | str  | V1.0 | ⭐️⭐⭐⭐⭐ |
-| 162 | [相同值的不可变对象](md/162.md) | mutable  | V1.0 | ⭐️⭐⭐ |
-| 163 | [对象销毁顺序](md/163.md) | OOP del   | V1.0 | ⭐️⭐⭐⭐ |
-| 164 | [充分认识for](md/164.md) | for  | V1.0 | ⭐️⭐⭐ |
-| 165 | [认识执行时机](md/165.md) | generator  | V1.0 | ⭐️⭐⭐⭐⭐ |
-| 166 | [创建空集合错误](md/166.md) | set  | V1.0 | ⭐️⭐ |
-| 167 | [pyecharts传入Numpy数据绘图失败](md/167.md) | numpy pyecharts  | V1.0 | ⭐️⭐⭐ |
-| 168 | [优化代码异常输出包](md/168.md) | debugger  | V1.0 | ⭐️⭐⭐ |
-| 169 | [图像处理包pillow](md/169.md) | pillow  | V1.0 | ⭐️⭐⭐ |
-| 170 | [一行代码找到编码](md/170.md) | chardet  | V1.0 | ⭐️⭐⭐ |
-| 171 | [子类继承父类的静态方法吗？](md/171.md) | staticmethod | V1.0 | ⭐️⭐⭐ |
-| 172 | [NumPy 的pad填充方法](md/172.md) | NumPy pad | V1.0 | ⭐️⭐⭐⭐ |
-| 173 | [创建下对角线为1、2、3、4的对角矩阵](md/173.md) | NumPy diag | V1.0 | ⭐️⭐⭐ |
-| 174 | [cut 数据分箱](md/174.md) | Pandas cut | v1.0 | ⭐️⭐⭐ |
-| 175 | [丢弃空值和填充空值](./md/175.md) | Pandas dropna fillna | v1.0 | ⭐️⭐⭐ |
-| 176 | [一行代码让 pip 安装加速 100 倍](md/176.md) | pip install | v1.0 | ⭐️⭐⭐ |
-| 177 | [数据分析神器：deepnote](./md/177.md) | deepnote | v1.0 | ⭐️⭐⭐ |
-| 178 | [apply 方法去掉特殊字符](./md/178.md) | pandas apply | v1.0 | ⭐️⭐⭐ |
-| 179 | [使用map对列做特征工程](./md/179.md) | pandas map | v1.0 | ⭐️⭐⭐ |
-| 180 | [category列转数值](./md/180.md) | pandas category | v1.0 | ⭐️⭐⭐ |
-| 181 | [rank排名](./md/181.md) | pandas rank | v1.0 | ⭐️⭐⭐|
-| 182 | [完成数据下采样，调整步长由小时为天](./md/182.md) | pandas resample | v1.0 | ⭐️⭐⭐ |
-| 183 | [如何用 Pandas 快速生成时间序列数据](/md/183.md) | pandas util | v1.0 | ⭐️⭐⭐ |
-| 184 | [如何快速找出 DataFrame 所有列 null 值个数](/md/184.md) | pandas isnull sum | v1.0 | ⭐️⭐⭐ |
-| 185 | [重新排序 DataFrame 的列](/md/185.md) | pandas dataframe | v1.0 | ⭐️⭐⭐ |
-| 186 | [使用 count 统计词条 出现次数](/md/186.md) | pandas count | v1.0 | ⭐️⭐⭐ |
-| 187 | [split 求时分(HH:mm)的分钟差](/md/187.md) | pandas split | v1.0 | ⭐️⭐⭐ |
-| 188 | [melt透视数据小技巧](/md/188.md) | pandas melt | v1.0 | ⭐️⭐⭐ |
-| 189 | [pivot 透视小技巧](/md/189.md) | pandas melt | v1.0 | ⭐️⭐⭐ |
-
-
-
-### Python 实战
-
-
-#### 221 自动群发邮件
-
-Python自动群发邮件
-
-```python
-import smtplib
-from email import (header)
-from email.mime import (text, application, multipart)
-import time
-
-def sender_mail():
-    smt_p = smtplib.SMTP()
-    smt_p.connect(host='smtp.qq.com', port=25)
-    sender, password = '113097485@qq.com', "**************"
-    smt_p.login(sender, password)
-    receiver_addresses, count_num = [
-        'guozhennianhua@163.com', 'xiaoxiazi99@163.com'], 1
-    for email_address in receiver_addresses:
-        try:
-            msg = multipart.MIMEMultipart()
-            msg['From'] = "zhenguo"
-            msg['To'] = email_address
-            msg['subject'] = header.Header('这是邮件主题通知', 'utf-8')
-            msg.attach(text.MIMEText(
-                '这是一封测试邮件，请勿回复本邮件~', 'plain', 'utf-8'))
-            smt_p.sendmail(sender, email_address, msg.as_string())
-            time.sleep(10)
-            print('第%d次发送给%s' % (count_num, email_address))
-            count_num = count_num + 1
-        except Exception as e:
-            print('第%d次给%s发送邮件异常' % (count_num, email_address))
-            continue
-    smt_p.quit()
-
-sender_mail()
-```
-
-
-
-注意：
-发送邮箱是qq邮箱，所以要在qq邮箱中设置开启SMTP服务，设置完成时会生成一个授权码，将这个授权码赋值给文中的`password`变量
-
-#### 222 二分搜索
-
-二分搜索是程序员必备的算法，无论什么场合，都要非常熟练地写出来。
-
-小例子描述：
-在**有序数组**`arr`中，指定区间`[left,right]`范围内，查找元素`x`
-如果不存在，返回`-1`
-
-二分搜索`binarySearch`实现的主逻辑
-
-```python
-def binarySearch(arr, left, right, x):
-    while left <= right:
-
-        mid = int(left + (right - left) / 2); # 找到中间位置。求中点写成(left+right)/2更容易溢出，所以不建议这样写
-
-        # 检查x是否出现在位置mid
-        if arr[mid] == x:
-            print('found %d 在索引位置%d 处' %(x,mid))
-            return mid
-
-            # 假如x更大，则不可能出现在左半部分
-        elif arr[mid] < x:
-            left = mid + 1 #搜索区间变为[mid+1,right]
-            print('区间缩小为[%d,%d]' %(mid+1,right))
-
-        # 同理，假如x更小，则不可能出现在右半部分
-        elif x<arr[mid]:
-            right = mid - 1 #搜索区间变为[left,mid-1]
-            print('区间缩小为[%d,%d]' %(left,mid-1))
-
-    # 假如搜索到这里，表明x未出现在[left,right]中
-    return -1
-```
-
-在`Ipython`交互界面中，调用`binarySearch`的小Demo:
-
-```python
-In [8]: binarySearch([4,5,6,7,10,20,100],0,6,5)
-区间缩小为[0,2]
-found 5 at 1
-Out[8]: 1
-
-In [9]: binarySearch([4,5,6,7,10,20,100],0,6,4)
-区间缩小为[0,2]
-区间缩小为[0,0]
-found 4 at 0
-Out[9]: 0
-
-In [10]: binarySearch([4,5,6,7,10,20,100],0,6,20)
-区间缩小为[4,6]
-found 20 at 5
-Out[10]: 5
-
-In [11]: binarySearch([4,5,6,7,10,20,100],0,6,100)
-区间缩小为[4,6]
-区间缩小为[6,6]
-found 100 at 6
-Out[11]: 6
-```
-
-#### 223 爬取天气数据并解析温度值
-
-爬取天气数据并解析温度值
-
-素材来自朋友袁绍，感谢！
-
-爬取的html 结构
-
-<img src="./img/1.png" width="50%"/>
-
-```python
-import requests
-from lxml import etree
-import pandas as pd
-import re
-
-url = 'http://www.weather.com.cn/weather1d/101010100.shtml#input'
-with requests.get(url) as res:
-    content = res.content
-    html = etree.HTML(content)
-```
-
-
-
-通过lxml模块提取值
-
-lxml比beautifulsoup解析在某些场合更高效
-
-```python
-location = html.xpath('//*[@id="around"]//a[@target="_blank"]/span/text()')
-temperature = html.xpath('//*[@id="around"]/div/ul/li/a/i/text()')
-```
-
-结果：
-
-```python
-['香河', '涿州', '唐山', '沧州', '天津', '廊坊', '太原', '石家庄', '涿鹿', '张家口', '保定', '三河', '北京孔庙', '北京国子监', '中国地质博物馆', '月坛公
-园', '明城墙遗址公园', '北京市规划展览馆', '什刹海', '南锣鼓巷', '天坛公园', '北海公园', '景山公园', '北京海洋馆']
-
-['11/-5°C', '14/-5°C', '12/-6°C', '12/-5°C', '11/-1°C', '11/-5°C', '8/-7°C', '13/-2°C', '8/-6°C', '5/-9°C', '14/-6°C', '11/-4°C', '13/-3°C'
-, '13/-3°C', '12/-3°C', '12/-3°C', '13/-3°C', '12/-2°C', '12/-3°C', '13/-3°C', '12/-2°C', '12/-2°C', '12/-2°C', '12/-3°C']
-```
-
-
-构造DataFrame对象
-
-```python
-df = pd.DataFrame({'location':location, 'temperature':temperature})
-print('温度列')
-print(df['temperature'])
-```
-
-正则解析温度值
-
-```python
-df['high'] = df['temperature'].apply(lambda x: int(re.match('(-?[0-9]*?)/-?[0-9]*?°C', x).group(1) ) )
-df['low'] = df['temperature'].apply(lambda x: int(re.match('-?[0-9]*?/(-?[0-9]*?)°C', x).group(1) ) )
-print(df)
-```
-
-详细说明子字符创捕获
-
-除了简单地判断是否匹配之外，正则表达式还有提取子串的强大功能。用`()`表示的就是要提取的分组（group）。比如：`^(\d{3})-(\d{3,8})$`分别定义了两个组，可以直接从匹配的字符串中提取出区号和本地号码
-
-```python
-m = re.match(r'^(\d{3})-(\d{3,8})$', '010-12345')
-print(m.group(0))
-print(m.group(1))
-print(m.group(2))
-
-# 010-12345
-# 010
-# 12345
-```
-
-如果正则表达式中定义了组，就可以在`Match`对象上用`group()`方法提取出子串来。
-
-注意到`group(0)`永远是原始字符串，`group(1)`、`group(2)`……表示第1、2、……个子串。
-
-
-最终结果
-
-```kepython
-Name: temperature, dtype: object
-    location temperature  high  low
-0         香河     11/-5°C    11   -5
-1         涿州     14/-5°C    14   -5
-2         唐山     12/-6°C    12   -6
-3         沧州     12/-5°C    12   -5
-4         天津     11/-1°C    11   -1
-5         廊坊     11/-5°C    11   -5
-6         太原      8/-7°C     8   -7
-7        石家庄     13/-2°C    13   -2
-8         涿鹿      8/-6°C     8   -6
-9        张家口      5/-9°C     5   -9
-10        保定     14/-6°C    14   -6
-11        三河     11/-4°C    11   -4
-12      北京孔庙     13/-3°C    13   -3
-13     北京国子监     13/-3°C    13   -3
-14   中国地质博物馆     12/-3°C    12   -3
-15      月坛公园     12/-3°C    12   -3
-16   明城墙遗址公园     13/-3°C    13   -3
-17  北京市规划展览馆     12/-2°C    12   -2
-18       什刹海     12/-3°C    12   -3
-19      南锣鼓巷     13/-3°C    13   -3
-20      天坛公园     12/-2°C    12   -2
-21      北海公园     12/-2°C    12   -2
-22      景山公园     12/-2°C    12   -2
-23     北京海洋馆     12/-3°C    12   -3
-```
-
-### 十、数据分析
-
-本项目基于Kaggle电影影评数据集，通过这个系列，你将学到如何进行数据探索性分析(EDA)，学会使用数据分析利器`pandas`，会用绘图包`pyecharts`，以及EDA时可能遇到的各种实际问题及一些处理技巧。
-
-
-
-本项目需要导入的包：
-
-```python
-import pandas as pd
-import numpy as np
-import matplotlib.pyplot as plt
-from pyecharts.charts import Bar,Grid,Line
-import pyecharts.options as opts
-from pyecharts.globals import ThemeType
-```
-
-#### 1 创建DataFrame
-pandas中一个dataFrame实例：
-```python
-Out[89]:
-        a  val
-0  apple1  1.0
-1  apple2  2.0
-2  apple3  3.0
-3  apple4  4.0
-4  apple5  5.0
-```
-
-我们的**目标**是变为如下结构：
-```python
-a  apple1  apple2  apple3  apple4  apple5
-0     1.0     2.0     3.0     4.0     5.0
-```
-
-乍看可使用`pivot`，但很难一步到位。
-
-所以另辟蹊径，提供一种简单且好理解的方法：
-
-```python
-In [113]: pd.DataFrame(index=[0],columns=df.a,data=dict(zip(df.a,df.val)))
-Out[113]:
-a  apple1  apple2  apple3  apple4  apple5
-0     1.0     2.0     3.0     4.0     5.0
-```
-以上方法是重新创建一个DataFrame,直接把`df.a`所有可能取值作为新dataframe的列，index调整为`[0]`，注意类型必须是数组类型(array-like 或者 Index)，两个轴确定后，`data`填充数据域。
-
-```python
-In [116]: dict(zip(df.a,df.val))
-Out[116]: {'apple1': 1.0, 'apple2': 2.0, 'apple3': 3.0, 'apple4': 4.0, 'apple5': 5.0}
-```
-
-
-
-#### 2 导入数据
-数据来自kaggle，共包括三个文件：
-
-1. movies.dat
-2. ratings.dat
-3. users.dat
-
-`movies.dat`包括三个字段：['Movie ID', 'Movie Title', 'Genre']
-
-使用pandas导入此文件：
-
-```python
-import pandas as pd
-
-movies = pd.read_csv('./data/movietweetings/movies.dat', delimiter='::', engine='python', header=None, names = ['Movie ID', 'Movie Title', 'Genre'])
-```
-
-导入后，显示前5行：
-
-```python
-   Movie ID                                        Movie Title  \
-0         8      Edison Kinetoscopic Record of a Sneeze (1894)   
-1        10                La sortie des usines Lumi猫re (1895)   
-2        12                      The Arrival of a Train (1896)   
-3        25  The Oxford and Cambridge University Boat Race ...   
-4        91                         Le manoir du diable (1896)   
-5       131                           Une nuit terrible (1896)   
-6       417                      Le voyage dans la lune (1902)   
-7       439                     The Great Train Robbery (1903)   
-8       443        Hiawatha, the Messiah of the Ojibway (1903)   
-9       628                    The Adventures of Dollie (1908)  
-                                          Genre  
-0                             Documentary|Short  
-1                             Documentary|Short  
-2                             Documentary|Short  
-3                                           NaN  
-4                                  Short|Horror  
-5                           Short|Comedy|Horror  
-6  Short|Action|Adventure|Comedy|Fantasy|Sci-Fi  
-7                    Short|Action|Crime|Western  
-8                                           NaN  
-9                                  Action|Short  
-```
-
-
-
-次导入其他两个数据文件
-
-`users.dat`:
-
-```python
-users = pd.read_csv('./data/movietweetings/users.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Twitter ID'])
-print(users.head())
-```
-
-结果：
-
-```python
-   User ID  Twitter ID
-0        1   397291295
-1        2    40501255
-2        3   417333257
-3        4   138805259
-4        5  2452094989
-5        6   391774225
-6        7    47317010
-7        8    84541461
-8        9  2445803544
-9       10   995885060
-```
-
-
-
-`rating.data`:
-
-```python
-ratings = pd.read_csv('./data/movietweetings/ratings.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Movie ID', 'Rating', 'Rating Timestamp'])
-print(ratings.head())
-```
-
-结果：
-
-```python
-   User ID  Movie ID  Rating  Rating Timestamp
-0        1    111161      10        1373234211
-1        1    117060       7        1373415231
-2        1    120755       6        1373424360
-3        1    317919       6        1373495763
-4        1    454876      10        1373621125
-5        1    790724       8        1374641320
-6        1    882977       8        1372898763
-7        1   1229238       9        1373506523
-8        1   1288558       5        1373154354
-9        1   1300854       8        1377165712
-```
-
- **read_csv 使用说明**
-
-说明，本次导入`dat`文件使用`pandas.read_csv`函数。
-
-第一个位置参数`./data/movietweetings/ratings.dat` 表示文件的相对路径
-
-第二个关键字参数：`delimiter='::'`，表示文件分隔符使用`::`
-
-后面几个关键字参数分别代表使用的引擎，文件没有表头，所以`header`为`None;`
-
-导入后dataframe的列名使用`names`关键字设置，这个参数大家可以记住，比较有用。
-
-
-
-Kaggle电影数据集第一节，我们使用数据处理利器 `pandas`， 函数`read_csv` 导入给定的三个数据文件。
-
-```python
-import pandas as pd
-
-movies = pd.read_csv('./data/movietweetings/movies.dat', delimiter='::', engine='python', header=None, names = ['Movie ID', 'Movie Title', 'Genre'])
-users = pd.read_csv('./data/movietweetings/users.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Twitter ID'])
-ratings = pd.read_csv('./data/movietweetings/ratings.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Movie ID', 'Rating', 'Rating Timestamp'])
-```
-
-用到的`read_csv`，某些重要的参数，如何使用在上一节也有所提到。下面开始数据探索分析(EDA)
-
-> 找出得分前10喜剧(comedy)
-
-
-
-#### 3 处理组合值
-
-表`movies`字段`Genre`表示电影的类型，可能有多个值，分隔符为`|`，取值也可能为`None`.
-
-针对这类字段取值，可使用Pandas中Series提供的`str`做一步转化，**注意它是向量级的**，下一步，如Python原生的`str`类似，使用`contains`判断是否含有`comedy`字符串：
-
-```python
-mask = movies.Genre.str.contains('comedy',case=False,na=False)
-```
-
-注意使用的两个参数：`case`, `na`
-
-case为 False，表示对大小写不敏感；
-na Genre列某个单元格为`NaN`时，我们使用的充填值，此处填充为`False`
-
-返回的`mask`是一维的`Series`，结构与 movies.Genre相同，取值为True 或 False.
-
-观察结果：
-
-```python
-0    False
-1    False
-2    False
-3    False
-4    False
-5     True
-6     True
-7    False
-8    False
-9    False
-Name: Genre, dtype: bool
-
-```
-
-
- #### 4 访问某列
-
-得到掩码mask后，pandas非常方便地能提取出目标记录：
-
-```python
-comedy = movies[mask]
-comdey_ids = comedy['Movie ID']
-
-```
-
-以上，在pandas中被最频率使用，不再解释。看结果`comedy_ids.head()`：
-
-```python
-5      131
-6      417
-15    2354
-18    3863
-19    4099
-20    4100
-21    4101
-22    4210
-23    4395
-25    4518
-Name: Movie ID, dtype: int64
-
-```
-
-
-
-1-4介绍`数据读入`，`处理组合值`，`索引数据`等, pandas中使用较多的函数，基于Kaggle真实电影影评数据集，最后得到所有`喜剧 ID`：
-
-```python
-5      131
-6      417
-15    2354
-18    3863
-19    4099
-20    4100
-21    4101
-22    4210
-23    4395
-25    4518
-Name: Movie ID, dtype: int64
-
-```
-
-下面继续数据探索之旅~
-
-#### 5 连接两个表
-
-拿到所有喜剧的ID后，要想找出其中平均得分最高的前10喜剧，需要关联另一张表：`ratings`:
-
-再回顾下ratings表结构：
-
-```python
-   User ID  Movie ID  Rating  Rating Timestamp
-0        1    111161      10        1373234211
-1        1    117060       7        1373415231
-2        1    120755       6        1373424360
-3        1    317919       6        1373495763
-4        1    454876      10        1373621125
-5        1    790724       8        1374641320
-6        1    882977       8        1372898763
-7        1   1229238       9        1373506523
-8        1   1288558       5        1373154354
-9        1   1300854       8        1377165712
-
-```
-
-
-pandas 中使用`join`关联两张表，连接字段是`Movie ID`，如果顺其自然这么使用`join`：
-
-```python
-combine = ratings.join(comedy, on='Movie ID', rsuffix='2')
-
-```
-
-左右滑动，查看完整代码
-
-大家可验证这种写法，仔细一看，会发现结果非常诡异。
-
-究其原因，这是pandas join函数使用的一个算是坑点，它在官档中介绍，连接右表时，此处右表是`comedy`，它的`index`要求是连接字段，也就是 `Movie ID`. 
-
-左表的index不要求，但是要在参数 `on`中给定。
-
-**以上是要注意的一点**
-
-修改为：
-
-```python
-combine = ratings.join(comedy.set_index('Movie ID'), on='Movie ID')
-print(combine.head(10))
-
-```
-
-以上是OK的写法
-
-观察结果：
-
-```python
-   User ID  Movie ID  Rating  Rating Timestamp Movie Title Genre
-0        1    111161      10        1373234211         NaN   NaN
-1        1    117060       7        1373415231         NaN   NaN
-2        1    120755       6        1373424360         NaN   NaN
-3        1    317919       6        1373495763         NaN   NaN
-4        1    454876      10        1373621125         NaN   NaN
-5        1    790724       8        1374641320         NaN   NaN
-6        1    882977       8        1372898763         NaN   NaN
-7        1   1229238       9        1373506523         NaN   NaN
-8        1   1288558       5        1373154354         NaN   NaN
-9        1   1300854       8        1377165712         NaN   NaN
-
-```
-
-Genre列为`NaN`表明，这不是喜剧。需要筛选出此列不为`NaN` 的记录。
-
-#### 6 按列筛选
-
-pandas最方便的地方，就是向量化运算，尽可能减少了for循环的嵌套。
-
-按列筛选这种常见需求，自然可以轻松应对。
-
-为了照顾初次接触 pandas 的朋友，分两步去写：
-
-```python
-mask = pd.notnull(combine['Genre'])
-
-```
-
-结果是一列只含`True 或 False`的值
-
-```python
-result = combine[mask]
-print(result.head())
-
-```
-
-结果中，Genre字段中至少含有一个Comedy字符串，表明验证了我们以上操作是OK的。
-
-```python
-    User ID  Movie ID  Rating  Rating Timestamp             Movie Title  \
-12        1   1588173       9        1372821281      Warm Bodies (2013)   
-13        1   1711425       3        1372604878        21 & Over (2013)   
-14        1   2024432       8        1372703553   Identity Thief (2013)   
-17        1   2101441       1        1372633473  Spring Breakers (2012)   
-28        2   1431045       7        1457733508         Deadpool (2016)   
-
-                             Genre  
-12           Comedy|Horror|Romance  
-13                          Comedy  
-14    Adventure|Comedy|Crime|Drama  
-17              Comedy|Crime|Drama  
-28  Action|Adventure|Comedy|Sci-Fi  
-
-
-```
-
-
-
-截止目前已经求出所有喜剧电影`result`，前5行如下，Genre中都含有`Comedy`字符串：
-```python
-    User ID  Movie ID  Rating  Rating Timestamp             Movie Title  \
-12        1   1588173       9        1372821281      Warm Bodies (2013)   
-13        1   1711425       3        1372604878        21 & Over (2013)   
-14        1   2024432       8        1372703553   Identity Thief (2013)   
-17        1   2101441       1        1372633473  Spring Breakers (2012)   
-28        2   1431045       7        1457733508         Deadpool (2016)   
+允许按照要求转载，但禁止用于任何商用目的。如果转载本库小例子、小案例，请备注下方链接：
 
-                             Genre  
-12           Comedy|Horror|Romance  
-13                          Comedy  
-14    Adventure|Comedy|Crime|Drama  
-17              Comedy|Crime|Drama  
-28  Action|Adventure|Comedy|Sci-Fi  
-```
+[Python小例子所有汇总](https://ai-jupyter.com/python-small-examples/)
 
+### 更多教程
 
+[AI消息](https://ai-jupyter.com/)
 
-#### 7 按照Movie ID 分组
+[AI新闻报道](https://ai-jupyter.com/ai-news-all/)
 
-result中会有很多观众对同一部电影的打分，所以要求得分前10的喜剧，先按照`Movie ID`分组，然后求出平均值：
-```python
-score_as_movie = result.groupby('Movie ID').mean()
-```
+[AI大模型](https://ai-jupyter.com/ai-llm/)
 
-前5行显示如下：
-```python
-               User ID  Rating  Rating Timestamp
-Movie ID                                        
-131       34861.000000     7.0      1.540639e+09
-417       34121.409091     8.5      1.458680e+09
-2354       6264.000000     8.0      1.456343e+09
-3863      43803.000000    10.0      1.430439e+09
-4099      25084.500000     7.0      1.450323e+09
-```
+[AI工具集](https://ai-jupyter.com/ai-chatgpt/)
 
-#### 8 按照电影得分排序
+[ChatGPT4o免费使用六种方法](https://ai-jupyter.com/ai-chatgpt/)
 
-```python
-score_as_movie.sort_values(by='Rating', ascending = False,inplace=True)
-score_as_movie
-```
-前5行显示如下：
-```python
-	User ID	Rating	Rating Timestamp
-Movie ID			
-7134690	30110.0	10.0	1.524974e+09
-416889	1319.0	10.0	1.543320e+09
-57840	23589.0	10.0	1.396802e+09
-5693562	50266.0	10.0	1.511024e+09
-5074	43803.0	10.0	1.428352e+09
-```
-都是满分？这有点奇怪，会不会这些电影都只有几个人评分，甚至只有1个？评分样本个数太少，显然最终的平均分数不具有太强的说服力。
+[Python教程](https://ai-jupyter.com/python-packages/)
 
-所以，下面要进行每部电影的评分人数统计
+[数据分析教程](https://ai-jupyter.com/numpy-intro/)
 
-#### 9 分组后使用聚合函数
+[算法教程](https://ai-jupyter.com/algorithm-basic/)
 
-根据`Movie ID`分组后，使用`count`函数统计`每组个数`，只保留count列，最后得到`watchs2`:
+[AI教程](https://ai-jupyter.com/statistics/)
 
-```python
-watchs = result.groupby('Movie ID').agg(['count'])
-watchs2 = watchs['Rating']['count']
-```
-打印前20行：
-```python
-print(watchs2.head(20))
-```
-结果：
-```python
-Movie ID
-131      1
-417     22
-2354     1
-3863     1
-4099     2
-4100     1
-4101     1
-4210     1
-4395     1
-4518     1
-4546     2
-4936     2
-5074     1
-5571     1
-6177     1
-6414     3
-6684     1
-6689     1
-7145     1
-7162     2
-Name: count, dtype: int64
-```
-果然，竟然有这么多电影的评论数只有1次！样本个数太少，评论的平均值也就没有什么说服力。
+[Git教程](https://ai-jupyter.com/git/)
 
-查看`watchs2`一些重要统计量：
-```python
-watchs2.describe()
-```
-结果：
-```python
-count    10740.000000
-mean        20.192086
-std         86.251411
-min          1.000000
-25%          1.000000
-50%          2.000000
-75%          7.000000
-max       1843.000000
-Name: count, dtype: float64
-```
-共有10740部**喜剧**电影被评分，平均打分次数20次，标准差86，75%的电影样本打分次数小于7次，最小1次，最多1843次。
+[程序员](https://ai-jupyter.com/others/)
 
-#### 10 频率分布直方图
+[资料下载](https://ai-jupyter.com/python-20/)
 
-绘制评论数的频率分布直方图，便于更直观的观察电影被评论的分布情况。上面分析到，75%的电影打分次数小于7次，所以绘制打分次数小于20次的直方图：
 
-```python
-fig = plt.figure(figsize=(12,8))
-histn = plt.hist(watchs2[watchs2 <=19],19,histtype='step')
-plt.scatter([i+1 for i in range(len(histn[0]))],histn[0])
-```
-
-![](./img/20200131094927.jpg)
-
-`histn`元祖表示个数和对应的被分割的区间，查看`histn[0]`:
-```python
-array([4383., 1507.,  787.,  541.,  356.,  279.,  209.,  163.,  158.,
-        118.,  114.,   90.,  104.,   81.,   80.,   73.,   62.,   65.,
-         52.])
-```
-```python
-sum(histn[0]) # 9222
-```
-看到电影评论次数1到19次的喜剧电影9222部，共有10740部喜剧电影，大约`86%`的喜剧电影评论次数`小于20次`，有`1518`部电影评论数不小于20次。
-
-我们肯定希望挑选出被评论次数尽可能多的电影，因为难免会有水军和滥竽充数等`异常评论`行为。那么，如何准确的量化最小抽样量呢？
-
-
-
-#### 11 最小抽样量
-
-根据统计学的知识，最小抽样量和Z值、样本方差和样本误差相关，下面给出具体的求解最小样本量的计算方法。
-
-采用如下计算公式：
-
-$$ n = \frac{Z^2\sigma^2}{E^2} $$
-
-
-此处，$Z$ 值取为95%的置信度对应的Z值也就是1.96，样本误差取为均值的2.5%.
-
-根据以上公式，编写下面代码：
-
-```python
-n3 = result.groupby('Movie ID').agg(['count','mean','std'])
-n3r = n3[n3['Rating']['count']>=20]['Rating']
-```
-只计算影评超过20次，且满足最小样本量的电影。计算得到的`n3r`前5行：
-```python
-	count	mean	std
-Movie ID			
-417	22	8.500000	1.263027
-12349	68	8.485294	1.227698
-15324	20	8.350000	1.039990
-15864	51	8.431373	1.374844
-17925	44	8.636364	1.259216
-```
-进一步求出最小样本量：
-```python
-nmin = (1.96**2*n3r['std']**2) / ( (n3r['mean']*0.025)**2 )
-```
-`nmin`前5行：
-```python
-Movie ID
-417         135.712480
-12349       128.671290
-15324        95.349276
-15864       163.434005
-17925       130.668350
-```
-
-筛选出满足最小抽样量的喜剧电影：
-
-```python
-n3s = n3r[ n3r['count'] >= nmin ]
-```
-结果显示如下，因此共有`173`部电影满足最小样本抽样量。
-
-```python
-
-count	mean	std
-Movie ID			
-53604	129	8.635659	1.230714
-57012	207	8.449275	1.537899
-70735	224	8.839286	1.190799
-75686	209	8.095694	1.358885
-88763	296	8.945946	1.026984
-...	...	...	...
-6320628	860	7.966279	1.469924
-6412452	276	7.510870	1.389529
-6662050	22	10.000000	0.000000
-6966692	907	8.673649	1.286455
-7131622	1102	7.851180	1.751500
-173 rows × 3 columns
-```
-
-#### 12 去重和连表
-
-按照平均得分从大到小排序：
-```python
-n3s_sort = n3s.sort_values(by='mean',ascending=False)
-```
-结果：
-```python
-	count	mean	std
-Movie ID			
-6662050	22	10.000000	0.000000
-4921860	48	10.000000	0.000000
-5262972	28	10.000000	0.000000
-5512872	353	9.985836	0.266123
-3863552	199	9.010050	1.163372
-...	...	...	...
-1291150	647	6.327666	1.785968
-2557490	546	6.307692	1.858434
-1478839	120	6.200000	0.728761
-2177771	485	6.150515	1.523922
-1951261	1091	6.083410	1.736127
-173 rows × 3 columns
-```
-仅靠`Movie ID`还是不知道哪些电影，连接`movies`表：
-```python
-ms = movies.drop_duplicates(subset=['Movie ID'])
-ms = ms.set_index('Movie ID')
-n3s_final = n3s_drops.join(ms,on='Movie ID')
-```
-
-#### 13 结果分析
-
-喜剧榜单前50名：
-```python
-Movie Title
-Five Minutes (2017)
-MSG 2 the Messenger (2015)
-Avengers: Age of Ultron Parody (2015)
-Be Somebody (2016)
-Bajrangi Bhaijaan (2015)
-Back to the Future (1985)
-La vita 鐚?bella (1997)
-The Intouchables (2011)
-The Sting (1973)
-Coco (2017)
-Toy Story 3 (2010)
-3 Idiots (2009)
-Green Book (2018)
-Dead Poets Society (1989)
-The Apartment (1960)
-P.K. (2014)
-The Truman Show (1998)
-Am鑼卨ie (2001)
-Inside Out (2015)
-Toy Story 4 (2019)
-Toy Story (1995)
-Finding Nemo (2003)
-Dr. Strangelove or: How I Learned to Stop Worrying and Love the Bomb (1964)
-Home Alone (1990)
-Zootopia (2016)
-Up (2009)
-Monsters, Inc. (2001)
-La La Land (2016)
-Relatos salvajes (2014)
-En man som heter Ove (2015)
-Snatch (2000)
-Lock, Stock and Two Smoking Barrels (1998)
-How to Train Your Dragon 2 (2014)
-As Good as It Gets (1997)
-Guardians of the Galaxy (2014)
-The Grand Budapest Hotel (2014)
-Fantastic Mr. Fox (2009)
-Silver Linings Playbook (2012)
-Sing Street (2016)
-Deadpool (2016)
-Annie Hall (1977)
-Pride (2014)
-In Bruges (2008)
-Big Hero 6 (2014)
-Groundhog Day (1993)
-The Breakfast Club (1985)
-Little Miss Sunshine (2006)
-Deadpool 2 (2018)
-The Terminal (2004)
-```
-
-前10名评论数图：
-
-![](./img/2020013109495711.jpg)
-
-代码：
-```python
-x = n3s_final['Movie Title'][:10].tolist()[::-1]
-y = n3s_final['count'][:10].tolist()[::-1]
-bar = (
-    Bar()
-    .add_xaxis(x)
-    .add_yaxis('评论数',y,category_gap='50%')
-    .reversal_axis()
-    .set_global_opts(title_opts=opts.TitleOpts(title="喜剧电影被评论次数"),
-                    toolbox_opts=opts.ToolboxOpts(),)
-)
-grid = (
-    Grid(init_opts=opts.InitOpts(theme=ThemeType.LIGHT))
-    .add(bar, grid_opts=opts.GridOpts(pos_left="30%"))
-)
-grid.render_notebook()
-```
-
-前10名得分图：
-
-![](./img/2020013109500812.jpg)
-
-代码：
-```python
-x = n3s_final['Movie Title'][:10].tolist()[::-1]
-y = n3s_final['mean'][:10].round(3).tolist()[::-1]
-bar = (
-    Bar()
-    .add_xaxis(x)
-    .add_yaxis('平均得分',y,category_gap='50%')
-    .reversal_axis()
-    .set_global_opts(title_opts=opts.TitleOpts(title="喜剧电影平均得分"),
-                    xaxis_opts=opts.AxisOpts(min_=8.0,name='平均得分'),
-                    toolbox_opts=opts.ToolboxOpts(),)
-)
-grid = (
-    Grid(init_opts=opts.InitOpts(theme=ThemeType.MACARONS))
-    .add(bar, grid_opts=opts.GridOpts(pos_left="30%"))
-)
-grid.render_notebook()
-```
-
-
-
-#### 14 生成哑变量
-
-分类变量的数值化，是指将枚举类变量转化为indicator变量或称dummy变量。
-
-那么什么是`indicator变量`，看看如下例子，A变量解析为：`[1,0,0]`, B解析为：`[0,1,0]`, C解析为：`[0,0,1]`
-```python
-In [8]: s = pd.Series(list('ABCA'))
-In [9]: pd.get_dummies(s)
-Out[9]:
-   A  B  C
-0  1  0  0
-1  0  1  0
-2  0  0  1
-3  1  0  0
-```
-
-如果输入的字符有4个唯一值，看到字符a被解析为[1,0,0,0]，向量长度为4.
-
-```python
-In [5]: s = pd.Series(list('abaccd'))
-In [6]: pd.get_dummies(s)
-Out[6]:
-   a  b  c  d
-0  1  0  0  0
-1  0  1  0  0
-2  1  0  0  0
-3  0  0  1  0
-4  0  0  1  0
-5  0  0  0  1
-```
-
-也就是说dummy向量的长度等于输入字符串中，唯一字符的个数。
-
-#### 15 讨厌的SettingWithCopyWarning！！！
-
-Pandas 处理数据，太好用了，谁用谁知道！
-
-使用过 Pandas 的，几乎都会遇到一个警告：
-
-*SettingWithCopyWarning*
-
-非常烦人！
-
-尤其是刚接触 Pandas 的，完全不理解为什么弹出这么一串：
-
-```python
-d:\source\test\settingwithcopy.py:9: SettingWithCopyWarning:
-A value is trying to be set on a copy of a slice from a DataFrame.
-Try using .loc[row_indexer,col_indexer] = value instead
-
-See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy 
-```
-
-归根结底，是因为代码中出现`链式操作`...
-
-有人就问了，什么是`链式操作`?
-
-这样的：
-
-```python
-tmp = df[df.a<4]
-tmp['c'] = 200
-```
-
-先记住这个最典型的情况，即可！
-
-有的人就问了：出现这个 Warning, 需要理会它吗？ 
-
-如果结果不对，当然要理会；如果结果对，不care.
-
-举个例子~~
-
-```python
-import pandas as  pd
-
-df = pd.DataFrame({'a':[1,3,5],'b':[4,2,7]},index=['a','b','c'])
-df.loc[df.a<4,'c'] = 100
-print(df)
-print('it\'s ok')
-
-tmp = df[df.a<4]
-tmp['c'] = 200
-print('-----tmp------')
-print(tmp)
-print('-----df-------')
-print(df)
-```
-
-输出结果：
-```python
-   a  b      c
-a  1  4  100.0
-b  3  2  100.0
-c  5  7    NaN
-it's ok
-d:\source\test\settingwithcopy.py:9: SettingWithCopyWarning:
-A value is trying to be set on a copy of a slice from a DataFrame.
-Try using .loc[row_indexer,col_indexer] = value instead
-
-See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy     
-  tmp['c'] = 200
------tmp------
-   a  b    c
-a  1  4  200
-b  3  2  200
------df-------
-   a  b      c
-a  1  4  100.0
-b  3  2  100.0
-c  5  7    NaN
-```
-
-it's ok 行后面的发生链式赋值，导致结果错误。因为 tmp 变了，df 没赋上值啊，所以必须理会。
-
-it's ok 行前的是正解。
-
-以上，链式操作尽量避免，如何避免？多使用 `.loc[row_indexer,col_indexer]`，提示告诉我们的~
-
-#### 16 NumPy 数据归一化、分布可视化
-
-仅使用 `NumPy`，下载数据，归一化，使用 `seaborn` 展示数据分布。
-
-**下载数据**
-
-```python
-import numpy as np
-
-url = 'https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data'
-wid = np.genfromtxt(url, delimiter=',', dtype='float', usecols=[1])
-```
-仅提取 `iris` 数据集的第二列 `usecols = [1]`
-
-**展示数据**
-
-```python
-array([3.5, 3. , 3.2, 3.1, 3.6, 3.9, 3.4, 3.4, 2.9, 3.1, 3.7, 3.4, 3. ,
-       3. , 4. , 4.4, 3.9, 3.5, 3.8, 3.8, 3.4, 3.7, 3.6, 3.3, 3.4, 3. ,
-       3.4, 3.5, 3.4, 3.2, 3.1, 3.4, 4.1, 4.2, 3.1, 3.2, 3.5, 3.1, 3. ,
-       3.4, 3.5, 2.3, 3.2, 3.5, 3.8, 3. , 3.8, 3.2, 3.7, 3.3, 3.2, 3.2,
-       3.1, 2.3, 2.8, 2.8, 3.3, 2.4, 2.9, 2.7, 2. , 3. , 2.2, 2.9, 2.9,
-       3.1, 3. , 2.7, 2.2, 2.5, 3.2, 2.8, 2.5, 2.8, 2.9, 3. , 2.8, 3. ,
-       2.9, 2.6, 2.4, 2.4, 2.7, 2.7, 3. , 3.4, 3.1, 2.3, 3. , 2.5, 2.6,
-       3. , 2.6, 2.3, 2.7, 3. , 2.9, 2.9, 2.5, 2.8, 3.3, 2.7, 3. , 2.9,
-       3. , 3. , 2.5, 2.9, 2.5, 3.6, 3.2, 2.7, 3. , 2.5, 2.8, 3.2, 3. ,
-       3.8, 2.6, 2.2, 3.2, 2.8, 2.8, 2.7, 3.3, 3.2, 2.8, 3. , 2.8, 3. ,
-       2.8, 3.8, 2.8, 2.8, 2.6, 3. , 3.4, 3.1, 3. , 3.1, 3.1, 3.1, 2.7,
-       3.2, 3.3, 3. , 2.5, 3. , 3.4, 3. ])
-      
-```
-
-这是单变量(univariate)长度为 150 的一维 NumPy 数组。
-
-**归一化**
-
-求出最大值、最小值
-```python
-smax = np.max(wid)
-smin = np.min(wid)
-
-In [51]: smax,smin
-Out[51]: (4.4, 2.0)
-````
-归一化公式：
-```python
-s = (wid - smin) / (smax - smin)
-```
-只打印小数点后三位设置：
-```python
-np.set_printoptions(precision=3)  
-```
-
-归一化结果：
-```markdown
-array([0.625, 0.417, 0.5  , 0.458, 0.667, 0.792, 0.583, 0.583, 0.375,
-       0.458, 0.708, 0.583, 0.417, 0.417, 0.833, 1.   , 0.792, 0.625,
-       0.75 , 0.75 , 0.583, 0.708, 0.667, 0.542, 0.583, 0.417, 0.583,
-       0.625, 0.583, 0.5  , 0.458, 0.583, 0.875, 0.917, 0.458, 0.5  ,
-       0.625, 0.458, 0.417, 0.583, 0.625, 0.125, 0.5  , 0.625, 0.75 ,
-       0.417, 0.75 , 0.5  , 0.708, 0.542, 0.5  , 0.5  , 0.458, 0.125,
-       0.333, 0.333, 0.542, 0.167, 0.375, 0.292, 0.   , 0.417, 0.083,
-       0.375, 0.375, 0.458, 0.417, 0.292, 0.083, 0.208, 0.5  , 0.333,
-       0.208, 0.333, 0.375, 0.417, 0.333, 0.417, 0.375, 0.25 , 0.167,
-       0.167, 0.292, 0.292, 0.417, 0.583, 0.458, 0.125, 0.417, 0.208,
-       0.25 , 0.417, 0.25 , 0.125, 0.292, 0.417, 0.375, 0.375, 0.208,
-       0.333, 0.542, 0.292, 0.417, 0.375, 0.417, 0.417, 0.208, 0.375,
-       0.208, 0.667, 0.5  , 0.292, 0.417, 0.208, 0.333, 0.5  , 0.417,
-       0.75 , 0.25 , 0.083, 0.5  , 0.333, 0.333, 0.292, 0.542, 0.5  ,
-       0.333, 0.417, 0.333, 0.417, 0.333, 0.75 , 0.333, 0.333, 0.25 ,
-       0.417, 0.583, 0.458, 0.417, 0.458, 0.458, 0.458, 0.292, 0.5  ,
-       0.542, 0.417, 0.208, 0.417, 0.583, 0.417])
-```
-
-**分布可视化**
-
-```python
-import seaborn as sns
-sns.distplot(s,kde=False,rug=True)
-```
-频率分布直方图：
-
-
-![](https://imgkr.cn-bj.ufileos.com/49bf5190-429c-4172-a53c-e3f6b66d4e64.png)
-
-
-```python
-sns.distplot(s,hist=True,kde=True,rug=True)
-```
-带高斯密度核函数的直方图：
-
-![](https://imgkr.cn-bj.ufileos.com/4e4a72a5-8f59-4893-b435-e4b57e22a18e.png)
-
-
-
-**分布 fit 图**
-
-拿 `gamma` 分布去 fit ：
-```python
-from scipy import stats
-sns.distplot(s, kde=False, fit = stats.gamma)
-```
-
-
-
-![](https://imgkr.cn-bj.ufileos.com/89446755-7420-4f96-97fe-c4e45d0d3dec.png)
-
-
-拿双 `gamma` 去 fit：
-```python
-from scipy import stats
-sns.distplot(s, kde=False, fit = stats.dgamma)
-```
-
-![](https://imgkr.cn-bj.ufileos.com/f2c2a660-5433-4b4f-ad7b-d01da4121319.png)
-
-#### 17 Pandas 使用技巧
-
-对于动辄就几十或几百个 G 的数据，在读取的这么大数据的时候，我们有没有办法随机选取一小部分数据，然后读入内存，快速了解数据和开展 EDA ？
-
-使用 Pandas 的 skiprows 和 概率知识，就能做到。
-
-下面解释具体怎么做。
-
-如下所示，读取某 100 G 大小的 big_data.csv 数据
-
-1) 使用 skiprows 参数，
-
-2) x > 0 确保首行读入， 
-
-3) np.random.rand() > 0.01 表示 99% 的数据都会被随机过滤掉
-
-言外之意，只有全部数据的 1% 才有机会选入内存中。
-
-```python
-import pandas as pd
-import numpy as np
-
-df = pd.read_csv("big_data.csv", 
-skiprows = 
-lambda x: x>0 and np.random.rand() > 0.01)
-
-print("The shape of the df is {}. 
-It has been reduced 100 times!".format(df.shape))
-```
-
-使用这种方法，读取的数据量迅速缩减到原来的 1% ，对于迅速展开数据分析有一定的帮助。
-
-### 十一、一步一步掌握Flask web开发
-
-#### 1 Flask版 hello world
-
-Flask是Python轻量级web框架，容易上手，被广大Python开发者所喜爱。
-
-今天我们先从hello world开始，一步一步掌握Flask web开发。例子君是Flask框架的小白，接下来与读者朋友们，一起学习这个对我而言的新框架，大家多多指导。
-
-首先`pip install Flask`,安装Flask，然后import Flask，同时创建一个 `app`
-```python
-from flask import Flask
-
-App = Flask(__name__)
-```
-
-写一个index页的入口函数，返回hello world.
-
-通过装饰器：App.route('/')创建index页的路由或地址，一个`/`表示index页，也就是主页。
-
-```python
-@App.route('/')
-def index():
-    return "hello world"
-```
-
-调用 `index`函数:
-```python
-if __name__ == "__main__":
-    App.run(debug=True)
-```
-
-然后启动，会在console下看到如下启动信息，表明`服务启动成功`。
-```python
-* Debug mode: on
- * Restarting with stat
- * Debugger is active!
- * Debugger PIN: 663-788-611
- * Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)
-```
-
- 接下来，打开一个网页，相当于启动客户端，并在Url栏中输入：`http://127.0.0.1:5000/`，看到页面上答应出`hello world`，证明服务访问成功。
-
- 同时在服务端后台看到如下信息，表示处理一次来自客户端的`get`请求。
- ```python
- 27.0.0.1 - - [03/Feb/2020 21:26:50] "GET / HTTP/1.1" 200 -
- ```
-
- 以上就是flask的hello world 版
-
-#### 2 Flask之数据入库操作
-
-数据持久化就是将数据写入到数据库存储的过程。
-
-本例子使用`sqlite3`数据库。
-
-1)导入`sqlite3`，未安装前使用命令`pip install sqlite3`
-
-创建一个`py`文件：`sqlite3_started.py`，并写下第一行代码：
-```python
-import sqlite3
-```
-2)手动创建一个数据库实例`db`, 命名`test.db`
-
-3)创建与数据库实例`test.db`的连接:
-```python
-conn = sqlite3.connect("test.db")
-```
-
-4)拿到连接`conn`的cursor
-```python
-c = conn.cursor()
-```
-
-5)创建第一张表`books`
-
-共有四个字段：`id`,`sort`,`name`,`price`，类型分别为：`int`,`int`,`text`,`real`. 其中`id`为`primary key`. 主键的取值必须是唯一的(`unique`)，否则会报错。
-
-
-```python
-c.execute('''CREATE TABLE books
-      (id int primary key,
-       sort int,
-       name text,
-       price real)''')
-```
-第一次执行上面语句，表`books`创建完成。当再次执行时，就会报`重复建表`的错误。需要优化脚本，检查表是否存在`IF NOT EXISTS books`，不存在再创建：
-```python
-c.execute('''CREATE TABLE IF NOT EXISTS books
-      (id int primary key,
-       sort int,
-       name text,
-       price real)''')
-```
-
-6)插入一行记录
-
-共为4个字段赋值
-
-```python
-c.execute('''INSERT INTO books VALUES
-       (1, 
-       1, 
-       'computer science',
-       39.0)''')
-```
-
-7)一次插入多行记录
-
-先创建一个list:`books`，使用`executemany`一次插入多行。
-```python
-books = [(2, 2, 'Cook book', 68),
-         (3, 2, 'Python intro', 89),
-         (4, 3, 'machine learning', 59),
-         ]
-
-
-c.executemany('INSERT INTO books VALUES (?, ?, ?, ?)', books)
-```
-
-8)提交
-
-提交后才会真正生效，写入到数据库
-
-```python
-conn.commit()
-```
-
-9)关闭期初建立的连接conn
-
-务必记住手动关闭，否则会出现内存泄漏
-```python
-conn.close()
-print('Done')
-```
-
-10)查看结果
-例子君使用`vs code`，在扩展库中选择：`SQLite`安装。
-
-![image-20200208211721377](./img/image-20200208211721377.png)
-
-新建一个`sq`文件：`a.sql`，内容如下：
-
-```sql
-SELECT * from books 
-```
-右键`run query`，得到表`books`插入的4行记录可视化图：
-
-![image-20200208211806853](./img/image-20200208211806853.png)
-
-以上十步就是sqlite3写入数据库的主要步骤，作为Flask系列的第二篇，为后面的前端讲解打下基础。
-
-#### 3 Flask各层调用关系
-
-这篇介绍Flask和B/S模式，即浏览器/服务器模式，是接下来快速理解Flask代码的关键理论篇：**理解Views、models和渲染模板层的调用关系**。
-
-1) 发出请求
-
-当我们在浏览器地址栏中输入某个地址，按回车后，完成第一步。
-
-2) 视图层 views接收1)步发出的请求，Flask中使用解释器的方式处理这个求情，实例代码如下，它通常涉及到调用models层和模板文件层
-
-```python
-@main_blue.route('/', methods=['GET', 'POST'])
-def index():
-    form = TestForm()
-    print('test')
-```
-
-3) models层会负责创建数据模型，执行CRUD操作
-
-4) 模板文件层处理html模板
-
-5) 组合后返回html
-
-6) models层和html模板组合后返回给views层
-
-7）最后views层响应并渲染到浏览器页面，我们就能看到请求的页面。
-
-完整过程图如下所示：
-
-![image-20200211152007983](./img/web1.png)
-
-读者朋友们，如果你和例子君一样都是初学Flask编程，需要好好理解上面的过程。理解这些对于接下来的编程会有一定的理论指导，方向性指导价值。
-
-### Python 问答
-
-#### Python 如何生成二维码？
-
-
-
-
-
-## qrcode
-
-今天先来解答如何生成二维码。Python的`qrcode`包支持生成二维码。
-
-用法也很简单：
-
-```python
-import qrcode
-
-# 二维码内容
-data = "http://www.zglg.work/wp-content/uploads/2020/10/image-3.png"
-# 生成二维码
-img = qrcode.make(data=data)
-# 直接显示二维码
-img.show()
-# 保存二维码为文件
-img.save("我的微信.jpg")
-```
-
-生成的二维码如下：
-
-![](https://imgkr2.cn-bj.ufileos.com/f0b08c53-0107-483b-bbe5-072bebc58e8d.png?UCloudPublicKey=TOKEN_8d8b72be-579a-4e83-bfd0-5f6ce1546f13&Signature=rVtaeBWhzLPPq%252BFCVtiOv6rS0tI%253D&Expires=1603544615)
-
-
-大家微信扫描后，会出现我的二维码。
+## Python 小例子
 
-另外，还可以设置二维码的颜色等样式：
+### 基本操作
 
-```python
-import qrcode
+| 小例子 | 链接                    | 标签              | 版本          | 难度 |
+| ---- | ---------------------------------- | ---- | ---- | ---- |
+|1   | [常见算术运算](md/198.md)|	运算  |	v1|	⭐⭐ |
+| 2    | [实现 relu](md/1.md) | max   | V4.0        | ⭐️⭐️ |
+| 3    | [进制转化](md/2.md)                | bin,oct,hex | V4.0            |  ⭐️⭐️|
+| 4    | [整数和ASCII互转](md/3.md)         | chr,ord | V1.0     | ⭐️⭐️ |
+| 5    | [元素都为真检查](md/4.md)          | all   | V2.0      | ⭐️⭐️⭐️ |
+| 6    | [至少一个为真检查](md/5.md)        | any | V2.0    | ⭐️⭐️⭐️ |
+| 7    | [判断是真是假](md/6.md)            | bool        | V2.0        | ⭐️⭐️⭐️ |
+| 8    | [创建复数](md/7.md)                | complex         | V1.0            | ⭐️⭐️⭐️ |
+| 9    | [取商和余数](md/8.md)              | divmod        | V1.0          | ⭐️⭐️ |
+| 10    | [转为浮点类型](md/9.md)            | float       | V1.0        | ⭐️⭐️ |
+| 11   | [转为整型](md/10.md)               | int            | V1.0           | ⭐️ |
+| 12   | [次幂](md/11.md)                   | pow                | V1.0               | ⭐️ |
+| 13   | [四舍五入](md/12.md)               | round          | V1.0           | ⭐️ |
+| 14   | [链式比较](md/13.md)               | compare       | V1.0           | ⭐️⭐️ |
+| 15   | [字符串转字节](md/14.md)           | bytes,utf-8 | V1.0       | ⭐️⭐️ |
+| 16   | [任意对象转为字符串](md/15.md)     | str  | V1.0 | ⭐️⭐️ |
+| 17   | [执行字符串表示的代码](md/16.md)   | compile | V1.0 | ⭐️⭐️⭐️ |
+| 18   | [计算表达式](md/17.md)             | eval         | V1.0         | ⭐️⭐️⭐️⭐️ |
+| 19   | [字符串格式化](md/18.md)           | format     | V1.0       | ⭐️⭐️⭐️⭐️ |
+| 20   | [交换两元素](md/23.md)             | pack,unpack | V1.0         | ⭐️⭐️ |
+| 21   | [转为字典](md/29.md)               | dict           | V1.0           | ⭐️⭐️ |
+| 22   | [冻结集合](md/30.md)               | frozenset | V1.0           | ⭐️⭐️ |
+| 23   | [转为集合类型](md/31.md)           | set        | V1.0       | ⭐️⭐️ |
+| 24   | [转元组](md/32.md)                 | tuple            | V1.0             | ⭐️⭐️ |
+| 25   | [查看变量所占字节数](md/48.md)  | getsizeof | V1.0 | ⭐️⭐️⭐️ |
+| 26 | [含单个元素的元组](md/154.md) | tuple | V1.0 | ⭐️⭐ |
+| 27 | [列表删除之坑](md/159.md) | list | V1.0 | ⭐️⭐ |
+| 28 | [列表快速复制之坑](md/160.md) | list  | V1.0 | ⭐️⭐⭐ |
+| 29 | [发现列表前3个最大或最小数](md/195.md) | list heapq | v1.0 | ⭐️⭐⭐⭐ |
+| 30 | [字符串驻留](md/161.md) | str  | V1.0 | ⭐️⭐⭐⭐⭐ |
+| 31 | [创建空集合错误](md/166.md) | set  | V1.0 | ⭐️⭐ |
+| 32 | [充分认识for](md/164.md) | for  | V1.0 | ⭐️⭐⭐ |
+| 33 | [认识执行时机](md/165.md) | generator  | V1.0 | ⭐️⭐⭐⭐⭐ |
+
+
+### 函数和模块常见用法
 
-# 实例化二维码生成类
-qr = qrcode.QRCode(border=2)
-# 设置二维码数据
-data = "http://www.zglg.work/wp-content/uploads/2020/10/image-3.png"
-qr.add_data(data=data)
-# 启用二维码颜色设置
-qr.make(fit=True)
-img = qr.make_image(fill_color="orange", back_color="white")
+| 小例子 | 链接                    | 标签              | 版本          | 难度 |
+| ---- | ---------------------------------- | ---- | ---- | ---- |
+| 1   | [操作函数对象](md/24.md)           | operator   | V2.0       | ⭐️⭐️⭐️⭐️ |
+| 2   | [创建range序列](md/55.md) | range | V1.0 | ⭐️⭐️ |
+| 3   | [生成逆序序列](md/25.md)           | range | V1.0       | ⭐️⭐️ |
+| 4   | [拿来就用的排序函数](md/19.md)     | sorted | V1.0 | ⭐️⭐️⭐️ |
+| 5   | [求和函数](md/20.md)               | sum            | V1.0           | ⭐️⭐️ |
+| 6   | [函数的五类参数使用例子](md/26.md) | variable parameter | V2.0 | ⭐️⭐️⭐️⭐️ |
+| 7   | [使用slice对象](md/27.md)          | slice     | V2.0      | ⭐️⭐️⭐️⭐️⭐️ |
+| 8   | [lambda 函数](md/28.md)  | lambda | V3.0 | ⭐️⭐️⭐️⭐️ |
+| 9   | [枚举对象](md/47.md)       | enumerate | V1.0   | ⭐️⭐️⭐️ |
+| 10   | [过滤器filter](md/49.md)  | filter | V1.5 | ⭐️⭐️⭐️ |
+| 11   | [返回对象哈希值](md/50.md)    | hash | V1.0 | ⭐️⭐️ |
+| 12   | [带名字的元组](md/79.md) | namedtuple | V1.0 | ⭐️⭐️⭐️ |
+| 13   | [一行代码实现列表反转](md/70.md) | reverse | V1.0 | ⭐️⭐️ |
+| 14   | [反转字符串的两个方法](md/86.md) | reversed | V1.0 | ⭐️⭐️ |
+| 15   | [join 串联字符串](md/87.md) | join | V1.0 | ⭐️⭐️ |
+| 16   | [字符串字节长度](md/88.md) | encode | V1.0 | ⭐️⭐️ |
+| 17 | [groupby单字段分组](md/129.md) | itertools, groupby,lambda | V1.0 | ⭐️⭐️⭐️ |
+| 18 | [groupby多字段分组](md/130.md) | itemgetter,itertools,groupby | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 19 | [itemgetter和key函数](md/131.md) | operator,itemgetter,itertools | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 20 | [sum函数计算和聚合同时做](md/132.md) | sum,generator | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 21 | [默认参数设为空](md/155.md) | function | V1.0 | ⭐️⭐⭐ |
+| 22 | [各种参数使用之坑](md/158.md) | function paremeter | V1.0 | ⭐️⭐⭐ |
+| 23 | [lambda自由参数之坑](md/157.md) | lambda | V1.0 | ⭐️⭐⭐ |
+| 24 | [使用堆升序列表](md/196.md) | sort heapq | v1.0 | ⭐️⭐⭐⭐ |
+
+
+### 面向对象
+| 小例子 | 链接                    | 标签              | 版本          | 难度 |
+| ---- | ---------------------------------- | ---- | ---- | ---- |
+| 1   | [所有对象之根](md/43.md)           | object     | V1.0       | ⭐️ |
+| 2   | [对象是否可调用](md/33.md)         | callable | V2.5   | ⭐️⭐️⭐️⭐️ |
+| 3   | [ascii 展示对象](md/34.md)         | `__repr__` | V2.5     | ⭐️⭐️⭐️ |
+| 4   | [类方法](md/35.md)                 | classmethod      | V1.5             | ⭐️⭐️⭐️ |
+| 5   | [动态删除属性](md/36.md)           | delattr,hasattr | V1.5       | ⭐️⭐️ |
+| 6   | [一键查看对象所有方法](md/37.md)   | dir | V1.5 | ⭐️⭐️ |
+| 7   | [动态获取对象属性](md/38.md)       | getattr | V1.5   | ⭐️⭐️ |
+| 8   | [对象是否有某个属性](md/39.md)     | hasattr | V1.5 | ⭐️⭐️⭐️ |
+| 9   | [对象门牌号](md/40.md)             | id           | V1.0         | ⭐️ |
+| 10   | [实例和对象关系判断](md/41.md)    | isinstance   | V1.5         | ⭐️⭐️⭐️ |
+| 11   | [issubclass父子关系鉴定](md/42.md) | issubclass | V1.5 | ⭐️⭐️⭐️ |
+| 12   | [创建属性的两种方法](md/44.md)     | property | V2.5 | ⭐️⭐️⭐️⭐️⭐️ |
+| 13   | [查看对象类型](md/45.md)           | type      | V1.0       | ⭐️ |
+| 14   | [元类使用介绍](md/46.md)     | type,`__class__` | V2.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 15 | [相同值的不可变对象](md/162.md) | mutable  | V1.0 | ⭐️⭐⭐ |
+| 16 | [对象销毁顺序](md/163.md) | OOP del   | V1.0 | ⭐️⭐⭐⭐ |
+| 17 | [子类继承父类的静态方法吗？](md/171.md) | staticmethod | V1.0 | ⭐️⭐⭐ |
+
+
+
+### 正则
+| 小例子 | 链接                    | 标签              | 版本          | 难度 |
+| ---- | ---------------------------------- | ---- | ---- | ---- |
+| 1   | [正则中字符 `r`作用](md/89.md) | re,r | V3.0 | ⭐️⭐️⭐️ |
+| 2   | [正则原子操作](md/90.md) | re | V3.0 | ⭐️⭐️⭐️ |
+| 3   | [正则中的转义](md/91.md) | re,\ | V3.0 | ⭐️⭐️⭐️ |
+| 4   | [正则最普通查找](md/92.md) | re,findall | V3.0 | ⭐️⭐️⭐️ |
+| 5   | [使用通用字符查找](md/93.md) | re,\s,\w,\d | V3.0 | ⭐️⭐️⭐️ |
+| 6   | [使用元字符查找](md/94.md) | re,+,* | V3.0 | ⭐️⭐️⭐️ |
+| 7   | [捕获子串](md/95.md) | () | V3.0 | ⭐️⭐️⭐️⭐️ |
+| 8   | [贪心捕获和非贪心捕获](md/96.md) | re | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 9   | [使用正则做密码安全检查](md/97.md) | re | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 10   | [爬取百度首页标题](md/98.md) | re | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 11   | [批量转化为驼峰格式(Camel)](md/99.md) | re | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 12   | [使用正则判断是否为正浮点数](md/102.md) | str,re,float | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 13 | [使用正则提取正整数和大于0的浮点数](md/197.md) | re findall | v2 | ⭐️⭐⭐⭐ |
+
+### 装饰器迭代器生成器
+| 小例子 | 链接                    | 标签              | 版本          | 难度 |
+| ---- | ---------------------------------- | ---- | ---- | ---- |
+| 1 | [通俗理解装饰器](md/138.md) | decorator | V1.0 | ⭐️⭐️⭐️ |
+| 2 | [测试函数运行时间的装饰器](md/136.md) | decorator | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 3 | [统计异常次数装饰器](md/137.md) | decorator,nonlocal | V1.5 | ⭐️⭐️⭐️⭐️ |
+| 4 | [定制递减迭代器](md/139.md) | Iterator | V3.0 | ⭐️⭐️⭐️⭐️ |
+| 5   | [创建迭代器](md/53.md)      | iter,`__iter__` | V1.5  | ⭐️⭐️⭐️ |
+| 6   | [反向迭代器reversed](md/56.md) | reversed | V1.0 | ⭐️⭐️ |
+| 7   | [zip迭代器](md/57.md)     | zip  | V1.5 | ⭐️⭐️⭐️ |
+| 8 | [list分组(生成器版)](md/134.md) | yield,generator | V1.0 | ⭐️⭐️⭐️ |
+| 9 | [列表全展开(生成器版)](md/135.md) | list,yield,generator | V1.0 | ⭐️⭐️⭐️ |
+| 10   | [chain串联小容器为大容器](md/84.md) | itertools,chain | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 11   | [product 使用案例](md/85.md) | product | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 12 | [斐波那契数列前n项](md/126.md) | yield,range | V1.0 | ⭐️⭐️⭐️ |
+
+
+### 绘图
+| 小例子 | 链接                    | 标签              | 版本          | 难度 |
+| ---- | ---------------------------------- | ---- | ---- | ---- |
+| 1 | [turtle绘制奥运五环图](md/140.md) | turtle | V1.0 | ⭐️⭐️⭐️ |
+| 2 | [turtle绘制漫天雪花](md/141.md) | turtle | V1.0 | ⭐️⭐️⭐️ |
+| 3 | [Python词云图](md/142.md) | WordCloud | V1.0 | ⭐️⭐️⭐ |
+| 4 | [Plotly柱状图和折线图](md/143.md) | plotly | V1.0 | ⭐️⭐ |
+| 5 | [seaborn热力图](md/144.md) | seaborn | V1.0 | ⭐️⭐ |
+| 6 | [Pyecharts仪表盘](md/145.md) | pyecharts | V1.0 | ⭐️⭐ |
+| 7 | [Pyecharts漏斗图](md/146.md) | pyecharts | V1.0 | ⭐️⭐ |
+| 8 | [Pyecharts水球图](md/147.md) | pyecharts | V1.0 | ⭐️⭐ |
+| 9 | [Pyecharts饼图](md/148.md) | pyecharts | V1.0 | ⭐️⭐ |
+| 10 | [Pyecharts极坐标图](md/149.md) | pyecharts | V1.0 | ⭐️⭐ |
+| 11 | [Pyecharts词云图](md/150.md) | pyecharts | V1.0 | ⭐️⭐ |
+| 12 | [Pyecharts热力图](md/151.md) | pyecharts | V1.0 | ⭐️⭐ |
+| 13 | [matplotlib绘制动图](md/152.md) | matplotlib | V1.0 | ⭐️⭐ |
+| 14 | [seaborn pairplot图](md/153.md) | seaborn | V1.0 | ⭐️⭐⭐⭐ |
+| 15 | [pyecharts传入Numpy数据绘图失败](md/167.md) | numpy pyecharts  | V1.0 | ⭐️⭐⭐ |
+| 16 | [图像处理包pillow](md/169.md) | pillow  | V1.0 | ⭐️⭐⭐ |
+
+### 数据分析
+| 小例子 | 链接                    | 标签              | 版本          | 难度 |
+| ---- | ---------------------------------- | ---- | ---- | ---- |
+| 1 | [数据分析神器：deepnote](./md/177.md) | deepnote | v1.0 | ⭐️⭐⭐ |
+| 2 | [NumPy 的pad填充方法](md/172.md) | NumPy pad | V1.0 | ⭐️⭐⭐⭐ |
+| 3 | [创建下对角线为1、2、3、4的对角矩阵](md/173.md) | NumPy diag | V1.0 | ⭐️⭐⭐ |
+| 4 | [cut 数据分箱](md/174.md) | Pandas cut | v1.0 | ⭐️⭐⭐ |
+| 5 | [丢弃空值和填充空值](./md/175.md) | Pandas dropna fillna | v1.0 | ⭐️⭐⭐ |
+| 6 | [apply 方法去掉特殊字符](./md/178.md) | pandas apply | v1.0 | ⭐️⭐⭐ |
+| 7 | [使用map对列做特征工程](./md/179.md) | pandas map | v1.0 | ⭐️⭐⭐ |
+| 8 | [category列转数值](./md/180.md) | pandas category | v1.0 | ⭐️⭐⭐ |
+| 9 | [rank排名](./md/181.md) | pandas rank | v1.0 | ⭐️⭐⭐|
+| 10 | [完成数据下采样，调整步长由小时为天](./md/182.md) | pandas resample | v1.0 | ⭐️⭐⭐ |
+| 11 | [如何用 Pandas 快速生成时间序列数据](./md/183.md) | pandas util | v1.0 | ⭐️⭐⭐ |
+| 12 | [如何快速找出 DataFrame 所有列 null 值个数](./md/184.md) | pandas isnull sum | v1.0 | ⭐️⭐⭐ |
+| 13 | [重新排序 DataFrame 的列](./md/185.md) | pandas dataframe | v1.0 | ⭐️⭐⭐ |
+| 14 | [使用 count 统计词条 出现次数](./md/186.md) | pandas count | v1.0 | ⭐️⭐⭐ |
+| 15 | [split 求时分(HH:mm)的分钟差](./md/187.md) | pandas split | v1.0 | ⭐️⭐⭐ |
+| 16 | [melt透视数据小技巧](./md/188.md) | pandas melt | v1.0 | ⭐️⭐⭐ |
+| 17 | [pivot 透视小技巧](./md/189.md) | pandas melt | v1.0 | ⭐️⭐⭐ |
+| 18 | [p随机读取文件的K行，生成N个](./md/190.md) | pandas sample | v1.0 | ⭐️⭐⭐ |
+| 19 | [格式化Pandas的时间列](md/191.md) | pandas apply | v1.0 | ⭐️⭐⭐⭐ |
+
+### 其他常用
+| 小例子 | 链接                    | 标签              | 版本          | 难度 |
+| ---- | ---------------------------------- | ---- | ---- | ---- |
+| 1   | [help 一键帮助](md/51.md)  | help | V1.0 | ⭐️ |
+| 2   | [获取用户输入](md/52.md)     | input | V1.0 | ⭐️ |
+| 3   | [文件读写和mode 取值表](md/54.md) | open,read,write,with,mode | V2.0 | ⭐️⭐️⭐️ |
+| 4   | [operator使用举例](md/58.md) | operator | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 5   | [传输json对象](md/59.md)  | json | V2.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 6   | [获取文件后缀名](md/103.md) | os,splitext | V1.0 | ⭐️⭐️ |
+| 7   | [获取路径中的文件名](md/104.md) | os,split | V1.0 | ⭐️⭐️ |
+| 8   | [批量修改文件后缀](md/105.md) | argparse,listdir | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 9   | [xls批量转换成xlsx](md/106.md) | os,listdir,splitext | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 10   | [获取指定后缀名的文件](md/107.md) | os,listdir,splitext | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 11   | [批量压缩文件](md/108.md) | zipfile | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 12   | [32位加密](md/109.md) | hashlib | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 13   | [年的日历图](md/110.md) | calendar | V1.0 | ⭐️⭐️ |
+| 14   | [判断是否为闰年](md/111.md) | calendar | V1.0 | ⭐️⭐️⭐️ |
+| 15   | [判断月有几天](md/112.md) | calendar,datetime | V1.0 | ⭐️⭐️⭐️ |
+| 16   | [月的第一天](md/113.md) | datetime | V1.0 | ⭐️⭐️ |
+| 17 | [月的最后一天](md/114.md) | calendar,datetime | V1.0 | ⭐️⭐️ |
+| 18 | [获取当前时间](md/115.md) | time,datetime | V1.0 | ⭐️⭐️ |
+| 19 | [字符时间转时间](md/116.md) | time,datetime | V1.0 | ⭐️⭐️ |
+| 20 | [时间转字符时间](md/117.md) | time,datetime | V1.0 | ⭐️⭐️ |
+| 21 | [获得某天后的1~n天](md/133.md) | Calendar,monthrange | V4.0 | ⭐️⭐️⭐️ |
+| 22 | [默认启动主线程](md/118.md) | threading | V1.0 | ⭐️⭐️ |
+| 23 | [创建线程](md/119.md) | threading | V1.0 | ⭐️⭐️ |
+| 24 | [交替获得CPU时间片](md/120.md) | threading | V1.0 | ⭐️⭐️⭐️ |
+| 25 | [多线程抢夺同一个变量](md/121.md) | threading | V1.0 | ⭐️⭐️⭐️ |
+| 26 | [多线程变量竞争引起的问题](md/122.md) | threading | V1.0 | ⭐️⭐️⭐️ |
+| 27 | [多线程锁](md/123.md) | threading,lock | V1.0 | ⭐️⭐️⭐️ |
+| 28 | [时间转数组及常用格式](md/124.md) | time,datetime,format | V1.0 | ⭐️⭐️⭐️ |
+| 29   | [nonlocal用于内嵌函数中](md/21.md) | nonlocal | V2.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 30   | [global 声明全局变量](md/22.md)    | global | V2.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 31 | [共享变量未绑定之坑](md/156.md) | global | V1.0 | ⭐️⭐⭐ |
+| 32 | [优化代码异常输出包](md/168.md) | debugger  | V1.0 | ⭐️⭐⭐ |
+| 33 | [一行代码找到编码](md/170.md) | chardet  | V1.0 | ⭐️⭐⭐ |
+| 34 | [创建SQLite连接](md/192.md) | SQLite | v1.0 | ⭐️⭐⭐⭐ |
+| 35 | [json对象转python对象](md/193.md) | python json | v1.0 | ⭐️⭐⭐⭐ |
+| 36 | [python对象转json对象](md/194.md) | python json | v1.0 | ⭐️⭐⭐⭐ |
+| 37 | [一行代码让 pip 安装加速 100 倍](md/176.md) | pip install | v1.0 | ⭐️⭐⭐ |
+
+
+### 工作常用案例
+| 小例子 | 链接                    | 标签              | 版本          | 难度 |
+| ---- | ---------------------------------- | ---- | ---- | ---- |
+| 1   | [不用else和if实现计算器](md/60.md) | operator | V1.0 | ⭐️⭐️⭐️ |
+| 2   | [去最求平均](md/61.md)      | list,sort,round | V1.0  | ⭐️⭐️⭐️⭐️ |
+| 3   | [打印99乘法表](md/62.md)    | for,range,format | V1.0 | ⭐️⭐️⭐️ |
+| 4   | [递归版flatten函数](md/63.md) | recursion,list,isinstance | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 5   | [列表等分为n份](md/64.md)  | list,ceil | V1.0 | ⭐️⭐️⭐️ |
+| 6   | [压缩列表](md/65.md)       | list,filter | V1.0   | ⭐️⭐️⭐️⭐️ |
+| 7   | [求更长的列表](md/66.md)     | max,lambda | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 8   | [求列表众数](md/67.md)     | max,lambda,count | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 9   | [所有多个列表的最大值](md/68.md) | max,lambda | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 10   | [列表检查重复](md/69.md)     | set  | V1.0 | ⭐️⭐️⭐️ |
+| 11   | [浮点数等差数列](md/71.md)    | range,float | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 12   | [按条件分组](md/72.md)      | lambda | V1.0  | ⭐️⭐️⭐️⭐️ |
+| 13   | [map实现向量运算](md/73.md)  | map,lambda | V1.0 | ⭐️⭐️⭐️ |
+| 14   | [值最大的字典](md/74.md)     | max,lambda | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 15   | [合并两个字典](md/75.md)     | **   | V1.0 | ⭐️⭐️⭐️ |
+| 16   | [Topn 字典](md/76.md)    | heapq,nlargest | V1.0 | ⭐️⭐️⭐️ |
+| 17   | [判断是否为异位词](md/77.md) | collections,Counter | V1.0 | ⭐️⭐️⭐️ |
+| 18   | [逻辑上合并字典](md/78.md) | ChainMap | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 19   | [sample 样本抽样](md/80.md) | random,sample | V1.0 | ⭐️⭐️⭐️ |
+| 20   | [重洗数据集](md/81.md) | shuffle | V1.0 | ⭐️⭐️⭐️ |
+| 21   | [10个均匀分布的坐标点](md/82.md) | random,uniform | V1.0 | ⭐️⭐️⭐️ |
+| 22   | [10个高斯分布的坐标点](md/83.md) | random,gauss | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 23   | [是否互为排序词](md/100.md) | collections,defaultdict | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 24   | [str1是否由str2旋转而来](md/101.md) | str | V1.0 | ⭐️⭐️⭐️ |
+| 25 | [寻找第n次出现位置](md/125.md) | enumerator | V1.0 | ⭐️⭐️⭐️ |
+| 26 | [找出所有重复元素](md/127.md) | calendar,datetime | V1.0 | ⭐️⭐️⭐️⭐️ |
+| 27 | [联合统计次数](md/128.md) | Counter | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 28 | [求两点球面距离](md/199.md) | math asin | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 29 | [获取文件编码](md/200.md) | chardet | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
+| 30 | [格式化json串](md/201.md) | json | V1.0 | ⭐️⭐️⭐️⭐️⭐️ |
 
-# 显示二维码
-img.show()
-```
 
-生成一个orange的二维码：
 
-![](https://imgkr2.cn-bj.ufileos.com/cbd26fd8-27cf-4630-935f-6896822ce483.png?UCloudPublicKey=TOKEN_8d8b72be-579a-4e83-bfd0-5f6ce1546f13&Signature=uy1r24x%252Fp5QpI5Wy10Ebdaz%252BpLM%253D&Expires=1603544681)
 
-更多样式，大家可以自己去玩耍。
diff --git a/V3.md b/V3.md
deleted file mode 100644
index f450ca22..00000000
--- a/V3.md
+++ /dev/null
@@ -1,7239 +0,0 @@
-<div align="center">
-<img src="https://img.shields.io/badge/-Python-brightgreen">
-<img src="https://img.shields.io/badge/-%E6%95%B0%E6%8D%AE%E5%88%86%E6%9E%90-yellowgreen">
-<img src="https://img.shields.io/badge/-%E7%AE%97%E6%B3%95-yellow">
-<img src="https://img.shields.io/badge/-%E6%9C%BA%E5%99%A8%E5%AD%A6%E4%B9%A0-lightgrey">
-<a href="https://static01.imgkr.com/temp/c6e10a16c4764dcdb32587760f6769ec.png" width="28%"><img src="https://img.shields.io/badge/%E5%85%AC%E4%BC%97%E5%8F%B7-Python%E5%B0%8F%E4%BE%8B%E5%AD%90-orange"></a>
-</div>
-<br>
-
-<div align="center">
-<img src="https://static01.imgkr.com/temp/9785963f175849d7bbc8a49ea02a54ea.png" width="20%" />
-</div>
-
-## 介绍
-
-告别枯燥，告别枯燥，致力于打造 Python 经典小例子、小案例。 
-
-<div align="center">
-<img src="https://static01.imgkr.com/temp/f379139a2c5d463799c35c1aa68911d7.png" width="23%"/>
-</div>
-
-<br>
-
-<br>
-
-如果转载本库小例子、小案例，请备注下方链接：
-
-Python小例子 https://github.com/jackzhenguo/python-small-examples
-
-
-
-## 贡献
-
-欢迎贡献小例子到此库
-
-## License
-
-允许按照要求转载，但禁止用于任何商用目的。
-
-## 小例子
-
-###  一、 数字
-
-#### 1 求绝对值
-
-绝对值或复数的模
-
-```python
-In [1]: abs(-6)                                                                 
-Out[1]: 6
-```
-
-#### 2  进制转化
-
-十进制转换为二进制：
-```python
-In [2]: bin(10)                                                                 
-Out[2]: '0b1010'
-```
-
-十进制转换为八进制：
-```python
-In [3]: oct(9)                                                                  
-Out[3]: '0o11'
-```
-
-十进制转换为十六进制：
-```python
-In [4]: hex(15)                                                                 
-Out[4]: '0xf'
-```
-
-#### 3 整数和ASCII互转
-
-十进制整数对应的`ASCII字符`
-```python
-In [1]: chr(65)
-Out[1]: 'A'
-```
-
-查看某个`ASCII字符`对应的十进制数
-```python
-In [1]: ord('A')
-Out[1]: 65
-```
-
-#### 4 元素都为真检查
-所有元素都为真，返回 `True`，否则为`False`
-```python
-In [5]: all([1,0,3,6])                                                          
-Out[5]: False
-```
-```python
-In [6]: all([1,2,3])                                                            
-Out[6]: True
-```
-
-#### 5 元素至少一个为真检查　
-至少有一个元素为真返回`True`，否则`False`
-```python
-In [7]: any([0,0,0,[]])                                                         
-Out[7]: False
-```
-
-```python
-In [8]: any([0,0,1])                                                            
-Out[8]: True
-```
-
-#### 6 判断是真是假　　
-测试一个对象是True, 还是False.
-```python
-In [9]: bool([0,0,0])                                                           
-Out[9]: True
-
-In [10]: bool([])                                                               
-Out[10]: False
-
-In [11]: bool([1,0,1])                                                          
-Out[11]: True
-```
-
-#### 7  创建复数
-
-创建一个复数
-
-```python
-In [1]: complex(1,2)
-Out[1]: (1+2j)
-```
-
-#### 8 取商和余数　　
-
-分别取商和余数
-
-```python
-In [1]: divmod(10,3)
-Out[1]: (3, 1)
-```
-
-#### 9 转为浮点类型　
-
-将一个整数或数值型字符串转换为浮点数
-
-```python
-In [1]: float(3)
-Out[1]: 3.0
-```
-
-如果不能转化为浮点数，则会报`ValueError`:
-
-```python
-In [2]: float('a')
-# ValueError: could not convert string to float: 'a'
-```
-
-#### 10  转为整型　　
-
-int(x, base =10) , x可能为字符串或数值，将x 转换为一个普通整数。如果参数是字符串，那么它可能包含符号和小数点。如果超出了普通整数的表示范围，一个长整数被返回。  
-
-```python
-In [1]: int('12',16)
-Out[1]: 18
-```
-
-#### 11 次幂
-
-base为底的exp次幂，如果mod给出，取余
-
-```python
-In [1]: pow(3, 2, 4)
-Out[1]: 1
-```
-
-#### 12 四舍五入
-
-四舍五入，`ndigits`代表小数点后保留几位：
-
-```python
-In [11]: round(10.0222222, 3)
-Out[11]: 10.022
-
-In [12]: round(10.05,1)
-Out[12]: 10.1
-```
-
-#### 13 链式比较
-
-```python
-i = 3
-print(1 < i < 3)  # False
-print(1 < i <= 3)  # True
-```
-
-###  二、 字符串
-
-#### 14  字符串转字节　　
-字符串转换为字节类型
-
-```python
-In [12]: s = "apple"                                                            
-
-In [13]: bytes(s,encoding='utf-8')                                              
-Out[13]: b'apple'
-```
-
-#### 15 任意对象转为字符串　　
-```python
-In [14]: i = 100                                                                
-
-In [15]: str(i)                                                                 
-Out[15]: '100'
-
-In [16]: str([])                                                                
-Out[16]: '[]'
-
-In [17]: str(tuple())                                                           
-Out[17]: '()'
-```
-
-#### 16 执行字符串表示的代码
-
-将字符串编译成python能识别或可执行的代码，也可以将文字读成字符串再编译。
-
-```python
-In [1]: s  = "print('helloworld')"
-    
-In [2]: r = compile(s,"<string>", "exec")
-    
-In [3]: r
-Out[3]: <code object <module> at 0x0000000005DE75D0, file "<string>", line 1>
-    
-In [4]: exec(r)
-helloworld
-```
-#### 17 计算表达式
-
-将字符串str 当成有效的表达式来求值并返回计算结果取出字符串中内容
-
-```python
-In [1]: s = "1 + 3 +5"
-    ...: eval(s)
-    ...:
-Out[1]: 9
-```
-
-#### 18 字符串格式化　
-
-格式化输出字符串，format(value, format_spec)实质上是调用了value的__format__(format_spec)方法。
-
-```
-In [104]: print("i am {0},age{1}".format("tom",18))
-i am tom,age18
-```
-
-| 3.1415926  | {:.2f}  | 3.14      | 保留小数点后两位             |
-| ---------- | ------- | --------- | ---------------------------- |
-| 3.1415926  | {:+.2f} | +3.14     | 带符号保留小数点后两位       |
-| -1         | {:+.2f} | -1.00     | 带符号保留小数点后两位       |
-| 2.71828    | {:.0f}  | 3         | 不带小数                     |
-| 5          | {:0>2d} | 05        | 数字补零 (填充左边, 宽度为2) |
-| 5          | {:x<4d} | 5xxx      | 数字补x (填充右边, 宽度为4)  |
-| 10         | {:x<4d} | 10xx      | 数字补x (填充右边, 宽度为4)  |
-| 1000000    | {:,}    | 1,000,000 | 以逗号分隔的数字格式         |
-| 0.25       | {:.2%}  | 25.00%    | 百分比格式                   |
-| 1000000000 | {:.2e}  | 1.00e+09  | 指数记法                     |
-| 18         | {:>10d} | ' 18'     | 右对齐 (默认, 宽度为10)      |
-| 18         | {:<10d} | '18 '     | 左对齐 (宽度为10)            |
-| 18         | {:^10d} | ' 18 '    | 中间对齐 (宽度为10)          |
-
-### 三、 函数
-
-#### 19 拿来就用的排序函数
-
-排序：
-
-```python
-In [1]: a = [1,4,2,3,1]
-
-In [2]: sorted(a,reverse=True)
-Out[2]: [4, 3, 2, 1, 1]
-
-In [3]: a = [{'name':'xiaoming','age':18,'gender':'male'},{'name':'
-     ...: xiaohong','age':20,'gender':'female'}]
-In [4]: sorted(a,key=lambda x: x['age'],reverse=False)
-Out[4]:
-[{'name': 'xiaoming', 'age': 18, 'gender': 'male'},
- {'name': 'xiaohong', 'age': 20, 'gender': 'female'}]
-```
-
-#### 20 求和函数
-
-求和：
-
-```python
-In [181]: a = [1,4,2,3,1]
-
-In [182]: sum(a)
-Out[182]: 11
-
-In [185]: sum(a,10) #求和的初始值为10
-Out[185]: 21
-```
-
-#### 21 nonlocal用于内嵌函数中
-
-关键词`nonlocal`常用于函数嵌套中，声明变量`i`为非局部变量；
-如果不声明，`i+=1`表明`i`为函数`wrapper`内的局部变量，因为在`i+=1`引用(reference)时,i未被声明，所以会报`unreferenced variable`的错误。
-
-```python
-def excepter(f):
-    i = 0
-    t1 = time.time()
-    def wrapper(): 
-        try:
-            f()
-        except Exception as e:
-            nonlocal i
-            i += 1
-            print(f'{e.args[0]}: {i}')
-            t2 = time.time()
-            if i == n:
-                print(f'spending time:{round(t2-t1,2)}')
-    return wrapper
-```
-
-#### 22 global 声明全局变量
-
-先回答为什么要有`global`，一个变量被多个函数引用，想让全局变量被所有函数共享。有的伙伴可能会想这还不简单，这样写：
-
-```python
-i = 5
-def f():
-    print(i)
-
-def g():
-    print(i)
-    pass
-
-f()
-g()
-
-```
-
-f和g两个函数都能共享变量`i`，程序没有报错，所以他们依然不明白为什么要用`global`.
-
-但是，如果我想要有个函数对`i`递增，这样：
-
-```python
-def h():
-    i += 1
-
-h()
-```
-
-此时执行程序，bang, 出错了！ 抛出异常：`UnboundLocalError`，原来编译器在解释`i+=1`时会把`i`解析为函数`h()`内的局部变量，很显然在此函数内，编译器找不到对变量`i`的定义，所以会报错。
-
-`global`就是为解决此问题而被提出，在函数h内，显式地告诉编译器`i`为全局变量，然后编译器会在函数外面寻找`i`的定义，执行完`i+=1`后，`i`还为全局变量，值加1：
-
-```python
-i = 0
-def h():
-    global i
-    i += 1
-
-h()
-print(i)
-```
-
-#### 23 交换两元素
-
-```python
-def swap(a, b):
-    return b, a
-
-
-print(swap(1, 0))  # (0,1)
-```
-
-#### 24 操作函数对象
-
-```python
-In [31]: def f():
-    ...:     print('i\'m f')
-    ...:
-
-In [32]: def g():
-    ...:     print('i\'m g')
-    ...:
-
-In [33]: [f,g][1]()
-i'm g
-```
-
-创建函数对象的list，根据想要调用的index，方便统一调用。
-
-#### 25 生成逆序序列
-
-```python
-list(range(10,-1,-1)) # [10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0]
-```
-
-第三个参数为负时，表示从第一个参数开始递减，终止到第二个参数(不包括此边界)
-
-#### 26 函数的五类参数使用例子
-
-python五类参数：位置参数，关键字参数，默认参数，可变位置或关键字参数的使用。
-
-```python
-def f(a,*b,c=10,**d):
-  print(f'a:{a},b:{b},c:{c},d:{d}')
-```
-
-*默认参数`c`不能位于可变关键字参数`d`后.*
-
-调用f:
-
-```python
-In [10]: f(1,2,5,width=10,height=20)
-a:1,b:(2, 5),c:10,d:{'width': 10, 'height': 20}
-```
-
-可变位置参数`b`实参后被解析为元组`(2,5)`;而c取得默认值10; d被解析为字典.
-
-再次调用f:
-
-```python
-In [11]: f(a=1,c=12)
-a:1,b:(),c:12,d:{}
-```
-
-a=1传入时a就是关键字参数，b,d都未传值，c被传入12，而非默认值。
-
-注意观察参数`a`, 既可以`f(1)`,也可以`f(a=1)` 其可读性比第一种更好，建议使用f(a=1)。如果要强制使用`f(a=1)`，需要在前面添加一个**星号**:
-
-```python
-def f(*,a,**b):
-  print(f'a:{a},b:{b}')
-```
-
-此时f(1)调用，将会报错：`TypeError: f() takes 0 positional arguments but 1 was given`
-
-只能`f(a=1)`才能OK.
-
-说明前面的`*`发挥作用，它变为只能传入关键字参数，那么如何查看这个参数的类型呢？借助python的`inspect`模块：
-
-```python
-In [22]: for name,val in signature(f).parameters.items():
-    ...:     print(name,val.kind)
-    ...:
-a KEYWORD_ONLY
-b VAR_KEYWORD
-```
-
-可看到参数`a`的类型为`KEYWORD_ONLY`，也就是仅仅为关键字参数。
-
-但是，如果f定义为：
-
-```python
-def f(a,*b):
-  print(f'a:{a},b:{b}')
-```
-
-查看参数类型：
-
-```python
-In [24]: for name,val in signature(f).parameters.items():
-    ...:     print(name,val.kind)
-    ...:
-a POSITIONAL_OR_KEYWORD
-b VAR_POSITIONAL
-```
-
-可以看到参数`a`既可以是位置参数也可是关键字参数。
-
-#### 27  使用slice对象
-
-生成关于蛋糕的序列cake1：
-
-```python
-In [1]: cake1 = list(range(5,0,-1))
-
-In [2]: b = cake1[1:10:2]
-
-In [3]: b
-Out[3]: [4, 2]
-
-In [4]: cake1
-Out[4]: [5, 4, 3, 2, 1]
-```
-
-再生成一个序列：
-
-```python
-In [5]: from random import randint
-   ...: cake2 = [randint(1,100) for _ in range(100)]
-   ...: # 同样以间隔为2切前10个元素，得到切片d
-   ...: d = cake2[1:10:2]
-In [6]: d
-Out[6]: [75, 33, 63, 93, 15]
-```
-
-你看，我们使用同一种切法，分别切开两个蛋糕cake1,cake2. 后来发现这种切法`极为经典`，又拿它去切更多的容器对象。
-
-那么，为什么不把这种切法封装为一个对象呢？于是就有了slice对象。
-
-定义slice对象极为简单，如把上面的切法定义成slice对象：
-
-```python
-perfect_cake_slice_way = slice(1,10,2)
-#去切cake1
-cake1_slice = cake1[perfect_cake_slice_way] 
-cake2_slice = cake2[perfect_cake_slice_way]
-
-In [11]: cake1_slice
-Out[11]: [4, 2]
-
-In [12]: cake2_slice
-Out[12]: [75, 33, 63, 93, 15]
-```
-
-与上面的结果一致。
-
-对于逆向序列切片，`slice`对象一样可行：
-
-```python
-a = [1,3,5,7,9,0,3,5,7]
-a_ = a[5:1:-1]
-
-named_slice = slice(5,1,-1)
-a_slice = a[named_slice] 
-
-In [14]: a_
-Out[14]: [0, 9, 7, 5]
-
-In [15]: a_slice
-Out[15]: [0, 9, 7, 5]
-```
-
-频繁使用同一切片的操作可使用slice对象抽出来，复用的同时还能提高代码可读性。
-
-#### 28  lambda 函数的动画演示
-
-有些读者反映，`lambda`函数不太会用，问我能不能解释一下。
-
-比如，下面求这个 `lambda`函数：
-
-```python
-def max_len(*lists):
-    return max(*lists, key=lambda v: len(v))
-```
-
-有两点疑惑：
-
-- 参数`v`的取值？ 
-- `lambda`函数有返回值吗？如果有，返回值是多少？
-
-调用上面函数，求出以下三个最长的列表：
-
-```python
-r = max_len([1, 2, 3], [4, 5, 6, 7], [8])
-print(f'更长的列表是{r}')
-```
-
-程序完整运行过程，动画演示如下：
-
-<img src="./img/lambda_show.gif" width="50%"/>
-
-
-结论：
-
-- 参数v的可能取值为`*lists`，也就是 `tuple` 的一个元素。
-
-- `lambda`函数返回值，等于`lambda v`冒号后表达式的返回值。
-
-### 四、 数据结构
-
-#### 29  转为字典　　
-
-创建数据字典
-
-```python
-In [1]: dict()
-Out[1]: {}
-
-In [2]: dict(a='a',b='b')
-Out[2]: {'a': 'a', 'b': 'b'}
-
-In [3]: dict(zip(['a','b'],[1,2]))
-Out[3]: {'a': 1, 'b': 2}
-
-In [4]: dict([('a',1),('b',2)])
-Out[4]: {'a': 1, 'b': 2}
-```
-
-#### 30 冻结集合　　
-
-创建一个不可修改的集合。
-
-```python
-In [1]: frozenset([1,1,3,2,3])
-Out[1]: frozenset({1, 2, 3})
-```
-
-因为不可修改，所以没有像`set`那样的`add`和`pop`方法
-
-#### 31 转为集合类型
-
-返回一个set对象，集合内不允许有重复元素：
-
-```python
-In [159]: a = [1,4,2,3,1]
-
-In [160]: set(a)
-Out[160]: {1, 2, 3, 4}
-```
-
-#### 32 转为切片对象
-
-*class* slice(*start*, *stop*[, *step*])
-
-返回一个表示由 range(start, stop, step) 所指定索引集的 slice对象，它让代码可读性、可维护性变好。
-
-```python
-In [1]: a = [1,4,2,3,1]
-
-In [2]: my_slice_meaning = slice(0,5,2)
-
-In [3]: a[my_slice_meaning]
-Out[3]: [1, 2, 1]
-```
-
-#### 33 转元组
-
- `tuple()` 将对象转为一个不可变的序列类型
-
- ```python
- In [16]: i_am_list = [1,3,5]
- In [17]: i_am_tuple = tuple(i_am_list)
- In [18]: i_am_tuple
- Out[18]: (1, 3, 5)
- ```
-
-### 五、 类和对象
-#### 34 是否可调用　　
-检查对象是否可被调用
-
-```python
-In [1]: callable(str)
-Out[1]: True
-
-In [2]: callable(int)
-Out[2]: True
-```
-
-```python
-In [18]: class Student(): 
-    ...:     def __init__(self,id,name): 
-    ...:         self.id = id 
-    ...:         self.name = name 
-    ...:     def __repr__(self): 
-    ...:         return 'id = '+self.id +', name = '+self.name 
-    ...
-
-In [19]: xiaoming = Student('001','xiaoming')                                   
-
-In [20]: callable(xiaoming)                                                     
-Out[20]: False
-```
-如果能调用`xiaoming()`, 需要重写`Student`类的`__call__`方法：
-
-```python
-In [1]: class Student():
-    ...:     def __init__(self,id,name):
-    ...:         self.id = id
-    ...:         self.name = name
-    ...:     def __repr__(self):
-    ...:         return 'id = '+self.id +', name = '+self.name
-    ...:     def __call__(self):
-    ...:         print('I can be called')
-    ...:         print(f'my name is {self.name}')
-    ...: 
-
-In [2]: t = Student('001','xiaoming')
-
-In [3]: t()
-I can be called
-my name is xiaoming
-```
-
-#### 35  ascii 展示对象　　
-
-调用对象的 `__repr__` 方法，获得该方法的返回值，如下例子返回值为字符串
-
-```python
->>> class Student():
-    def __init__(self,id,name):
-        self.id = id
-        self.name = name
-    def __repr__(self):
-        return 'id = '+self.id +', name = '+self.name
-```
-调用：
-```python
->>> xiaoming = Student(id='1',name='xiaoming')
->>> xiaoming
-id = 1, name = xiaoming
->>> ascii(xiaoming)
-'id = 1, name = xiaoming'
-```
-
-#### 36 类方法　
-
-`classmethod` 装饰器对应的函数不需要实例化，不需要 `self `参数，但第一个参数需要是表示自身类的 cls 参数，可以来调用类的属性，类的方法，实例化对象等。
-
-```python
-In [1]: class Student():
-    ...:     def __init__(self,id,name):
-    ...:         self.id = id
-    ...:         self.name = name
-    ...:     def __repr__(self):
-    ...:         return 'id = '+self.id +', name = '+self.name
-    ...:     @classmethod
-    ...:     def f(cls):
-    ...:         print(cls)
-```
-
-#### 37 动态删除属性　　
-
-删除对象的属性
-
-```python
-In [1]: delattr(xiaoming,'id')
-
-In [2]: hasattr(xiaoming,'id')
-Out[2]: False
-```
-
-
-#### 38 一键查看对象所有方法　
-
-不带参数时返回`当前范围`内的变量、方法和定义的类型列表；带参数时返回`参数`的属性，方法列表。
-
-```python
-In [96]: dir(xiaoming)
-Out[96]:
-['__class__',
- '__delattr__',
- '__dict__',
- '__dir__',
- '__doc__',
- '__eq__',
- '__format__',
- '__ge__',
- '__getattribute__',
- '__gt__',
- '__hash__',
- '__init__',
- '__init_subclass__',
- '__le__',
- '__lt__',
- '__module__',
- '__ne__',
- '__new__',
- '__reduce__',
- '__reduce_ex__',
- '__repr__',
- '__setattr__',
- '__sizeof__',
- '__str__',
- '__subclasshook__',
- '__weakref__',
- 
- 'name']
-```
-
-#### 39 动态获取对象属性　
-
-获取对象的属性
-
-```python
-In [1]: class Student():
-   ...:     def __init__(self,id,name):
-   ...:         self.id = id
-   ...:         self.name = name
-   ...:     def __repr__(self):
-   ...:         return 'id = '+self.id +', name = '+self.name
-
-In [2]: xiaoming = Student(id='001',name='xiaoming')
-In [3]: getattr(xiaoming,'name') # 获取xiaoming这个实例的name属性值
-Out[3]: 'xiaoming'
-```
-
-#### 40 对象是否有这个属性
-
-```python
-In [1]: class Student():
-   ...:     def __init__(self,id,name):
-   ...:         self.id = id
-   ...:         self.name = name
-   ...:     def __repr__(self):
-   ...:         return 'id = '+self.id +', name = '+self.name
-
-In [2]: xiaoming = Student(id='001',name='xiaoming')
-In [3]: hasattr(xiaoming,'name')
-Out[3]: True
-
-In [4]: hasattr(xiaoming,'address')
-Out[4]: False
-```
-
-#### 41 对象门牌号　
-
-返回对象的内存地址
-
-```python
-In [1]: id(xiaoming)
-Out[1]: 98234208
-```
-
-#### 42 isinstance
-
-判断*object*是否为类*classinfo*的实例，是返回true
-
-```python
-In [1]: class Student():
-   ...:     def __init__(self,id,name):
-   ...:         self.id = id
-   ...:         self.name = name
-   ...:     def __repr__(self):
-   ...:         return 'id = '+self.id +', name = '+self.name
-
-In [2]: xiaoming = Student(id='001',name='xiaoming')
-
-In [3]: isinstance(xiaoming,Student)
-Out[3]: True
-```
-
-#### 43 父子关系鉴定
-
-```python
-In [1]: class undergraduate(Student):
-    ...:     def studyClass(self):
-    ...:         pass
-    ...:     def attendActivity(self):
-    ...:         pass
-
-In [2]: issubclass(undergraduate,Student)
-Out[2]: True
-
-In [3]: issubclass(object,Student)
-Out[3]: False
-
-In [4]: issubclass(Student,object)
-Out[4]: True
-```
-
-如果class是classinfo元组中某个元素的子类，也会返回True
-
-```python
-In [1]: issubclass(int,(int,float))
-Out[1]: True
-```
-
-#### 44 所有对象之根
-
-object 是所有类的基类
-
-```python
-In [1]: o = object()
-
-In [2]: type(o)
-Out[2]: object
-```
-
-#### 45  创建属性的两种方式
-
-返回 property 属性，典型的用法：
-
-```python
-class C:
-    def __init__(self):
-        self._x = None
-
-    def getx(self):
-        return self._x
-
-    def setx(self, value):
-        self._x = value
-
-    def delx(self):
-        del self._x
-    # 使用property类创建 property 属性
-    x = property(getx, setx, delx, "I'm the 'x' property.")
-```
-
-使用python装饰器，实现与上完全一样的效果代码：
-
-```python
-class C:
-    def __init__(self):
-        self._x = None
-
-    @property
-    def x(self):
-        return self._x
-
-    @x.setter
-    def x(self, value):
-        self._x = value
-
-    @x.deleter
-    def x(self):
-        del self._x
-```
-
-#### 46 查看对象类型
-
-*class* `type`(*name*, *bases*, *dict*)
-
-传入一个参数时，返回 *object* 的类型：
-
-```python
-In [1]: class Student():
-   ...:     def __init__(self,id,name):
-   ...:         self.id = id
-   ...:         self.name = name
-   ...:     def __repr__(self):
-   ...:         return 'id = '+self.id +', name = '+self.name
-   ...: 
-
-In [2]: xiaoming = Student(id='001',name='xiaoming')
-In [3]: type(xiaoming)
-Out[3]: __main__.Student
-
-In [4]: type(tuple())
-Out[4]: tuple
-```
-
-#### 47 元类
-
-`xiaoming`, `xiaohong`, `xiaozhang` 都是学生，这类群体叫做 `Student`. 
-
-Python 定义类的常见方法，使用关键字 `class`
-
-```python
-In [36]: class Student(object):
-    ...:     pass
-```
-
-`xiaoming`, `xiaohong`, `xiaozhang` 是类的实例，则：
-
-```python
-xiaoming = Student()
-xiaohong = Student()
-xiaozhang = Student()
-```
-
-创建后，xiaoming 的 `__class__` 属性，返回的便是 `Student`类
-
-```python
-In [38]: xiaoming.__class__
-Out[38]: __main__.Student
-```
-
-问题在于，`Student` 类有 `__class__`属性，如果有，返回的又是什么？
-
-```python
-In [39]: xiaoming.__class__.__class__
-Out[39]: type
-```
-
-哇，程序没报错，返回 `type`
-
-那么，我们不妨猜测：`Student` 类，类型就是 `type`
-
-换句话说，`Student`类就是一个**对象**，它的类型就是 `type`
-
-所以，Python 中一切皆对象，**类也是对象**
-
-Python 中，将描述 `Student` 类的类被称为：元类。
-
-按照此逻辑延伸，描述元类的类被称为：*元元类*，开玩笑了~ 描述元类的类也被称为元类。
-
-聪明的朋友会问了，既然 `Student` 类可创建实例，那么 `type` 类可创建实例吗？ 如果能，它创建的实例就叫：类 了。 你们真聪明！
-
-说对了，`type` 类一定能创建实例，比如 `Student` 类了。
-
-```python
-In [40]: Student = type('Student',(),{})
-
-In [41]: Student
-Out[41]: __main__.Student
-```
-
-它与使用 `class` 关键字创建的 `Student` 类一模一样。
-
-Python 的类，因为又是对象，所以和 `xiaoming`，`xiaohong` 对象操作相似。支持：
-
-- 赋值
-- 拷贝
-- 添加属性
-- 作为函数参数
-
-```python
-In [43]: StudentMirror = Student # 类直接赋值 # 类直接赋值
-In [44]: Student.class_property = 'class_property' # 添加类属性
-In [46]: hasattr(Student, 'class_property')
-Out[46]: True
-```
-
-元类，确实使用不是那么多，也许先了解这些，就能应付一些场合。就连 Python 界的领袖 `Tim Peters` 都说：
-
-“元类就是深度的魔法，99%的用户应该根本不必为此操心。
-
-### 六、工具
-
-#### 48 枚举对象　　
-
-返回一个可以枚举的对象，该对象的next()方法将返回一个元组。
-
-```python
-In [1]: s = ["a","b","c"]
-    ...: for i ,v in enumerate(s,1):
-    ...:     print(i,v)
-    ...:
-1 a
-2 b
-3 c
-```
-
-#### 49 查看变量所占字节数
-
-```python
-In [1]: import sys
-
-In [2]: a = {'a':1,'b':2.0}
-
-In [3]: sys.getsizeof(a) # 占用240个字节
-Out[3]: 240
-```
-
-#### 50 过滤器　　
-
-在函数中设定过滤条件，迭代元素，保留返回值为`True`的元素：
-
-```python
-In [1]: fil = filter(lambda x: x>10,[1,11,2,45,7,6,13])
-
-In [2]: list(fil)
-Out[2]: [11, 45, 13]
-```
-
-#### 51 返回对象的哈希值　　
-
-返回对象的哈希值，值得注意的是自定义的实例都是可哈希的，`list`, `dict`, `set`等可变对象都是不可哈希的(unhashable)
-
-  ```python
-In [1]: hash(xiaoming)
-Out[1]: 6139638
-
-In [2]: hash([1,2,3])
-# TypeError: unhashable type: 'list'
-  ```
-
-#### 52  一键帮助　
-
-返回对象的帮助文档
-
-```python
-In [1]: help(xiaoming)
-Help on Student in module __main__ object:
-
-class Student(builtins.object)
- |  Methods defined here:
- |
- |  __init__(self, id, name)
- |
- |  __repr__(self)
- |
- |  Data descriptors defined here:
- |
- |  __dict__
- |      dictionary for instance variables (if defined)
- |
- |  __weakref__
- |      list of weak references to the object (if defined)
-```
-
-### 53 获取用户输入　
-
-获取用户输入内容
-
-```python
-In [1]: input()
-aa
-Out[1]: 'aa'
-```
-
-#### 54 创建迭代器类型
-
-使用`iter(obj, sentinel)`, 返回一个可迭代对象, sentinel可省略(一旦迭代到此元素，立即终止)
-
-```python
-In [1]: lst = [1,3,5]
-
-In [2]: for i in iter(lst):
-    ...:     print(i)
-    ...:
-1
-3
-5
-```
-
-```python
-In [1]: class TestIter(object):
-    ...:     def __init__(self):
-    ...:         self.l=[1,3,2,3,4,5]
-    ...:         self.i=iter(self.l)
-    ...:     def __call__(self):  #定义了__call__方法的类的实例是可调用的
-    ...:         item = next(self.i)
-    ...:         print ("__call__ is called,fowhich would return",item)
-    ...:         return item
-    ...:     def __iter__(self): #支持迭代协议(即定义有__iter__()函数)
-    ...:         print ("__iter__ is called!!")
-    ...:         return iter(self.l)
-In [2]: t = TestIter()
-In [3]: t() # 因为实现了__call__，所以t实例能被调用
-__call__ is called,which would return 1
-Out[3]: 1
-
-In [4]: for e in TestIter(): # 因为实现了__iter__方法，所以t能被迭代
-    ...:     print(e)
-    ...: 
-__iter__ is called!!
-1
-3
-2
-3
-4
-5
-```
-
-#### 55 打开文件
-
-返回文件对象
-
-```python
-In [1]: fo = open('D:/a.txt',mode='r', encoding='utf-8')
-
-In [2]: fo.read()
-Out[2]: '\ufefflife is not so long,\nI use Python to play.'
-```
-
-mode取值表：
-
-| 字符  | 意义                             |
-| :---- | :------------------------------- |
-| `'r'` | 读取（默认）                     |
-| `'w'` | 写入，并先截断文件               |
-| `'x'` | 排它性创建，如果文件已存在则失败 |
-| `'a'` | 写入，如果文件存在则在末尾追加   |
-| `'b'` | 二进制模式                       |
-| `'t'` | 文本模式（默认）                 |
-| `'+'` | 打开用于更新（读取与写入）       |
-
-#### 56 创建range序列
-
-1) range(stop)
-2) range(start, stop[,step])
-
-生成一个不可变序列：
-
-```python
-In [1]: range(11)
-Out[1]: range(0, 11)
-
-In [2]: range(0,11,1)
-Out[2]: range(0, 11)
-```
-
-#### 57 反向迭代器
-
-```python
-In [1]: rev = reversed([1,4,2,3,1])
-
-In [2]: for i in rev:
-     ...:     print(i)
-     ...:
-1
-3
-2
-4
-1
-```
-
-#### 58 聚合迭代器
-
-创建一个聚合了来自每个可迭代对象中的元素的迭代器：
-
-```python
-In [1]: x = [3,2,1]
-In [2]: y = [4,5,6]
-In [3]: list(zip(y,x))
-Out[3]: [(4, 3), (5, 2), (6, 1)]
-
-In [4]: a = range(5)
-In [5]: b = list('abcde')
-In [6]: b
-Out[6]: ['a', 'b', 'c', 'd', 'e']
-In [7]: [str(y) + str(x) for x,y in zip(a,b)]
-Out[7]: ['a0', 'b1', 'c2', 'd3', 'e4']
-```
-
-#### 59 链式操作
-
-```python
-from operator import (add, sub)
-
-
-def add_or_sub(a, b, oper):
-    return (add if oper == '+' else sub)(a, b)
-
-
-add_or_sub(1, 2, '-')  # -1
-```
-
-#### 60 对象序列化
-
-对象序列化，是指将内存中的对象转化为可存储或传输的过程。很多场景，直接一个类对象，传输不方便。
-
-但是，当对象序列化后，就会更加方便，因为约定俗成的，接口间的调用或者发起的 web 请求，一般使用 json 串传输。
-
-实际使用中，一般对类对象序列化。先创建一个 Student 类型，并创建两个实例。
-
-```python
-class Student():
-    def __init__(self,**args):
-        self.ids = args['ids']
-        self.name = args['name']
-        self.address = args['address']
-xiaoming = Student(ids = 1,name = 'xiaoming',address = '北京')
-xiaohong = Student(ids = 2,name = 'xiaohong',address = '南京')
-```
-
-导入 json 模块，调用 dump 方法，就会将列表对象 [xiaoming,xiaohong]，序列化到文件 json.txt 中。
-
-```python
-import json
-
-with open('json.txt', 'w') as f:
-    json.dump([xiaoming,xiaohong], f, default=lambda obj: obj.__dict__, ensure_ascii=False, indent=2, sort_keys=True)
-```
-
-生成的文件内容，如下：
-
-```json
-[
-    {
-        "address":"北京",
-        "ids":1,
-        "name":"xiaoming"
-    },
-    {
-        "address":"南京",
-        "ids":2,
-        "name":"xiaohong"
-    }
-]
-```
-
-### 七、 小案例
-
-
-#### 61 不用else和if实现计算器
-
-```python
-from operator import *
-
-
-def calculator(a, b, k):
-    return {
-        '+': add,
-        '-': sub,
-        '*': mul,
-        '/': truediv,
-        '**': pow
-    }[k](a, b)
-
-
-calculator(1, 2, '+')  # 3
-calculator(3, 4, '**')  # 81
-```
-
-#### 62 去最求平均
-
-```python
-def score_mean(lst):
-    lst.sort()
-    lst2=lst[1:(len(lst)-1)]
-    return round((sum(lst2)/len(lst2)),1)
-
-lst=[9.1, 9.0,8.1, 9.7, 19,8.2, 8.6,9.8]
-score_mean(lst) # 9.1
-```
-
-#### 63 打印99乘法表
-
-打印出如下格式的乘法表
-
-```python
-1*1=1
-1*2=2   2*2=4
-1*3=3   2*3=6   3*3=9
-1*4=4   2*4=8   3*4=12  4*4=16
-1*5=5   2*5=10  3*5=15  4*5=20  5*5=25
-1*6=6   2*6=12  3*6=18  4*6=24  5*6=30  6*6=36
-1*7=7   2*7=14  3*7=21  4*7=28  5*7=35  6*7=42  7*7=49
-1*8=8   2*8=16  3*8=24  4*8=32  5*8=40  6*8=48  7*8=56  8*8=64
-1*9=9   2*9=18  3*9=27  4*9=36  5*9=45  6*9=54  7*9=63  8*9=72  9*9=81
-```
-
-一共有10 行，第`i`行的第`j`列等于：`j*i`，
-
-其中,
-
- `i`取值范围：`1<=i<=9`
-
- `j`取值范围：`1<=j<=i`
-
-根据`例子分析`的语言描述，转化为如下代码：
-
-```python
-for i in range(1, 10):
-    for j in range(1, i+1):
-        print('%d * %d = %d' % (j, i, j * i) , end="\t")
-    print()
-```
-
-#### 64 全展开
-
-对于如下数组：
-
-```
-[[[1,2,3],[4,5]]]
-```
-
-如何完全展开成一维的。这个小例子实现的`flatten`是递归版，两个参数分别表示带展开的数组，输出数组。
-
-```python
-from collections.abc import *
-
-def flatten(lst, out_lst=None):
-    if out_lst is None:
-        out_lst = []
-    for i in lst:
-        if isinstance(i, Iterable): # 判断i是否可迭代
-            flatten(i, out_lst)  # 尾数递归
-        else:
-            out_lst.append(i)    # 产生结果
-    return out_lst
-```
-
-调用`flatten`:
-
-```python
-print(flatten([[1,2,3],[4,5]]))
-print(flatten([[1,2,3],[4,5]], [6,7]))
-print(flatten([[[1,2,3],[4,5,6]]]))
-# 结果：
-[1, 2, 3, 4, 5]
-[6, 7, 1, 2, 3, 4, 5]
-[1, 2, 3, 4, 5, 6]
-```
-
-numpy里的`flatten`与上面的函数实现有些微妙的不同：
-
-```python
-import numpy
-b = numpy.array([[1,2,3],[4,5]])
-b.flatten()
-array([list([1, 2, 3]), list([4, 5])], dtype=object)
-```
-
-#### 65 列表等分
-
-```python
-from math import ceil
-
-def divide(lst, size):
-    if size <= 0:
-        return [lst]
-    return [lst[i * size:(i+1)*size] for i in range(0, ceil(len(lst) / size))]
-
-
-r = divide([1, 3, 5, 7, 9], 2)
-print(r)  # [[1, 3], [5, 7], [9]]
-
-r = divide([1, 3, 5, 7, 9], 0)
-print(r)  # [[1, 3, 5, 7, 9]]
-
-r = divide([1, 3, 5, 7, 9], -3)
-print(r)  # [[1, 3, 5, 7, 9]]
-
-```
-
-#### 66 列表压缩
-
-```python
-def filter_false(lst):
-    return list(filter(bool, lst))
-
-
-r = filter_false([None, 0, False, '', [], 'ok', [1, 2]])
-print(r)  # ['ok', [1, 2]]
-
-```
-
-#### 67 更长列表
-
-```python
-def max_length(*lst):
-    return max(*lst, key=lambda v: len(v))
-
-
-r = max_length([1, 2, 3], [4, 5, 6, 7], [8])
-print(f'更长的列表是{r}')  # [4, 5, 6, 7]
-
-r = max_length([1, 2, 3], [4, 5, 6, 7], [8, 9])
-print(f'更长的列表是{r}')  # [4, 5, 6, 7]
-```
-
-#### 68 求众数
-
-```python
-def top1(lst):
-    return max(lst, default='列表为空', key=lambda v: lst.count(v))
-
-lst = [1, 3, 3, 2, 1, 1, 2]
-r = top1(lst)
-print(f'{lst}中出现次数最多的元素为:{r}')  # [1, 3, 3, 2, 1, 1, 2]中出现次数最多的元素为:1
-```
-
-#### 69 多表之最
-```python 
-def max_lists(*lst):
-    return max(max(*lst, key=lambda v: max(v)))
-
-
-r = max_lists([1, 2, 3], [6, 7, 8], [4, 5])
-print(r)  # 8
-```
-
-#### 70 列表查重
-
-```python
-def has_duplicates(lst):
-    return len(lst) == len(set(lst))
-
-
-x = [1, 1, 2, 2, 3, 2, 3, 4, 5, 6]
-y = [1, 2, 3, 4, 5]
-has_duplicates(x)  # False
-has_duplicates(y)  # True
-```
-
-
-
-#### 71 列表反转
-
-```python
-def reverse(lst):
-    return lst[::-1]
-
-
-r = reverse([1, -2, 3, 4, 1, 2])
-print(r)  # [2, 1, 4, 3, -2, 1]
-```
-
-#### 72 浮点数等差数列
-
-```python
-def rang(start, stop, n):
-    start,stop,n = float('%.2f' % start), float('%.2f' % stop),int('%.d' % n)
-    step = (stop-start)/n
-    lst = [start]
-    while n > 0:
-        start,n = start+step,n-1
-        lst.append(round((start), 2))
-    return lst
-
-rang(1, 8, 10) # [1.0, 1.7, 2.4, 3.1, 3.8, 4.5, 5.2, 5.9, 6.6, 7.3, 8.0]
-```
-
-#### 73 按条件分组
-
-```python
-def bif_by(lst, f):
-    return [ [x for x in lst if f(x)],[x for x in lst if not f(x)]]
-
-records = [25,89,31,34] 
-bif_by(records, lambda x: x<80) # [[25, 31, 34], [89]]
-```
-
-#### 74 map实现向量运算
-
-```python
-#多序列运算函数—map(function,iterabel,iterable2)
-lst1=[1,2,3,4,5,6]
-lst2=[3,4,5,6,3,2]
-list(map(lambda x,y:x*y+1,lst1,lst2))
-### [4, 9, 16, 25, 16, 13]
-```
-
-#### 75 值最大的字典
-
-```python
-def max_pairs(dic):
-    if len(dic) == 0:
-        return dic
-    max_val = max(map(lambda v: v[1], dic.items()))
-    return [item for item in dic.items() if item[1] == max_val]
-
-
-r = max_pairs({'a': -10, 'b': 5, 'c': 3, 'd': 5})
-print(r)  # [('b', 5), ('d', 5)]
-```
-
-#### 76 合并两个字典
-
-```python
-def merge_dict(dic1, dic2):
-    return {**dic1, **dic2}  # python3.5后支持的一行代码实现合并字典
-
-merge_dict({'a': 1, 'b': 2}, {'c': 3})  # {'a': 1, 'b': 2, 'c': 3}
-```
-
-#### 77 topn字典
-
-```python
-from heapq import nlargest
-
-# 返回字典d前n个最大值对应的键
-
-def topn_dict(d, n):
-    return nlargest(n, d, key=lambda k: d[k])
-
-topn_dict({'a': 10, 'b': 8, 'c': 9, 'd': 10}, 3)  # ['a', 'd', 'c']
-```
-
-
-#### 78 异位词
-
-```python
-from collections import Counter
-
-# 检查两个字符串是否 相同字母异序词，简称：互为变位词
-
-def anagram(str1, str2):
-    return Counter(str1) == Counter(str2)
-
-anagram('eleven+two', 'twelve+one')  # True 这是一对神器的变位词
-anagram('eleven', 'twelve')  # False
-```
-
-#### 79 逻辑上合并字典
-(1) 两种合并字典方法
-这是一般的字典合并写法
-
-```python
-dic1 = {'x': 1, 'y': 2 }
-dic2 = {'y': 3, 'z': 4 }
-merged1 = {**dic1, **dic2} # {'x': 1, 'y': 3, 'z': 4}
-```
-
-修改merged['x']=10，dic1中的x值`不变`，`merged`是重新生成的一个`新字典`。
-
-但是，`ChainMap`却不同，它在内部创建了一个容纳这些字典的列表。因此使用ChainMap合并字典，修改merged['x']=10后，dic1中的x值`改变`，如下所示：
-
-```python
-from collections import ChainMap
-merged2 = ChainMap(dic1,dic2)
-print(merged2) # ChainMap({'x': 1, 'y': 2}, {'y': 3, 'z': 4})
-```
-
-#### 80 命名元组提高可读性
-
-```python
-from collections import namedtuple
-Point = namedtuple('Point', ['x', 'y', 'z'])  # 定义名字为Point的元祖，字段属性有x,y,z
-lst = [Point(1.5, 2, 3.0), Point(-0.3, -1.0, 2.1), Point(1.3, 2.8, -2.5)]
-print(lst[0].y - lst[1].y)
-```
-
-使用命名元组写出来的代码可读性更好，尤其处理上百上千个属性时作用更加凸显。
-
-#### 81 样本抽样
-
-使用`sample`抽样，如下例子从100个样本中随机抽样10个。
-
-```python
-from random import randint,sample
-lst = [randint(0,50) for _ in range(100)]
-print(lst[:5])# [38, 19, 11, 3, 6]
-lst_sample = sample(lst,10)
-print(lst_sample) # [33, 40, 35, 49, 24, 15, 48, 29, 37, 24]
-```
-
-#### 82 重洗数据集
-
-使用`shuffle`用来重洗数据集，**值得注意`shuffle`是对lst就地(in place)洗牌，节省存储空间**
-
-```python
-from random import shuffle
-lst = [randint(0,50) for _ in range(100)]
-shuffle(lst)
-print(lst[:5]) # [50, 3, 48, 1, 26]
-```
-
-#### 83 10个均匀分布的坐标点
-
-random模块中的`uniform(a,b)`生成[a,b)内的一个随机数，如下生成10个均匀分布的二维坐标点
-
-```python
-from random import uniform
-In [1]: [(uniform(0,10),uniform(0,10)) for _ in range(10)]
-Out[1]: 
-[(9.244361194237328, 7.684326645514235),
- (8.129267671737324, 9.988395854203773),
- (9.505278771040661, 2.8650440524834107),
- (3.84320100484284, 1.7687190176304601),
- (6.095385729409376, 2.377133802224657),
- (8.522913365698605, 3.2395995841267844),
- (8.827829601859406, 3.9298809217233766),
- (1.4749644859469302, 8.038753079253127),
- (9.005430657826324, 7.58011186920019),
- (8.700789540392917, 1.2217577293254112)]
-```
-
-#### 84 10个高斯分布的坐标点
-
-random模块中的`gauss(u,sigma)`生成均值为u, 标准差为sigma的满足高斯分布的值，如下生成10个二维坐标点，样本误差(y-2*x-1)满足均值为0，标准差为1的高斯分布：
-
-```python
-from random import gauss
-x = range(10)
-y = [2*xi+1+gauss(0,1) for xi in x]
-points = list(zip(x,y))
-### 10个二维点：
-[(0, -0.86789025305992),
- (1, 4.738439437453464),
- (2, 5.190278040856102),
- (3, 8.05270893133576),
- (4, 9.979481700775292),
- (5, 11.960781766216384),
- (6, 13.025427054303737),
- (7, 14.02384035204836),
- (8, 15.33755823101161),
- (9, 17.565074449028497)]
-```
-
-#### 85 chain高效串联多个容器对象
-
-`chain`函数串联a和b，兼顾内存效率同时写法更加优雅。
-
-```python
-from itertools import chain
-a = [1,3,5,0]
-b = (2,4,6)
-
-for i in chain(a,b):
-  print(i)
-### 结果
-1
-3
-5
-0
-2
-4
-6
-```
-
-#### 86 product 案例
-
-```python
-def product(*args, repeat=1):
-    pools = [tuple(pool) for pool in args] * repeat
-    result = [[]]
-    for pool in pools:
-        result = [x+[y] for x in result for y in pool]
-    for prod in result:
-        yield tuple(prod)
-```
-
-
-调用函数：
-
-```python
-rtn = product('xyz', '12', repeat=3)
-print(list(rtn))
-```
-
-### 二、Python字符串和正则
-
-字符串无所不在，字符串的处理也是最常见的操作。本章节将总结和字符串处理相关的一切操作。主要包括基本的字符串操作；高级字符串操作之正则。目前共有`25`个小例子
-
-#### 91 反转字符串
-
-```python
-st="python"
-#方法1
-''.join(reversed(st))
-#方法2
-st[::-1]
-```
-
-#### 92 字符串切片操作
-
-```python
-字符串切片操作——查找替换3或5的倍数
-In [1]:[str("java"[i%3*4:]+"python"[i%5*6:] or i) for i in range(1,15)]
-OUT[1]:['1',
- '2',
- 'java',
- '4',
- 'python',
- 'java',
- '7',
- '8',
- 'java',
- 'python',
- '11',
- 'java',
- '13',
- '14']
-```
-#### 93 join串联字符串
-```python
-In [4]: mystr = ['1',
-   ...:  '2',
-   ...:  'java',
-   ...:  '4',
-   ...:  'python',
-   ...:  'java',
-   ...:  '7',
-   ...:  '8',
-   ...:  'java',
-   ...:  'python',
-   ...:  '11',
-   ...:  'java',
-   ...:  '13',
-   ...:  '14']
-
-In [5]: ','.join(mystr) #用逗号连接字符串
-Out[5]: '1,2,java,4,python,java,7,8,java,python,11,java,13,14'
-```
-
-#### 94 字符串的字节长度
-
-```python
-def str_byte_len(mystr):
-    return (len(mystr.encode('utf-8')))
-
-
-str_byte_len('i love python')  # 13(个字节)
-str_byte_len('字符')  # 6(个字节)
-```
-
-
-
-**以下是正则部分**
-
-```python
-import re
-```
-
-#### 95 查找第一个匹配串
-
-```python
-s = 'i love python very much'
-pat = 'python' 
-r = re.search(pat,s)
-print(r.span()) #(7,13)
-```
-
-#### 96 查找所有 1 的索引
-
-```python
-s = '山东省潍坊市青州第1中学高三1班'
-pat = '1'
-r = re.finditer(pat,s)
-for i in r:
-    print(i)
-
-# <re.Match object; span=(9, 10), match='1'>
-# <re.Match object; span=(14, 15), match='1'>
-```
-
-#### 97 \d 匹配数字[0-9]
-findall找出全部位置的所有匹配
-```python
-s = '一共20行代码运行时间13.59s'
-pat = r'\d+' # +表示匹配数字(\d表示数字的通用字符)1次或多次
-r = re.findall(pat,s)
-print(r)
-# ['20', '13', '59']
-```
-
-#### 98 匹配浮点数和整数
-
-?表示前一个字符匹配0或1次
-```python
-s = '一共20行代码运行时间13.59s'
-pat = r'\d+\.?\d+' # ?表示匹配小数点(\.)0次或1次，这种写法有个小bug，不能匹配到个位数的整数
-r = re.findall(pat,s)
-print(r)
-# ['20', '13.59']
-
-# 更好的写法：
-pat = r'\d+\.\d+|\d+' # A|B，匹配A失败才匹配B
-```
-#### 99 ^匹配字符串的开头
-
-```python
-s = 'This module provides regular expression matching operations similar to those found in Perl'
-pat = r'^[emrt]' # 查找以字符e,m,r或t开始的字符串
-r = re.findall(pat,s)
-print(r)
-# [],因为字符串的开头是字符`T`，不在emrt匹配范围内，所以返回为空
-IN [11]: s2 = 'email for me is guozhennianhua@163.com'
-re.findall('^[emrt].*',s2)# 匹配以e,m,r,t开始的字符串，后面是多个任意字符
-Out[11]: ['email for me is guozhennianhua@163.com']
-
-```
-#### 100 re.I 忽略大小写
-
-```python
-s = 'That'
-pat = r't' 
-r = re.findall(pat,s,re.I)
-In [22]: r
-Out[22]: ['T', 't']
-```
-#### 101 理解compile的作用
-如果要做很多次匹配，可以先编译匹配串：
-```python
-import re
-pat = re.compile('\W+') # \W 匹配不是数字和字母的字符
-has_special_chars = pat.search('ed#2@edc') 
-if has_special_chars:
-    print(f'str contains special characters:{has_special_chars.group(0)}')
-
-###输出结果: 
- # str contains special characters:#   
-
-### 再次使用pat正则编译对象 做匹配
-again_pattern = pat.findall('guozhennianhua@163.com')
-if '@' in again_pattern:
-    print('possibly it is an email')
-
-```
-
-#### 102 使用()捕获单词，不想带空格
-使用`()`捕获
-```python
-s = 'This module provides regular expression matching operations similar to those found in Perl'
-pat = r'\s([a-zA-Z]+)'  
-r = re.findall(pat,s)
-print(r) #['module', 'provides', 'regular', 'expression', 'matching', 'operations', 'similar', 'to', 'those', 'found', 'in', 'Perl']
-```
-看到提取单词中未包括第一个单词，使用`?`表示前面字符出现0次或1次，但是此字符还有表示贪心或非贪心匹配含义，使用时要谨慎。
-```python
-s = 'This module provides regular expression matching operations similar to those found in Perl'
-pat = r'\s?([a-zA-Z]+)'  
-r = re.findall(pat,s)
-print(r) #['This', 'module', 'provides', 'regular', 'expression', 'matching', 'operations', 'similar', 'to', 'those', 'found', 'in', 'Perl']
-```
-
-#### 103 split分割单词
-使用以上方法分割单词不是简洁的，仅仅是为了演示。分割单词最简单还是使用`split`函数。
-```python
-s = 'This module provides regular expression matching operations similar to those found in Perl'
-pat = r'\s+'  
-r = re.split(pat,s)
-print(r) # ['This', 'module', 'provides', 'regular', 'expression', 'matching', 'operations', 'similar', 'to', 'those', 'found', 'in', 'Perl']
-
-### 上面这句话也可直接使用str自带的split函数：
-s.split(' ') #使用空格分隔
-
-### 但是，对于风格符更加复杂的情况，split无能为力，只能使用正则
-
-s = 'This,,,   module ; \t   provides|| regular ; '
-words = re.split('[,\s;|]+',s)  #这样分隔出来，最后会有一个空字符串
-words = [i for i in words if len(i)>0]
-```
-
-#### 104 match从字符串开始位置匹配
-注意`match`,`search`等的不同：
-1) match函数
-```python
-import re
-### match
-mystr = 'This'
-pat = re.compile('hi')
-pat.match(mystr) # None
-pat.match(mystr,1) # 从位置1处开始匹配
-Out[90]: <re.Match object; span=(1, 3), match='hi'>
-```
-2) search函数
-search是从字符串的任意位置开始匹配
-```python
-In [91]: mystr = 'This'
-    ...: pat = re.compile('hi')
-    ...: pat.search(mystr)
-Out[91]: <re.Match object; span=(1, 3), match='hi'>
-```
-
-#### 105 替换匹配的子串
-`sub`函数实现对匹配子串的替换
-```python
-content="hello 12345, hello 456321"    
-pat=re.compile(r'\d+') #要替换的部分
-m=pat.sub("666",content)
-print(m) # hello 666, hello 666
-```
-
-#### 106 贪心捕获
-(.*)表示捕获任意多个字符，尽可能多的匹配字符
-```python
-content='<h>ddedadsad</h><div>graph</div>bb<div>math</div>cc'
-pat=re.compile(r"<div>(.*)</div>")  #贪婪模式
-m=pat.findall(content)
-print(m) #匹配结果为： ['graph</div>bb<div>math']
-```
-#### 107 非贪心捕获
-仅添加一个问号(`?`)，得到结果完全不同，这是非贪心匹配，通过这个例子体会贪心和非贪心的匹配的不同。
-```python
-content='<h>ddedadsad</h><div>graph</div>bb<div>math</div>cc'
-pat=re.compile(r"<div>(.*?)</div>")
-m=pat.findall(content)
-print(m) # ['graph', 'math']
-```
-非贪心捕获，见好就收。
-
-#### 108 常用元字符总结
-
-    . 匹配任意字符  
-    ^ 匹配字符串开始位置 
-    $ 匹配字符串中结束的位置 
-    * 前面的原子重复0次、1次、多次 
-    ? 前面的原子重复0次或者1次 
-    + 前面的原子重复1次或多次
-    {n} 前面的原子出现了 n 次
-    {n,} 前面的原子至少出现 n 次
-    {n,m} 前面的原子出现次数介于 n-m 之间
-    ( ) 分组,需要输出的部分
-
-#### 109 常用通用字符总结
-
-    \s  匹配空白字符 
-    \w  匹配任意字母/数字/下划线 
-    \W  和小写 w 相反，匹配任意字母/数字/下划线以外的字符
-    \d  匹配十进制数字
-    \D  匹配除了十进制数以外的值 
-    [0-9]  匹配一个0-9之间的数字
-    [a-z]  匹配小写英文字母
-    [A-Z]  匹配大写英文字母
-
-#### 110 密码安全检查
-
-密码安全要求：1)要求密码为6到20位; 2)密码只包含英文字母和数字
-
-```python
-pat = re.compile(r'\w{6,20}') # 这是错误的，因为\w通配符匹配的是字母，数字和下划线，题目要求不能含有下划线
-# 使用最稳的方法：\da-zA-Z满足`密码只包含英文字母和数字`
-pat = re.compile(r'[\da-zA-Z]{6,20}')
-```
-选用最保险的`fullmatch`方法，查看是否整个字符串都匹配：
-```python
-pat.fullmatch('qaz12') # 返回 None, 长度小于6
-pat.fullmatch('qaz12wsxedcrfvtgb67890942234343434') # None 长度大于22
-pat.fullmatch('qaz_231') # None 含有下划线
-pat.fullmatch('n0passw0Rd')
-Out[4]: <re.Match object; span=(0, 10), match='n0passw0Rd'>
-```
-
-#### 111 爬取百度首页标题
-
-```python
-import re
-from urllib import request
-
-#爬虫爬取百度首页内容
-data=request.urlopen("http://www.baidu.com/").read().decode()
-
-#分析网页,确定正则表达式
-pat=r'<title>(.*?)</title>'
-
-result=re.search(pat,data)
-print(result) <re.Match object; span=(1358, 1382), match='<title>百度一下，你就知道</title>'>
-
-result.group() # 百度一下，你就知道
-```
-
-#### 112 批量转化为驼峰格式(Camel)
-
-数据库字段名批量转化为驼峰格式
-
-分析过程
-
-```python
-# 用到的正则串讲解
-# \s 指匹配： [ \t\n\r\f\v]
-# A|B：表示匹配A串或B串
-# re.sub(pattern, newchar, string): 
-# substitue代替，用newchar字符替代与pattern匹配的字符所有.
-```
-
-
-
-```python
-# title(): 转化为大写，例子：
-# 'Hello world'.title() # 'Hello World'
-```
-
-
-
-```python
-# print(re.sub(r"\s|_|", "", "He llo_worl\td"))
-s = re.sub(r"(\s|_|-)+", " ",
-           'some_database_field_name').title().replace(" ", "")  
-#结果： SomeDatabaseFieldName
-```
-
-
-
-```python
-# 可以看到此时的第一个字符为大写，需要转化为小写
-s = s[0].lower()+s[1:]  # 最终结果
-```
-
- 
-
-整理以上分析得到如下代码：
-
-```python
-import re
-def camel(s):
-    s = re.sub(r"(\s|_|-)+", " ", s).title().replace(" ", "")
-    return s[0].lower() + s[1:]
-
-# 批量转化
-def batch_camel(slist):
-    return [camel(s) for s in slist]
-```
-
-测试结果：
-
-```python
-s = batch_camel(['student_id', 'student\tname', 'student-add'])
-print(s)
-# 结果
-['studentId', 'studentName', 'studentAdd']
-```
-
-
-
-#### 113 str1是否为str2的permutation
-
-排序词(permutation)：两个字符串含有相同字符，但字符顺序不同。
-
-```python
-from collections import defaultdict
-
-
-def is_permutation(str1, str2):
-    if str1 is None or str2 is None:
-        return False
-    if len(str1) != len(str2):
-        return False
-    unq_s1 = defaultdict(int)
-    unq_s2 = defaultdict(int)
-    for c1 in str1:
-        unq_s1[c1] += 1
-    for c2 in str2:
-        unq_s2[c2] += 1
-
-    return unq_s1 == unq_s2
-```
-
-这个小例子，使用python内置的`defaultdict`，默认类型初始化为`int`，计数默次数都为0. 这个解法本质是 `hash map lookup`
-
-统计出的两个defaultdict：unq_s1，unq_s2，如果相等，就表明str1、 str2互为排序词。
-
-下面测试：
-
-```python
-r = is_permutation('nice', 'cine')
-print(r)  # True
-
-r = is_permutation('', '')
-print(r)  # True
-
-r = is_permutation('', None)
-print(r)  # False
-
-r = is_permutation('work', 'woo')
-print(r)  # False
-
-```
-
-以上就是使用defaultdict的小例子，希望对读者朋友理解此类型有帮助。
-
-#### 114 str1是否由str2旋转而来
-
-`stringbook`旋转后得到`bookstring`,写一段代码验证`str1`是否为`str2`旋转得到。
-
-**思路**
-
-转化为判断：`str1`是否为`str2+str2`的子串
-
-```python
-def is_rotation(s1: str, s2: str) -> bool:
-    if s1 is None or s2 is None:
-        return False
-    if len(s1) != len(s2):
-        return False
-
-    def is_substring(s1: str, s2: str) -> bool:
-        return s1 in s2
-    return is_substring(s1, s2 + s2)
-```
-
-**测试**
-
-```python
-r = is_rotation('stringbook', 'bookstring')
-print(r)  # True
-
-r = is_rotation('greatman', 'maneatgr')
-print(r)  # False
-```
-
-#### 115 正浮点数
-
-从一系列字符串中，挑选出所有正浮点数。
-
-该怎么办？
-
-玩玩正则表达式，用正则搞它！
-
-关键是，正则表达式该怎么写呢？
-
-有了！
-
-`^[1-9]\d*\.\d*$`
-
-`^` 表示字符串开始
-
-`[1-9]` 表示数字1,2,3,4,5,6,7,8,9
-
-`^[1-9]` 连起来表示以数字 `1-9` 作为开头
-
-`\d` 表示一位 `0-9` 的数字
-
-`*` 表示前一位字符出现 0 次，1 次或多次
-
-`\d*` 表示数字出现 0 次，1 次或多次 
-
-`\.` 表示小数点
-
-`\$` 表示字符串以前一位的字符结束
-
-`^[1-9]\d*\.\d*$` 连起来就求出所有大于 1.0 的正浮点数。
-
-那 0.0 到 1.0 之间的正浮点数，怎么求，干嘛不直接汇总到上面的正则表达式中呢？
-
-这样写不行吗：`^[0-9]\d*\.\d*$`
-
-OK!
-
-那我们立即测试下呗
-
-```python
-In [85]: import re
-
-In [87]: recom = re.compile(r'^[0-9]\d*\.\d*$')
-
-In [88]: recom.match('000.2')
-Out[88]: <re.Match object; span=(0, 5), match='000.2'>
-```
-
-结果显示，正则表达式 `^[0-9]\d*\.\d*$` 竟然匹配到 `000.2 `，认为它是一个正浮点数~~~！！！！
-
-晕！！！！！！
-
-所以知道为啥要先匹配大于 1.0 的浮点数了吧！
-
-如果能写出这个正则表达式，再写另一部分就不困难了！
-
-0.0 到 1.0 间的浮点数：`^0\.\d*[1-9]\d*$`
-
-两个式子连接起来就是最终的结果：
-
-`^[1-9]\d*\.\d*|0\.\d*[1-9]\d*$`
-
-如果还是看不懂，看看下面的正则分布剖析图吧：
-
-<img src="./img/image-20200220225043053.png" width="50%"/>
-
-### 三、Python文件、日期和多线程
-
-Python文件IO操作涉及文件读写操作，获取文件`后缀名`，修改后缀名，获取文件修改时间，`压缩`文件，`加密`文件等操作。
-
-Python日期章节，由表示大日期的`calendar`, `date`模块，逐渐过渡到表示时间刻度更小的模块：`datetime`, `time`模块，按照此逻辑展开。
-
-Python`多线程`希望透过5个小例子，帮助你对多线程模型编程本质有些更清晰的认识。
-
-一共总结最常用的`26`个关于文件和时间处理模块的例子。
-
-#### 116 获取后缀名
-
-```python
-import os
-file_ext = os.path.splitext('./data/py/test.py')
-front,ext = file_ext
-In [5]: front
-Out[5]: './data/py/test'
-
-In [6]: ext
-Out[6]: '.py'
-```
-
-#### 117 文件读操作
-
-```python
-import os
-# 创建文件夹
-
-def mkdir(path):
-    isexists = os.path.exists(path)
-    if not isexists:
-        os.mkdir(path)
-# 读取文件信息
-
-def openfile(filename):
-    f = open(filename)
-    fllist = f.read()
-    f.close()
-    return fllist  # 返回读取内容
-```
-
-#### 118  文件写操作
-
-```python
-# 写入文件信息
-# example1
-# w写入，如果文件存在，则清空内容后写入，不存在则创建
-f = open(r"./data/test.txt", "w", encoding="utf-8")
-print(f.write("测试文件写入"))
-f.close
-
-# example2
-# a写入，文件存在，则在文件内容后追加写入，不存在则创建
-f = open(r"./data/test.txt", "a", encoding="utf-8")
-print(f.write("测试文件写入"))
-f.close
-
-# example3
-# with关键字系统会自动关闭文件和处理异常
-with open(r"./data/test.txt", "w") as f:
-    f.write("hello world!")
-```
-
-#### 119 路径中的文件名
-
-```python
-In [11]: import os
-    ...: file_ext = os.path.split('./data/py/test.py')
-    ...: ipath,ifile = file_ext
-    ...:
-
-In [12]: ipath
-Out[12]: './data/py'
-
-In [13]: ifile
-Out[13]: 'test.py'
-```
-
-#### 120 批量修改文件后缀
-
-**批量修改文件后缀**
-
-本例子使用Python的`os`模块和 `argparse`模块，将工作目录`work_dir`下所有后缀名为`old_ext`的文件修改为后缀名为`new_ext`
-
-通过本例子，大家将会大概清楚`argparse`模块的主要用法。
-
-导入模块
-
-```python
-import argparse
-import os
-```
-
-定义脚本参数
-
-```python
-def get_parser():
-    parser = argparse.ArgumentParser(
-        description='工作目录中文件后缀名修改')
-    parser.add_argument('work_dir', metavar='WORK_DIR', type=str, nargs=1,
-                        help='修改后缀名的文件目录')
-    parser.add_argument('old_ext', metavar='OLD_EXT',
-                        type=str, nargs=1, help='原来的后缀')
-    parser.add_argument('new_ext', metavar='NEW_EXT',
-                        type=str, nargs=1, help='新的后缀')
-    return parser
-```
-
-后缀名批量修改
-
-```python
-def batch_rename(work_dir, old_ext, new_ext):
-    """
-    传递当前目录，原来后缀名，新的后缀名后，批量重命名后缀
-    """
-    for filename in os.listdir(work_dir):
-        # 获取得到文件后缀
-        split_file = os.path.splitext(filename)
-        file_ext = split_file[1]
-        # 定位后缀名为old_ext 的文件
-        if old_ext == file_ext:
-            # 修改后文件的完整名称
-            newfile = split_file[0] + new_ext
-            # 实现重命名操作
-            os.rename(
-                os.path.join(work_dir, filename),
-                os.path.join(work_dir, newfile)
-            )
-    print("完成重命名")
-    print(os.listdir(work_dir))
-```
-
-实现Main
-
-```python
-def main():
-    """
-    main函数
-    """
-    # 命令行参数
-    parser = get_parser()
-    args = vars(parser.parse_args())
-    # 从命令行参数中依次解析出参数
-    work_dir = args['work_dir'][0]
-    old_ext = args['old_ext'][0]
-    if old_ext[0] != '.':
-        old_ext = '.' + old_ext
-    new_ext = args['new_ext'][0]
-    if new_ext[0] != '.':
-        new_ext = '.' + new_ext
-
-    batch_rename(work_dir, old_ext, new_ext)
-```
-
-
-
-#### 121 xls批量转换成xlsx
-
-```python
-import os
-
-
-def xls_to_xlsx(work_dir):
-    """
-    传递当前目录，原来后缀名，新的后缀名后，批量重命名后缀
-    """
-    old_ext, new_ext = '.xls', '.xlsx'
-    for filename in os.listdir(work_dir):
-        # 获取得到文件后缀
-        split_file = os.path.splitext(filename)
-        file_ext = split_file[1]
-        # 定位后缀名为old_ext 的文件
-        if old_ext == file_ext:
-            # 修改后文件的完整名称
-            newfile = split_file[0] + new_ext
-            # 实现重命名操作
-            os.rename(
-                os.path.join(work_dir, filename),
-                os.path.join(work_dir, newfile)
-            )
-    print("完成重命名")
-    print(os.listdir(work_dir))
-
-
-xls_to_xlsx('./data')
-
-# 输出结果：
-# ['cut_words.csv', 'email_list.xlsx', 'email_test.docx', 'email_test.jpg', 'email_test.xlsx', 'geo_data.png', 'geo_data.xlsx',
-'iotest.txt', 'pyside2.md', 'PySimpleGUI-4.7.1-py3-none-any.whl', 'test.txt', 'test_excel.xlsx', 'ziptest', 'ziptest.zip']
-```
-
-
-
-#### 122 定制文件不同行
-
-
-比较两个文件在哪些行内容不同，返回这些行的编号，行号编号从1开始。
-
-定义统计文件行数的函数
-
-```python
-# 统计文件个数
-    def statLineCnt(statfile):
-        print('文件名：'+statfile)
-        cnt = 0
-        with open(statfile, encoding='utf-8') as f:
-            while f.readline():
-                cnt += 1
-            return cnt
-```
-
-
-
-统计文件不同之处的子函数：
-
-```python
-# more表示含有更多行数的文件
-        def diff(more, cnt, less):
-            difflist = []
-            with open(less, encoding='utf-8') as l:
-                with open(more, encoding='utf-8') as m:
-                    lines = l.readlines()
-                    for i, line in enumerate(lines):
-                        if line.strip() != m.readline().strip():
-                            difflist.append(i)
-            if cnt - i > 1:
-                difflist.extend(range(i + 1, cnt))
-            return [no+1 for no in difflist]
-```
-
-
-
-主函数：
-
-```python
-# 返回的结果行号从1开始
-# list表示fileA和fileB不同的行的编号
-
-def file_diff_line_nos(fileA, fileB):
-    try:
-        cntA = statLineCnt(fileA)
-        cntB = statLineCnt(fileB)
-        if cntA > cntB:
-            return diff(fileA, cntA, fileB)
-        return diff(fileB, cntB, fileA)
-
-    except Exception as e:
-        print(e)
-```
-
-比较两个文件A和B，拿相对较短的文件去比较，过滤行后的换行符`\n`和空格。
-
-暂未考虑某个文件最后可能有的多行空行等特殊情况
-
-使用`file_diff_line_nos` 函数：
-
-```python
-if __name__ == '__main__':
-    import os
-    print(os.getcwd())
-
-    '''
-    例子：
-    fileA = "'hello world!!!!''\
-            'nice to meet you'\
-            'yes'\
-            'no1'\
-            'jack'"
-    fileB = "'hello world!!!!''\
-            'nice to meet you'\
-            'yes' "
-    '''
-    diff = file_diff_line_nos('./testdir/a.txt', './testdir/b.txt')
-    print(diff)  # [4, 5]
-```
-
-关于文件比较的，实际上，在Python中有对应模块`difflib` , 提供更多其他格式的文件更详细的比较，大家可参考：
-
-> https://docs.python.org/3/library/difflib.html?highlight=difflib#module-difflib
-
-
-
-#### 123 获取指定后缀名的文件
-
-```python
-import os
-
-def find_file(work_dir,extension='jpg'):
-    lst = []
-    for filename in os.listdir(work_dir):
-        print(filename)
-        splits = os.path.splitext(filename)
-        ext = splits[1] # 拿到扩展名
-        if ext == '.'+extension:
-            lst.append(filename)
-    return lst
-
-r = find_file('.','md') 
-print(r) # 返回所有目录下的md文件
-```
-
-
-#### 124 批量获取文件修改时间
-
-```python
-# 获取目录下文件的修改时间
-import os
-from datetime import datetime
-
-print(f"当前时间：{datetime.now().strftime('%Y-%m-%d %H:%M:%S')}")
-
-def get_modify_time(indir):
-    for root, _, files in os.walk(indir):  # 循环D:\works目录和子目录
-        for file in files:
-            absfile = os.path.join(root, file)
-            modtime = datetime.fromtimestamp(os.path.getmtime(absfile))
-            now = datetime.now()
-            difftime = now-modtime
-            if difftime.days < 20:  # 条件筛选超过指定时间的文件
-                print(f"""{absfile}
-                    修改时间[{modtime.strftime('%Y-%m-%d %H:%M:%S')}]
-                    距今[{difftime.days:3d}天{difftime.seconds//3600:2d}时{difftime.seconds%3600//60:2d}]"""
-                      )  # 打印相关信息
-
-
-get_modify_time('./data')
-```
-
-    打印效果：
-    当前时间：2019-12-22 16:38:53
-    ./data\cut_words.csv
-                        修改时间[2019-12-21 10:34:15]
-                        距今[  1天 6时 4]
-    当前时间：2019-12-22 16:38:53
-    ./data\cut_words.csv
-                        修改时间[2019-12-21 10:34:15]
-                        距今[  1天 6时 4]
-    ./data\email_test.docx
-                        修改时间[2019-12-03 07:46:29]
-                        距今[ 19天 8时52]
-    ./data\email_test.jpg
-                        修改时间[2019-12-03 07:46:29]
-                        距今[ 19天 8时52]
-    ./data\email_test.xlsx
-                        修改时间[2019-12-03 07:46:29]
-                        距今[ 19天 8时52]
-    ./data\iotest.txt
-                        修改时间[2019-12-13 08:23:18]
-                        距今[  9天 8时15]
-    ./data\pyside2.md
-                        修改时间[2019-12-05 08:17:22]
-                        距今[ 17天 8时21]
-    ./data\PySimpleGUI-4.7.1-py3-none-any.whl
-                        修改时间[2019-12-05 00:25:47]
-                        距今[ 17天16时13]
-
-#### 125 批量压缩文件
-
-
-```python
-import zipfile  # 导入zipfile,这个是用来做压缩和解压的Python模块；
-import os
-import time
-
-
-def batch_zip(start_dir):
-    start_dir = start_dir  # 要压缩的文件夹路径
-    file_news = start_dir + '.zip'  # 压缩后文件夹的名字
-
-    z = zipfile.ZipFile(file_news, 'w', zipfile.ZIP_DEFLATED)
-    for dir_path, dir_names, file_names in os.walk(start_dir):
-        # 这一句很重要，不replace的话，就从根目录开始复制
-        f_path = dir_path.replace(start_dir, '')
-        f_path = f_path and f_path + os.sep  # 实现当前文件夹以及包含的所有文件的压缩
-        for filename in file_names:
-            z.write(os.path.join(dir_path, filename), f_path + filename)
-    z.close()
-    return file_news
-
-
-batch_zip('./data/ziptest')
-
-
-```
-
-#### 126 32位加密
-
-```python
-import hashlib
-# 对字符串s实现32位加密
-
-
-def hash_cry32(s):
-    m = hashlib.md5()
-    m.update((str(s).encode('utf-8')))
-    return m.hexdigest()
-
-
-print(hash_cry32(1))  # c4ca4238a0b923820dcc509a6f75849b
-print(hash_cry32('hello'))  # 5d41402abc4b2a76b9719d911017c592
-```
-
-#### 127 年的日历图
-
-```python
-import calendar
-from datetime import date
-mydate = date.today()
-year_calendar_str = calendar.calendar(2019)
-print(f"{mydate.year}年的日历图：{year_calendar_str}\n")
-```
-
-打印结果：
-
-```python
-2019
-
-      January                   February                   March
-Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su
-    1  2  3  4  5  6                   1  2  3                   1  2  3
- 7  8  9 10 11 12 13       4  5  6  7  8  9 10       4  5  6  7  8  9 10
-14 15 16 17 18 19 20      11 12 13 14 15 16 17      11 12 13 14 15 16 17
-21 22 23 24 25 26 27      18 19 20 21 22 23 24      18 19 20 21 22 23 24
-28 29 30 31               25 26 27 28               25 26 27 28 29 30 31
-
-       April                      May                       June
-Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su
- 1  2  3  4  5  6  7             1  2  3  4  5                      1  2
- 8  9 10 11 12 13 14       6  7  8  9 10 11 12       3  4  5  6  7  8  9
-15 16 17 18 19 20 21      13 14 15 16 17 18 19      10 11 12 13 14 15 16
-22 23 24 25 26 27 28      20 21 22 23 24 25 26      17 18 19 20 21 22 23
-29 30                     27 28 29 30 31            24 25 26 27 28 29 30
-
-        July                     August                  September
-Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su
- 1  2  3  4  5  6  7                1  2  3  4                         1
- 8  9 10 11 12 13 14       5  6  7  8  9 10 11       2  3  4  5  6  7  8
-15 16 17 18 19 20 21      12 13 14 15 16 17 18       9 10 11 12 13 14 15
-22 23 24 25 26 27 28      19 20 21 22 23 24 25      16 17 18 19 20 21 22
-29 30 31                  26 27 28 29 30 31         23 24 25 26 27 28 29
-                                                    30
-
-      October                   November                  December
-Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su
-    1  2  3  4  5  6                   1  2  3                         1
- 7  8  9 10 11 12 13       4  5  6  7  8  9 10       2  3  4  5  6  7  8
-14 15 16 17 18 19 20      11 12 13 14 15 16 17       9 10 11 12 13 14 15
-21 22 23 24 25 26 27      18 19 20 21 22 23 24      16 17 18 19 20 21 22
-28 29 30 31               25 26 27 28 29 30         23 24 25 26 27 28 29
-                                                    30 31
-```
-
-#### 128 判断是否为闰年
-
-```python
-import calendar
-from datetime import date
-
-mydate = date.today()
-is_leap = calendar.isleap(mydate.year)
-print_leap_str = "%s年是闰年" if is_leap else "%s年不是闰年\n"
-print(print_leap_str % mydate.year)
-```
-
-打印结果：
-
-```python
-2019年不是闰年
-```
-
-#### 129 月的日历图
-
-```python
-import calendar
-from datetime import date
-
-mydate = date.today()
-month_calendar_str = calendar.month(mydate.year, mydate.month)
-
-print(f"{mydate.year}年-{mydate.month}月的日历图：{month_calendar_str}\n")
-```
-
-打印结果：
-
-```python
-December 2019
-Mo Tu We Th Fr Sa Su
-                   1
- 2  3  4  5  6  7  8
- 9 10 11 12 13 14 15
-16 17 18 19 20 21 22
-23 24 25 26 27 28 29
-30 31
-```
-
-#### 130 月有几天
-
-```python
-import calendar
-from datetime import date
-
-mydate = date.today()
-weekday, days = calendar.monthrange(mydate.year, mydate.month)
-print(f'{mydate.year}年-{mydate.month}月的第一天是那一周的第{weekday}天\n')
-print(f'{mydate.year}年-{mydate.month}月共有{days}天\n')
-```
-
-打印结果：
-
-```python
-2019年-12月的第一天是那一周的第6天
-
-2019年-12月共有31天
-```
-
-
-
-#### 131 月第一天
-
-```python
-from datetime import date
-mydate = date.today()
-month_first_day = date(mydate.year, mydate.month, 1)
-print(f"当月第一天:{month_first_day}\n")
-```
-
-打印结果：
-
-```python
-# 当月第一天:2019-12-01
-```
-
-
-
-#### 131 月最后一天
-
-```python
-from datetime import date
-import calendar
-mydate = date.today()
-_, days = calendar.monthrange(mydate.year, mydate.month)
-month_last_day = date(mydate.year, mydate.month, days)
-print(f"当月最后一天:{month_last_day}\n")
-```
-
-打印结果：
-
-```python
-当月最后一天:2019-12-31
-```
-
-
-
-#### 132 获取当前时间
-
-```python
-from datetime import date, datetime
-from time import localtime
-
-today_date = date.today()
-print(today_date)  # 2019-12-22
-
-today_time = datetime.today()
-print(today_time)  # 2019-12-22 18:02:33.398894
-
-local_time = localtime()
-print(strftime("%Y-%m-%d %H:%M:%S", local_time))  # 转化为定制的格式 2019-12-22 18:13:41
-```
-
-
-
-#### 133 字符时间转时间
-
-```python
-from time import strptime
-
-# parse str time to struct time
-struct_time = strptime('2019-12-22 10:10:08', "%Y-%m-%d %H:%M:%S")
-print(struct_time) # struct_time类型就是time中的一个类
-
-# time.struct_time(tm_year=2019, tm_mon=12, tm_mday=22, tm_hour=10, tm_min=10, tm_sec=8, tm_wday=6, tm_yday=356, tm_isdst=-1)
-```
-
-
-
-#### 134 时间转字符时间
-
-```python
-from time import strftime, strptime, localtime
-
-In [2]: print(localtime()) #这是输入的时间
-Out[2]: time.struct_time(tm_year=2019, tm_mon=12, tm_mday=22, tm_hour=18, tm_min=24, tm_sec=56, tm_wday=6, tm_yday=356, tm_isdst=0)
-
-print(strftime("%m-%d-%Y %H:%M:%S", localtime()))  # 转化为定制的格式
-# 这是字符串表示的时间：   12-22-2019 18:26:21
-```
-
-
-
-#### 135 默认启动主线程
-
-一般的，程序默认执行只在一个线程，这个线程称为主线程，例子演示如下：
-
-导入线程相关的模块 `threading`:
-
-```python
-import threading
-```
-
-threading的类方法 `current_thread()`返回当前线程：
-
-```python
-t = threading.current_thread()
-print(t) # <_MainThread(MainThread, started 139908235814720)>
-```
-
-所以，验证了程序默认是在`MainThead`中执行。
-
-`t.getName()`获得这个线程的名字，其他常用方法，`getName()`获得线程`id`,`isAlive()`判断线程是否存活等。
-
-```python
-print(t.getName()) # MainThread
-print(t.ident) # 139908235814720
-print(t.isAlive()) # True
-```
-
-以上这些仅是介绍多线程的`背景知识`，因为到目前为止，我们有且仅有一个"干活"的主线程
-
-#### 136 创建线程
-
-创建一个线程：
-
-```python
-my_thread = threading.Thread()
-```
-
-创建一个名称为`my_thread`的线程：
-
-```python
-my_thread = threading.Thread(name='my_thread')
-```
-
-创建线程的目的是告诉它帮助我们做些什么，做些什么通过参数`target`传入，参数类型为`callable`，函数就是可调用的：
-
-```python
-def print_i(i):
-    print('打印i:%d'%(i,))
-my_thread = threading.Thread(target=print_i,args=(1,))
-```
-
-`my_thread`线程已经全副武装，但是我们得按下发射按钮，启动start()，它才开始真正起飞。
-
-```python
-my_thread().start()
-```
-
-打印结果如下，其中`args`指定函数`print_i`需要的参数i，类型为元祖。
-
-```python
-打印i:1
-```
-
-至此，多线程相关的核心知识点，已经总结完毕。但是，仅仅知道这些，还不够！光纸上谈兵，当然远远不够。
-
-接下来，聊聊应用多线程编程，最本质的一些东西。
-
-**3 交替获得CPU时间片**
-
-为了更好解释，假定计算机是单核的，尽管对于`cpython`，这个假定有些多余。
-
-开辟3个线程，装到`threads`中:
-
-```python
-import time
-from datetime import datetime
-import threading
-
-
-def print_time():
-    for _ in range(5): # 在每个线程中打印5次
-        time.sleep(0.1) # 模拟打印前的相关处理逻辑
-        print('当前线程%s,打印结束时间为:%s'%(threading.current_thread().getName(),datetime.today()))
-
-
-threads = [threading.Thread(name='t%d'%(i,),target=print_time) for i in range(3)]
-```
-
-启动3个线程：
-
-```python
-[t.start() for t in threads]
-```
-
-打印结果如下，`t0`,`t1`,`t2`三个线程，根据操作系统的调度算法，轮询获得CPU时间片，注意观察，`t2`线程可能被连续调度，从而获得时间片。
-
-```python
-当前线程t0,打印结束时间为:2020-01-12 02:27:15.705235
-当前线程t1,打印结束时间为:2020-01-12 02:27:15.705402
-当前线程t2,打印结束时间为:2020-01-12 02:27:15.705687
-当前线程t0,打印结束时间为:2020-01-12 02:27:15.805767
-当前线程t1,打印结束时间为:2020-01-12 02:27:15.805886
-当前线程t2,打印结束时间为:2020-01-12 02:27:15.806044
-当前线程t0,打印结束时间为:2020-01-12 02:27:15.906200
-当前线程t2,打印结束时间为:2020-01-12 02:27:15.906320
-当前线程t1,打印结束时间为:2020-01-12 02:27:15.906433
-当前线程t0,打印结束时间为:2020-01-12 02:27:16.006581
-当前线程t1,打印结束时间为:2020-01-12 02:27:16.006766
-当前线程t2,打印结束时间为:2020-01-12 02:27:16.007006
-当前线程t2,打印结束时间为:2020-01-12 02:27:16.107564
-当前线程t0,打印结束时间为:2020-01-12 02:27:16.107290
-当前线程t1,打印结束时间为:2020-01-12 02:27:16.107741
-```
-
-#### 137 多线程抢夺同一个变量
-
-多线程编程，存在抢夺同一个变量的问题。
-
-比如下面例子，创建的10个线程同时竞争全局变量`a`:
-​
-
-```python
-import threading
-
-
-a = 0
-def add1():
-    global a    
-    a += 1
-    print('%s  adds a to 1: %d'%(threading.current_thread().getName(),a))
-    
-threads = [threading.Thread(name='t%d'%(i,),target=add1) for i in range(10)]
-[t.start() for t in threads]
-```
-
-执行结果：
-
-```python
-t0  adds a to 1: 1
-t1  adds a to 1: 2
-t2  adds a to 1: 3
-t3  adds a to 1: 4
-t4  adds a to 1: 5
-t5  adds a to 1: 6
-t6  adds a to 1: 7
-t7  adds a to 1: 8
-t8  adds a to 1: 9
-t9  adds a to 1: 10
-```
-
-结果一切正常，每个线程执行一次，把`a`的值加1，最后`a` 变为10，一切正常。
-
-运行上面代码十几遍，一切也都正常。
-
-所以，我们能下结论：这段代码是线程安全的吗？
-
-NO！
-
-多线程中，只要存在同时读取和修改一个全局变量的情况，如果不采取其他措施，就一定不是线程安全的。
-
-尽管，有时，某些情况的资源竞争，暴露出问题的概率`极低极低`：
-
-本例中，如果线程0 在修改a后，其他某些线程还是get到的是没有修改前的值，就会暴露问题。
-
-
-
-但是在本例中，`a = a + 1`这种修改操作，花费的时间太短了，短到我们无法想象。所以，线程间轮询执行时，都能get到最新的a值。所以，暴露问题的概率就变得微乎其微。
-
-#### 138 代码稍作改动，叫问题暴露出来
-
-只要弄明白问题暴露的原因，叫问题出现还是不困难的。
-
-想象数据库的写入操作，一般需要耗费我们可以感知的时间。
-
-为了模拟这个写入动作，简化期间，我们只需要延长修改变量`a`的时间，问题很容易就会还原出来。
-
-```python
-import threading
-import time
-
-
-a = 0
-def add1():
-    global a    
-    tmp = a + 1
-    time.sleep(0.2) # 延时0.2秒，模拟写入所需时间
-    a = tmp
-    print('%s  adds a to 1: %d'%(threading.current_thread().getName(),a))
-    
-threads = [threading.Thread(name='t%d'%(i,),target=add1) for i in range(10)]
-[t.start() for t in threads]
-```
-
-重新运行代码，只需一次，问题立马完全暴露，结果如下：
-
-```python
-t0  adds a to 1: 1
-t1  adds a to 1: 1
-t2  adds a to 1: 1
-t3  adds a to 1: 1
-t4  adds a to 1: 1
-t5  adds a to 1: 1
-t7  adds a to 1: 1
-t6  adds a to 1: 1
-t8  adds a to 1: 1
-t9  adds a to 1: 1
-```
-
-看到，10个线程全部运行后，`a`的值只相当于一个线程执行的结果。
-
-下面分析，为什么会出现上面的结果：
-
-这是一个很有说服力的例子，因为在修改a前，有0.2秒的休眠时间，某个线程延时后，CPU立即分配计算资源给其他线程。直到分配给所有线程后，根据结果反映出，0.2秒的休眠时长还没耗尽，这样每个线程get到的a值都是0，所以才出现上面的结果。
-
-
-
-以上最核心的三行代码：
-
-```python
-tmp = a + 1
-time.sleep(0.2) # 延时0.2秒，模拟写入所需时间
-a = tmp
-```
-
-#### 139 加上一把锁，避免以上情况出现
-
-知道问题出现的原因后，要想修复问题，也没那么复杂。
-
-通过python中提供的锁机制，某段代码只能单线程执行时，上锁，其他线程等待，直到释放锁后，其他线程再争锁，执行代码，释放锁，重复以上。
-
-创建一把锁`locka`:
-
-```python
-import threading
-import time
-
-
-locka = threading.Lock()
-```
-
-通过 `locka.acquire()` 获得锁，通过`locka.release()`释放锁，它们之间的这些代码，只能单线程执行。
-
-```python
-a = 0
-def add1():
-    global a    
-    try:
-        locka.acquire() # 获得锁
-        tmp = a + 1
-        time.sleep(0.2) # 延时0.2秒，模拟写入所需时间
-        a = tmp
-    finally:
-        locka.release() # 释放锁
-    print('%s  adds a to 1: %d'%(threading.current_thread().getName(),a))
-    
-threads = [threading.Thread(name='t%d'%(i,),target=add1) for i in range(10)]
-[t.start() for t in threads]
-```
-
-执行结果如下：
-
-```python
-t0  adds a to 1: 1
-t1  adds a to 1: 2
-t2  adds a to 1: 3
-t3  adds a to 1: 4
-t4  adds a to 1: 5
-t5  adds a to 1: 6
-t6  adds a to 1: 7
-t7  adds a to 1: 8
-t8  adds a to 1: 9
-t9  adds a to 1: 10
-```
-
-一起正常，其实这已经是单线程顺序执行了，就本例子而言，已经失去多线程的价值，并且还带来了因为线程创建开销，浪费时间的副作用。
-
-程序中只有一把锁，通过 `try...finally`还能确保不发生死锁。但是，当程序中启用多把锁，还是很容易发生死锁。
-
-注意使用场合，避免死锁，是我们在使用多线程开发时需要注意的一些问题。
-
-#### 140 1 分钟掌握 time 模块
-
-time 模块提供时间相关的类和函数
-
-记住一个类：`struct_time`，9 个整数组成的元组
-
-记住下面 5 个最常用函数
-
-首先导入`time`模块
-
-```python
-import time
-```
-
-**1 此时此刻时间浮点数**
-
-```python
-In [58]: seconds = time.time()
-In [60]: seconds
-Out[60]: 1582341559.0950701
-```
-
-**2 时间数组**
-
-```python
-In [61]: local_time = time.localtime(seconds)
-
-In [62]: local_time
-Out[62]: time.struct_time(tm_year=2020, tm_mon=2, tm_mday=22, tm_hour=11, tm_min=19, tm_sec=19, tm_wday=5, tm_yday=53, tm_isdst=0)
-```
-
-**3 时间字符串**
-
-`time.asctime` 语义： `as convert time`
-
-```python
-In [63]: str_time = time.asctime(local_time)
-
-In [64]: str_time
-Out[64]: 'Sat Feb 22 11:19:19 2020'
-```
-
-**4 格式化时间字符串**
-
-`time.strftime` 语义： `string format time`
-
-```python
-In [65]: format_time = time.strftime('%Y-%m-%d %H:%M:%S',local_time)
-
-In [66]: format_time
-Out[66]: '2020-02-22 11:19:19'
-```
-
-**5 字符时间转时间数组**
-
-```python
-In [68]: str_to_struct = time.strptime(format_time,'%Y-%m-%d %H:%M:%S')
-
-In [69]: str_to_struct
-Out[69]: time.struct_time(tm_year=2020, tm_mon=2, tm_mday=22, tm_hour=11, tm_min=19, tm_sec=19, tm_wday=5, tm_yday=53, tm_isdst=-1)
-```
-
-最后再记住常用字符串格式
-
-**常用字符串格式**
-
-%m：月
-
-%M: 分钟
-
-```markdown
-    %Y  Year with century as a decimal number.
-    %m  Month as a decimal number [01,12].
-    %d  Day of the month as a decimal number [01,31].
-    %H  Hour (24-hour clock) as a decimal number [00,23].
-    %M  Minute as a decimal number [00,59].
-    %S  Second as a decimal number [00,61].
-    %z  Time zone offset from UTC.
-    %a  Locale's abbreviated weekday name.
-    %A  Locale's full weekday name.
-    %b  Locale's abbreviated month name.
-```
-
-#### 141 4G 内存处理 10G 大小的文件
-
-4G 内存处理 10G 大小的文件，单机怎么做？
-
-下面的讨论基于的假定：可以单独处理一行数据，行间数据相关性为零。
-
-方法一：
-
-仅使用 Python 内置模板，逐行读取到内存。
-
-使用 yield，好处是解耦读取操作和处理操作：
-
-```python
-def python_read(filename):
-    with open(filename,'r',encoding='utf-8') as f:
-        while True:
-            line = f.readline()
-            if not line:
-                return
-            yield line
-```
-
-以上每次读取一行，逐行迭代，逐行处理数据
-
-```python
-if __name__ == '__main__':
-    g = python_read('./data/movies.dat')
-    for c in g:
-        print(c)
-        # process c
-```
-
-方法二：
-
-方法一有缺点，逐行读入，频繁的 IO 操作拖累处理效率。是否有一次 IO ，读取多行的方法？
-
-`pandas` 包 `read_csv` 函数，参数有 38 个之多，功能非常强大。
-
-关于单机处理大文件，`read_csv` 的 `chunksize` 参数能做到，设置为 `5`， 意味着一次读取 5 行。
-
-```python
-def pandas_read(filename,sep=',',chunksize=5):
-    reader = pd.read_csv(filename,sep,chunksize=chunksize)
-    while True:
-        try:
-            yield reader.get_chunk()
-        except StopIteration:
-            print('---Done---')
-            break
-```
-
-使用如同方法一：
-```python
-if __name__ == '__main__':
-    g = pandas_read('./data/movies.dat',sep="::")
-    for c in g:
-        print(c)
-        # process c
-```
-
-以上就是单机处理大文件的两个方法，推荐使用方法二，更加灵活。除了工作中会用到，面试中也有时被问到。
-
-### 四、Python三大利器
-
-Python中的三大利器包括：`迭代器`，`生成器`，`装饰器`，利用好它们才能开发出最高性能的Python程序，涉及到的内置模块 `itertools`提供迭代器相关的操作。此部分收录有意思的例子共计`15`例。
-
-
-#### 142 寻找第n次出现位置
-
-```python
-def search_n(s, c, n):
-    size = 0
-    for i, x in enumerate(s):
-        if x == c:
-            size += 1
-        if size == n:
-            return i
-    return -1
-
-
-
-print(search_n("fdasadfadf", "a", 3))# 结果为7，正确
-print(search_n("fdasadfadf", "a", 30))# 结果为-1，正确
-```
-
-
-#### 143 斐波那契数列前n项
-
-```python
-def fibonacci(n):
-    a, b = 1, 1
-    for _ in range(n):
-        yield a
-        a, b = b, a + b
-
-
-list(fibonacci(5))  # [1, 1, 2, 3, 5]
-```
-
-#### 144 找出所有重复元素
-
-```python
-from collections import Counter
-
-
-def find_all_duplicates(lst):
-    c = Counter(lst)
-    return list(filter(lambda k: c[k] > 1, c))
-
-
-find_all_duplicates([1, 2, 2, 3, 3, 3])  # [2,3]
-```
-
-#### 145 联合统计次数
-Counter对象间可以做数学运算
-
-```python
-from collections import Counter
-a = ['apple', 'orange', 'computer', 'orange']
-b = ['computer', 'orange']
-
-ca = Counter(a)
-cb = Counter(b)
-#Counter对象间可以做数学运算
-ca + cb  # Counter({'orange': 3, 'computer': 2, 'apple': 1})
-
-
-# 进一步抽象，实现多个列表内元素的个数统计
-
-
-def sumc(*c):
-    if (len(c) < 1):
-        return
-    mapc = map(Counter, c)
-    s = Counter([])
-    for ic in mapc: # ic 是一个Counter对象
-        s += ic
-    return s
-
-
-#Counter({'orange': 3, 'computer': 3, 'apple': 1, 'abc': 1, 'face': 1})
-sumc(a, b, ['abc'], ['face', 'computer'])
-
-```
-
-#### 146 groupby单字段分组
-
-天气记录：
-
-```python
-a = [{'date': '2019-12-15', 'weather': 'cloud'},
- {'date': '2019-12-13', 'weather': 'sunny'},
- {'date': '2019-12-14', 'weather': 'cloud'}]
-```
-
-按照天气字段`weather`分组汇总：
-
-```python
-from itertools import groupby
-for k, items in  groupby(a,key=lambda x:x['weather']):
-     print(k)
-```
-
-输出结果看出，分组失败！原因：分组前必须按照分组字段`排序`，这个很坑~
-
-```python
-cloud
-sunny
-cloud
-```
-
-修改代码：
-
-```python
-a.sort(key=lambda x: x['weather'])
-for k, items in  groupby(a,key=lambda x:x['weather']):
-     print(k)
-     for i in items:
-         print(i)
-```
-
-输出结果：
-
-```python
-cloud
-{'date': '2019-12-15', 'weather': 'cloud'}
-{'date': '2019-12-14', 'weather': 'cloud'}
-sunny
-{'date': '2019-12-13', 'weather': 'sunny'}
-```
-
-#### 147 itemgetter和key函数
-
-注意到`sort`和`groupby`所用的`key`函数，除了`lambda`写法外，还有一种简写，就是使用`itemgetter`：
-
-```python
-a = [{'date': '2019-12-15', 'weather': 'cloud'},
- {'date': '2019-12-13', 'weather': 'sunny'},
- {'date': '2019-12-14', 'weather': 'cloud'}]
-from operator import itemgetter
-from itertools import groupby
-
-a.sort(key=itemgetter('weather'))
-for k, items in groupby(a, key=itemgetter('weather')):
-     print(k)
-     for i in items:
-         print(i)
-```
-
-结果：
-
-```python
-cloud
-{'date': '2019-12-15', 'weather': 'cloud'}
-{'date': '2019-12-14', 'weather': 'cloud'}
-sunny
-{'date': '2019-12-13', 'weather': 'sunny'}
-```
-
-#### 148 groupby多字段分组
-
-`itemgetter`是一个类，`itemgetter('weather')`返回一个可调用的对象，它的参数可有多个：
-
-```python
-from operator import itemgetter
-from itertools import groupby
-
-a.sort(key=itemgetter('weather', 'date'))
-for k, items in groupby(a, key=itemgetter('weather')):
-     print(k)
-     for i in items:
-         print(i)
-```
-
-结果如下，使用`weather`和`date`两个字段排序`a`，
-
-```python
-cloud
-{'date': '2019-12-14', 'weather': 'cloud'}
-{'date': '2019-12-15', 'weather': 'cloud'}
-sunny
-{'date': '2019-12-13', 'weather': 'sunny'}
-```
-
-注意这个结果与上面结果有些微妙不同，这个更多是我们想看到和使用更多的。
-
-#### 149 sum函数计算和聚合同时做
-
-Python中的聚合类函数`sum`,`min`,`max`第一个参数是`iterable`类型，一般使用方法如下：
-
-```python
-a = [4,2,5,1]
-sum([i+1 for i in a]) # 16
-```
-
-使用列表生成式`[i+1 for i in a]`创建一个长度与`a`一行的临时列表，这步完成后，再做`sum`聚合。
-
-试想如果你的数组`a`长度十百万级，再创建一个这样的临时列表就很不划算，最好是一边算一边聚合，稍改动为如下：
-
-```python
-a = [4,2,5,1]
-sum(i+1 for i in a) # 16
-```
-
-此时`i+1 for i in a`是`(i+1 for i in a)`的简写，得到一个生成器(`generator`)对象，如下所示：
-
-```python
-In [8]:(i+1 for i in a)
-OUT [8]:<generator object <genexpr> at 0x000002AC7FFA8CF0>
-```
-
-生成器每迭代一步吐出(`yield`)一个元素并计算和聚合后，进入下一次迭代，直到终点。
-
-#### 150 list分组(生成器版)
-
-```python
-from math import ceil
-
-def divide_iter(lst, n):
-    if n <= 0:
-        yield lst
-        return
-    i, div = 0, ceil(len(lst) / n)
-    while i < n:
-        yield lst[i * div: (i + 1) * div]
-        i += 1
-
-list(divide_iter([1, 2, 3, 4, 5], 0))  # [[1, 2, 3, 4, 5]]
-list(divide_iter([1, 2, 3, 4, 5], 2))  # [[1, 2, 3], [4, 5]]
-```
-
-#### 151 列表全展开（生成器版）
-```python
-#多层列表展开成单层列表
-a=[1,2,[3,4,[5,6],7],8,["python",6],9]
-def function(lst):
-    for i in lst:
-        if type(i)==list:
-            yield from function(i)
-        else:
-            yield i
-print(list(function(a))) # [1, 2, 3, 4, 5, 6, 7, 8, 'python', 6, 9]
-```
-
-#### 152 测试函数运行时间的装饰器
-```python
-#测试函数执行时间的装饰器示例
-import time
-def timing_func(fn):
-    def wrapper():
-        start=time.time()
-        fn()   #执行传入的fn参数
-        stop=time.time()
-        return (stop-start)
-    return wrapper
-@timing_func
-def test_list_append():
-    lst=[]
-    for i in range(0,100000):
-        lst.append(i)  
-@timing_func
-def test_list_compre():
-    [i for i in range(0,100000)]  #列表生成式
-a=test_list_append()
-c=test_list_compre()
-print("test list append time:",a)
-print("test list comprehension time:",c)
-print("append/compre:",round(a/c,3))
-
-test list append time: 0.0219423770904541
-test list comprehension time: 0.007980823516845703
-append/compre: 2.749
-```
-
-#### 153 统计异常出现次数和时间的装饰器
-
-
-写一个装饰器，统计某个异常重复出现指定次数时，经历的时长。
-```python
-import time
-import math
-
-
-def excepter(f):
-    i = 0
-    t1 = time.time()
-    def wrapper(): 
-        try:
-            f()
-        except Exception as e:
-            nonlocal i
-            i += 1
-            print(f'{e.args[0]}: {i}')
-            t2 = time.time()
-            if i == n:
-                print(f'spending time:{round(t2-t1,2)}')
-    return wrapper
-
-```
-
-关键词`nonlocal`常用于函数嵌套中，声明变量i为非局部变量；
-
-如果不声明，`i+=1`表明`i`为函数`wrapper`内的局部变量，因为在`i+=1`引用(reference)时,`i`未被声明，所以会报`unreferenced variable`的错误。
-
-使用创建的装饰函数`excepter`, `n`是异常出现的次数。
-
-共测试了两类常见的异常：`被零除`和`数组越界`。
-
-```python
-n = 10 # except count
-
-@excepter
-def divide_zero_except():
-    time.sleep(0.1)
-    j = 1/(40-20*2)
-
-# test zero divived except
-for _ in range(n):
-    divide_zero_except()
-
-
-@excepter
-def outof_range_except():
-    a = [1,3,5]
-    time.sleep(0.1)
-    print(a[3])
-# test out of range except
-for _ in range(n):
-    outof_range_except()
-
-```
-
-打印出来的结果如下：
-```python
-division by zero: 1
-division by zero: 2
-division by zero: 3
-division by zero: 4
-division by zero: 5
-division by zero: 6
-division by zero: 7
-division by zero: 8
-division by zero: 9
-division by zero: 10
-spending time:1.01
-list index out of range: 1
-list index out of range: 2
-list index out of range: 3
-list index out of range: 4
-list index out of range: 5
-list index out of range: 6
-list index out of range: 7
-list index out of range: 8
-list index out of range: 9
-list index out of range: 10
-spending time:1.01
-```
-
-
-#### 154 测试运行时长的装饰器
-
-
-```python
-#测试函数执行时间的装饰器示例
-import time
-def timing(fn):
-    def wrapper():
-        start=time.time()
-        fn()   #执行传入的fn参数
-        stop=time.time()
-        return (stop-start)
-    return wrapper
-
-@timing
-def test_list_append():
-    lst=[]
-    for i in range(0,100000):
-        lst.append(i)  
-
-@timing
-def test_list_compre():
-    [i for i in range(0,100000)]  #列表生成式
-
-a=test_list_append()
-c=test_list_compre()
-print("test list append time:",a)
-print("test list comprehension time:",c)
-print("append/compre:",round(a/c,3))
-
-# test list append time: 0.0219
-# test list comprehension time: 0.00798
-# append/compre: 2.749
-```
-
-#### 155 装饰器通俗理解
-
-再看一个装饰器：
-
-```python
-def call_print(f):
-  def g():
-    print('you\'re calling %s function'%(f.__name__,))
-  return g
-```
-
-使用`call_print`装饰器：
-
-```python
-@call_print
-def myfun():
-  pass
- 
-@call_print
-def myfun2():
-  pass
-```
-
-myfun()后返回：
-
-```python
-In [27]: myfun()
-you're calling myfun function
-
-In [28]: myfun2()
-you're calling myfun2 function
-```
-
-**使用call_print**
-
-你看，`@call_print`放置在任何一个新定义的函数上面，都会默认输出一行，你正在调用这个函数的名。
-
-这是为什么呢？注意观察新定义的`call_print`函数(加上@后便是装饰器):
-
-```python
-def call_print(f):
-  def g():
-    print('you\'re calling %s function'%(f.__name__,))
-  return g
-```
-
-它必须接受一个函数`f`，然后返回另外一个函数`g`.
-
-**装饰器本质**
-
-本质上，它与下面的调用方式效果是等效的：
-
-```
-def myfun():
-  pass
-
-def myfun2():
-  pass
-  
-def call_print(f):
-  def g():
-    print('you\'re calling %s function'%(f.__name__,))
-  return g
-```
-
-下面是最重要的代码：
-
-```
-myfun = call_print(myfun)
-myfun2 = call_print(myfun2)
-```
-
-大家看明白吗？也就是call_print(myfun)后不是返回一个函数吗，然后再赋值给myfun.
-
-再次调用myfun, myfun2时，效果是这样的：
-
-```python
-In [32]: myfun()
-you're calling myfun function
-
-In [33]: myfun2()
-you're calling myfun2 function
-```
-
-你看，这与装饰器的实现效果是一模一样的。装饰器的写法可能更加直观些，所以不用显示的这样赋值：`myfun = call_print(myfun)`，`myfun2 = call_print(myfun2)`，但是装饰器的这种封装，猛一看，有些不好理解。
-
-#### 156 定制递减迭代器
-
-```python
-#编写一个迭代器，通过循环语句，实现对某个正整数的依次递减1，直到0.
-class Descend(Iterator):
-    def __init__(self,N):
-        self.N=N
-        self.a=0
-    def __iter__(self):
-        return self 
-    def __next__(self):
-        while self.a<self.N:
-            self.N-=1
-            return self.N
-        raise StopIteration
-    
-descend_iter=Descend(10)
-print(list(descend_iter))
-[9, 8, 7, 6, 5, 4, 3, 2, 1, 0]
-```
-
-核心要点：
-
-1 `__nex__ `名字不能变，实现定制的迭代逻辑
-
-2 `raise StopIteration`：通过 raise 中断程序，必须这样写
-
-
-
-### 五、Python绘图
-
-Python常用的绘图工具包括：`matplotlib`, `seaborn`, `plotly`等，以及一些其他专用于绘制某类图如词云图等的包，描绘绘图轨迹的`turtle`包等。本章节将会使用一些例子由易到难的阐述绘图的经典小例子，目前共收录`27`个。
-
-#### 157 turtle绘制奥运五环图
-
-turtle绘图的函数非常好用，基本看到函数名字，就能知道它的含义，下面使用turtle，仅用15行代码来绘制奥运五环图。
-
-1 导入库
-
-```python
-import turtle as p
-```
-
-2 定义画圆函数
-
-```python
-def drawCircle(x,y,c='red'):
-    p.pu()# 抬起画笔
-    p.goto(x,y) # 绘制圆的起始位置
-    p.pd()# 放下画笔
-    p.color(c)# 绘制c色圆环
-    p.circle(30,360) #绘制圆：半径，角度
-```
-
-3 画笔基本设置
-
-```python
-p = turtle
-p.pensize(3) # 画笔尺寸设置3
-```
-
-4 绘制五环图
-
-调用画圆函数
-
-```python
-drawCircle(0,0,'blue')
-drawCircle(60,0,'black')
-drawCircle(120,0,'red')
-drawCircle(90,-30,'green')
-drawCircle(30,-30,'yellow')    
-
-p.done()
-```
-
-结果：
-
-![](./img/turtle1.png)
-
-#### 158 turtle绘制漫天雪花
-
-
-导入模块
-
-导入 `turtle`库和 python的 `random`
-
-```python
-import turtle as p
-import random
-```
-
-绘制雪花
-
-```python
-def snow(snow_count):
-    p.hideturtle()
-    p.speed(500)
-    p.pensize(2)
-    for i in range(snow_count):
-        r = random.random()
-        g = random.random()
-        b = random.random()
-        p.pencolor(r, g, b)
-        p.pu()
-        p.goto(random.randint(-350, 350), random.randint(1, 270))
-        p.pd()
-        dens = random.randint(8, 12)
-        snowsize = random.randint(10, 14)
-        for _ in range(dens):
-            p.forward(snowsize)  # 向当前画笔方向移动snowsize像素长度
-            p.backward(snowsize)  # 向当前画笔相反方向移动snowsize像素长度
-            p.right(360 / dens)  # 顺时针移动360 / dens度
-
-```
-
-绘制地面
-
-```python
-def ground(ground_line_count):
-    p.hideturtle()
-    p.speed(500)
-    for i in range(ground_line_count):
-        p.pensize(random.randint(5, 10))
-        x = random.randint(-400, 350)
-        y = random.randint(-280, -1)
-        r = -y / 280
-        g = -y / 280
-        b = -y / 280
-        p.pencolor(r, g, b)
-        p.penup()  # 抬起画笔
-        p.goto(x, y)  # 让画笔移动到此位置
-        p.pendown()  # 放下画笔
-        p.forward(random.randint(40, 100))  # 眼当前画笔方向向前移动40~100距离
-```
-
-主函数
-
-```python
-def main():
-    p.setup(800, 600, 0, 0)
-    # p.tracer(False)
-    p.bgcolor("black")
-    snow(30)
-    ground(30)
-    # p.tracer(True)
-    p.mainloop()
-
-main()
-```
-
-
-动态图结果展示：
-
-<img src="./img/turtlesnow.gif" alt="Sample"  width="50%">
-
-#### 159 wordcloud词云图
-
-
-```python
-import hashlib
-import pandas as pd
-from wordcloud import WordCloud
-geo_data=pd.read_excel(r"../data/geo_data.xlsx")
-print(geo_data)
-# 0     深圳
-# 1     深圳
-# 2     深圳
-# 3     深圳
-# 4     深圳
-# 5     深圳
-# 6     深圳
-# 7     广州
-# 8     广州
-# 9     广州
-
-words = ','.join(x for x in geo_data['city'] if x != []) #筛选出非空列表值
-wc = WordCloud(
-    background_color="green", #背景颜色"green"绿色
-    max_words=100, #显示最大词数
-    font_path='./fonts/simhei.ttf', #显示中文
-    min_font_size=5,
-    max_font_size=100,
-    width=500  #图幅宽度
-    )
-x = wc.generate(words)
-x.to_file('../data/geo_data.png')
-```
-
-#### 160 plotly画柱状图和折线图
-
-```python
-#柱状图+折线图
-import plotly.graph_objects as go
-fig = go.Figure()
-fig.add_trace(
-    go.Scatter(
-        x=[0, 1, 2, 3, 4, 5],
-        y=[1.5, 1, 1.3, 0.7, 0.8, 0.9]
-    ))
-fig.add_trace(
-    go.Bar(
-        x=[0, 1, 2, 3, 4, 5],
-        y=[2, 0.5, 0.7, -1.2, 0.3, 0.4]
-    ))
-fig.show()
-```
-
-<img src="img/bar1.png" width=50%/>
-
-
-#### 161 seaborn热力图
-
-```python
-# 导入库
-import seaborn as sns
-import pandas as pd
-import numpy as np
-import matplotlib.pyplot as plt
-
-# 生成数据集
-data = np.random.random((6,6))
-np.fill_diagonal(data,np.ones(6))
-features = ["prop1","prop2","prop3","prop4","prop5", "prop6"]
-data = pd.DataFrame(data, index = features, columns=features)
-print(data)
-# 绘制热力图
-heatmap_plot = sns.heatmap(data, center=0, cmap='gist_rainbow')
-plt.show()
-```
-
-<img src="./img/heatmap.png" width="50%" />
-
-#### 162 matplotlib折线图
-
-模块名称：example_utils.py，里面包括三个函数，各自功能如下：
-
-```python
-import matplotlib.pyplot as plt
-
-# 创建画图fig和axes
-def setup_axes():
-    fig, axes = plt.subplots(ncols=3, figsize=(6.5,3))
-    for ax in fig.axes:
-        ax.set(xticks=[], yticks=[])
-    fig.subplots_adjust(wspace=0, left=0, right=0.93)
-    return fig, axes
-# 图片标题
-def title(fig, text, y=0.9):
-    fig.suptitle(text, size=14, y=y, weight='semibold', x=0.98, ha='right',
-                 bbox=dict(boxstyle='round', fc='floralwhite', ec='#8B7E66',
-                           lw=2))
-# 为数据添加文本注释
-def label(ax, text, y=0):
-    ax.annotate(text, xy=(0.5, 0.00), xycoords='axes fraction', ha='center',
-                style='italic',
-                bbox=dict(boxstyle='round', facecolor='floralwhite',
-                          ec='#8B7E66'))
-```
-
-<img src="./img/matplotlib-line.png" width="60%"/>
-
-```python
-import numpy as np
-import matplotlib.pyplot as plt
-
-import example_utils
-
-x = np.linspace(0, 10, 100)
-
-fig, axes = example_utils.setup_axes()
-for ax in axes:
-    ax.margins(y=0.10)
-
-# 子图1 默认plot多条线，颜色系统分配
-for i in range(1, 6):
-    axes[0].plot(x, i * x)
-
-# 子图2 展示线的不同linestyle
-for i, ls in enumerate(['-', '--', ':', '-.']):
-    axes[1].plot(x, np.cos(x) + i, linestyle=ls)
-
-# 子图3 展示线的不同linestyle和marker
-for i, (ls, mk) in enumerate(zip(['', '-', ':'], ['o', '^', 's'])):
-    axes[2].plot(x, np.cos(x) + i * x, linestyle=ls, marker=mk, markevery=10)
-
-# 设置标题
-# example_utils.title(fig, '"ax.plot(x, y, ...)": Lines and/or markers', y=0.95)
-# 保存图片
-fig.savefig('plot_example.png', facecolor='none')
-# 展示图片
-plt.show()
-```
-
-#### 163 matplotlib散点图
-<img src="./img/matplotlib-scatter" width="60%"/>
-
-对应代码：
-
-```python
-"""
-散点图的基本用法
-"""
-import numpy as np
-import matplotlib.pyplot as plt
-
-import example_utils
-
-# 随机生成数据
-np.random.seed(1874)
-x, y, z = np.random.normal(0, 1, (3, 100))
-t = np.arctan2(y, x)
-size = 50 * np.cos(2 * t)**2 + 10
-
-fig, axes = example_utils.setup_axes()
-
-# 子图1
-axes[0].scatter(x, y, marker='o',  color='darkblue', facecolor='white', s=80)
-example_utils.label(axes[0], 'scatter(x, y)')
-
-# 子图2
-axes[1].scatter(x, y, marker='s', color='darkblue', s=size)
-example_utils.label(axes[1], 'scatter(x, y, s)')
-
-# 子图3
-axes[2].scatter(x, y, s=size, c=z,  cmap='gist_ncar')
-example_utils.label(axes[2], 'scatter(x, y, s, c)')
-
-# example_utils.title(fig, '"ax.scatter(...)": Colored/scaled markers',
-#                     y=0.95)
-fig.savefig('scatter_example.png', facecolor='none')
-
-plt.show()
-```
-
-#### 164 matplotlib柱状图
-
-<img src="./img/1578811155501.png" width="60%"/>
-
-对应代码：
-
-```python
-import numpy as np
-import matplotlib.pyplot as plt
-
-import example_utils
-
-
-def main():
-    fig, axes = example_utils.setup_axes()
-
-    basic_bar(axes[0])
-    tornado(axes[1])
-    general(axes[2])
-
-    # example_utils.title(fig, '"ax.bar(...)": Plot rectangles')
-    fig.savefig('bar_example.png', facecolor='none')
-    plt.show()
-
-# 子图1
-def basic_bar(ax):
-    y = [1, 3, 4, 5.5, 3, 2]
-    err = [0.2, 1, 2.5, 1, 1, 0.5]
-    x = np.arange(len(y))
-    ax.bar(x, y, yerr=err, color='lightblue', ecolor='black')
-    ax.margins(0.05)
-    ax.set_ylim(bottom=0)
-    example_utils.label(ax, 'bar(x, y, yerr=e)')
-
-# 子图2
-def tornado(ax):
-    y = np.arange(8)
-    x1 = y + np.random.random(8) + 1
-    x2 = y + 3 * np.random.random(8) + 1
-    ax.barh(y, x1, color='lightblue')
-    ax.barh(y, -x2, color='salmon')
-    ax.margins(0.15)
-    example_utils.label(ax, 'barh(x, y)')
-
-# 子图3
-def general(ax):
-    num = 10
-    left = np.random.randint(0, 10, num)
-    bottom = np.random.randint(0, 10, num)
-    width = np.random.random(num) + 0.5
-    height = np.random.random(num) + 0.5
-    ax.bar(left, height, width, bottom, color='salmon')
-    ax.margins(0.15)
-    example_utils.label(ax, 'bar(l, h, w, b)')
-
-
-main()
-```
-
-#### 165 matplotlib等高线图
-
-<img src="./img/matplotlib-contour1.png" width="60%"/>
-
-对应代码：
-
-```python
-import matplotlib.pyplot as plt
-import numpy as np
-from matplotlib.cbook import get_sample_data
-
-import example_utils
-
-z = np.load(get_sample_data('bivariate_normal.npy'))
-
-fig, axes = example_utils.setup_axes()
-
-axes[0].contour(z, cmap='gist_earth')
-example_utils.label(axes[0], 'contour')
-
-axes[1].contourf(z, cmap='gist_earth')
-example_utils.label(axes[1], 'contourf')
-
-axes[2].contourf(z, cmap='gist_earth')
-cont = axes[2].contour(z, colors='black')
-axes[2].clabel(cont, fontsize=6)
-example_utils.label(axes[2], 'contourf + contour\n + clabel')
-
-# example_utils.title(fig, '"contour, contourf, clabel": Contour/label 2D data',
-#                     y=0.96)
-fig.savefig('contour_example.png', facecolor='none')
-
-plt.show()
-```
-
-#### 166 imshow图
-
-<img src="./img/matplotlib-imshow.png" width="60%"/>
-
-对应代码：
-
-```python
-import matplotlib.pyplot as plt
-import numpy as np
-from matplotlib.cbook import get_sample_data
-from mpl_toolkits import axes_grid1
-
-import example_utils
-
-
-def main():
-    fig, axes = setup_axes()
-    plot(axes, *load_data())
-    # example_utils.title(fig, '"ax.imshow(data, ...)": Colormapped or RGB arrays')
-    fig.savefig('imshow_example.png', facecolor='none')
-    plt.show()
-
-
-def plot(axes, img_data, scalar_data, ny):
-
-    # 默认线性插值
-    axes[0].imshow(scalar_data, cmap='gist_earth', extent=[0, ny, ny, 0])
-
-    # 最近邻插值
-    axes[1].imshow(scalar_data, cmap='gist_earth', interpolation='nearest',
-                   extent=[0, ny, ny, 0])
-
-    # 展示RGB/RGBA数据
-    axes[2].imshow(img_data)
-
-
-def load_data():
-    img_data = plt.imread(get_sample_data('5.png'))
-    ny, nx, nbands = img_data.shape
-    scalar_data = np.load(get_sample_data('bivariate_normal.npy'))
-    return img_data, scalar_data, ny
-
-
-def setup_axes():
-    fig = plt.figure(figsize=(6, 3))
-    axes = axes_grid1.ImageGrid(fig, [0, 0, .93, 1], (1, 3), axes_pad=0)
-
-    for ax in axes:
-        ax.set(xticks=[], yticks=[])
-    return fig, axes
-
-
-main()
-```
-
-#### 167 pyecharts绘制仪表盘
-
-使用pip install pyecharts 安装，版本为 v1.6，pyecharts绘制仪表盘，只需要几行代码：
-
-```python
-from pyecharts import charts
-
-# 仪表盘
-gauge = charts.Gauge()
-gauge.add('Python小例子', [('Python机器学习', 30), ('Python基础', 70.),
-                        ('Python正则', 90)])
-gauge.render(path="./data/仪表盘.html")
-print('ok')
-```
-
-仪表盘中共展示三项，每项的比例为30%,70%,90%，如下图默认名称显示第一项：Python机器学习，完成比例为30%
-
-<img src="./img/pyecharts-funnel.png" width="40%"/>
-
-#### 168 pyecharts漏斗图
-
-```python
-from pyecharts import options as opts
-from pyecharts.charts import Funnel, Page
-from random import randint
-
-def funnel_base() -> Funnel:
-  c = (
-    Funnel()
-    .add("豪车", [list(z) for z in zip(['宝马', '法拉利', '奔驰', '奥迪', '大众', '丰田', '特斯拉'],
-                 [randint(1, 20) for _ in range(7)])])
-    .set_global_opts(title_opts=opts.TitleOpts(title="豪车漏斗图"))
-  )
-  return c
-funnel_base().render('./img/car_fnnel.html')
-```
-
-以7种车型及某个属性值绘制的漏斗图，属性值大越靠近漏斗的大端。
-
-<img src="./img/1578811483265.png" width="50%"/>
-
-#### 169 pyecharts日历图
-
-```python
-import datetime
-import random
-from pyecharts import options as opts
-from pyecharts.charts import Calendar
-
-def calendar_interval_1() -> Calendar:
-    begin = datetime.date(2019, 1, 1)
-    end = datetime.date(2019, 12, 27)
-    data = [
-        [str(begin + datetime.timedelta(days=i)), random.randint(1000, 25000)]
-        for i in range(0, (end - begin).days + 1, 2)  # 隔天统计
-    ]
-    calendar = (
-      Calendar(init_opts=opts.InitOpts(width="1200px")).add(
-            "", data, calendar_opts=opts.CalendarOpts(range_="2019"))
-        .set_global_opts(
-            title_opts=opts.TitleOpts(title="Calendar-2019年步数统计"),
-            visualmap_opts=opts.VisualMapOpts(
-                max_=25000,
-                min_=1000,
-                orient="horizontal",
-                is_piecewise=True,
-                pos_top="230px",
-                pos_left="100px",
-            ),
-        )
-    )
-    return calendar
-
-calendar_interval_1().render('./img/calendar.html')
-```
-
-绘制2019年1月1日到12月27日的步行数，官方给出的图形宽度`900px`不够，只能显示到9月份，本例使用`opts.InitOpts(width="1200px")`做出微调，并且`visualmap`显示所有步数，每隔一天显示一次：
-
-<img src="./img/1578811543851.png" width="50%"/>
-
-#### 170 pyecharts绘制graph图
-
-```python
-import json
-import os
-from pyecharts import options as opts
-from pyecharts.charts import Graph, Page
-
-def graph_base() -> Graph:
-    nodes = [
-        {"name": "cus1", "symbolSize": 10},
-        {"name": "cus2", "symbolSize": 30},
-        {"name": "cus3", "symbolSize": 20}
-    ]
-    links = []
-    for i in nodes:
-        if i.get('name') == 'cus1':
-            continue
-        for j in nodes:
-            if j.get('name') == 'cus1':
-                continue
-            links.append({"source": i.get("name"), "target": j.get("name")})
-    c = (
-        Graph()
-        .add("", nodes, links, repulsion=8000)
-        .set_global_opts(title_opts=opts.TitleOpts(title="customer-influence"))
-    )
-    return c
-```
-
-构建图，其中客户点1与其他两个客户都没有关系(`link`)，也就是不存在有效边：
-
-<img src="./img/1578811569529.png" width="50%"/>
-
-#### 171 pyecharts水球图
-
-```python
-from pyecharts import options as opts
-from pyecharts.charts import Liquid, Page
-from pyecharts.globals import SymbolType
-
-def liquid() -> Liquid:
-    c = (
-        Liquid()
-        .add("lq", [0.67, 0.30, 0.15])
-        .set_global_opts(title_opts=opts.TitleOpts(title="Liquid"))
-    )
-    return c
-
-liquid().render('./img/liquid.html')
-```
-
-水球图的取值`[0.67, 0.30, 0.15]`表示下图中的`三个波浪线`，一般代表三个百分比:
-
-<img src="./img/liquid.gif" width="50%"/>
-
-#### 172 pyecharts饼图
-
-```python
-from pyecharts import options as opts
-from pyecharts.charts import Pie
-from random import randint
-
-def pie_base() -> Pie:
-    c = (
-        Pie()
-        .add("", [list(z) for z in zip(['宝马', '法拉利', '奔驰', '奥迪', '大众', '丰田', '特斯拉'],
-                                       [randint(1, 20) for _ in range(7)])])
-        .set_global_opts(title_opts=opts.TitleOpts(title="Pie-基本示例"))
-        .set_series_opts(label_opts=opts.LabelOpts(formatter="{b}: {c}"))
-    )
-    return c
-
-pie_base().render('./img/pie_pyecharts.html')
-```
-<img src="./img/20191229105841.png" width="50%"/>
-
-
-#### 173 pyecharts极坐标图
-
-```python
-import random
-from pyecharts import options as opts
-from pyecharts.charts import Page, Polar
-
-def polar_scatter0() -> Polar:
-    data = [(alpha, random.randint(1, 100)) for alpha in range(101)] # r = random.randint(1, 100)
-    print(data)
-    c = (
-        Polar()
-        .add("", data, type_="bar", label_opts=opts.LabelOpts(is_show=False))
-        .set_global_opts(title_opts=opts.TitleOpts(title="Polar"))
-    )
-    return c
-
-polar_scatter0().render('./img/polar.html')
-```
-
-极坐标表示为`(夹角,半径)`，如(6,94)表示夹角为6，半径94的点：
-
-<img src="./img/1578811648010.png" width="50%"/>
-
-#### 174 pyecharts词云图
-
-```python
-from pyecharts import options as opts
-from pyecharts.charts import Page, WordCloud
-from pyecharts.globals import SymbolType
-
-words = [
-    ("Python", 100),
-    ("C++", 80),
-    ("Java", 95),
-    ("R", 50),
-    ("JavaScript", 79),
-    ("C", 65)
-]
-
-def wordcloud() -> WordCloud:
-    c = (
-        WordCloud()
-        # word_size_range: 单词字体大小范围
-        .add("", words, word_size_range=[20, 100], shape='cardioid')
-        .set_global_opts(title_opts=opts.TitleOpts(title="WordCloud"))
-    )
-    return c
-
-wordcloud().render('./img/wordcloud.html')
-```
-
-`("C",65)`表示在本次统计中C语言出现65次
-
-<img src="./img/1578811675413.png" width="50%"/>
-
-#### 175 pyecharts系列柱状图
-
-```python
-from pyecharts import options as opts
-from pyecharts.charts import Bar
-from random import randint
-
-def bar_series() -> Bar:
-    c = (
-        Bar()
-        .add_xaxis(['宝马', '法拉利', '奔驰', '奥迪', '大众', '丰田', '特斯拉'])
-        .add_yaxis("销量", [randint(1, 20) for _ in range(7)])
-        .add_yaxis("产量", [randint(1, 20) for _ in range(7)])
-        .set_global_opts(title_opts=opts.TitleOpts(title="Bar的主标题", subtitle="Bar的副标题"))
-    )
-    return c
-
-bar_series().render('./img/bar_series.html')
-```
-
-<img src="./img/1578811781930.png" width="50%"/>
-
-#### 176 pyecharts热力图
-
-```python
-import random
-from pyecharts import options as opts
-from pyecharts.charts import HeatMap
-
-def heatmap_car() -> HeatMap:
-    x = ['宝马', '法拉利', '奔驰', '奥迪', '大众', '丰田', '特斯拉']
-    y = ['中国','日本','南非','澳大利亚','阿根廷','阿尔及利亚','法国','意大利','加拿大']
-    value = [[i, j, random.randint(0, 100)]
-             for i in range(len(x)) for j in range(len(y))]
-    c = (
-        HeatMap()
-        .add_xaxis(x)
-        .add_yaxis("销量", y, value)
-        .set_global_opts(
-            title_opts=opts.TitleOpts(title="HeatMap"),
-            visualmap_opts=opts.VisualMapOpts(),
-        )
-    )
-    return c
-
-heatmap_car().render('./img/heatmap_pyecharts.html')
-```
-
-热力图描述的实际是三维关系，x轴表示车型，y轴表示国家，每个色块的颜色值代表销量，颜色刻度尺显示在左下角，颜色越红表示销量越大。
-
-<img src="./img/image-20191229101724665.png" width="50%"/>
-
-
-
-#### 178 matplotlib绘制动画
-
-`matplotlib`是python中最经典的绘图包，里面`animation`模块能绘制动画。
-
-首先导入小例子使用的模块：
-```python
-from matplotlib import pyplot as plt
-from matplotlib import animation
-from random import randint, random
-```
-
-生成数据，`frames_count`是帧的个数，`data_count`每个帧的柱子个数
-
-```python
-class Data:
-    data_count = 32
-    frames_count = 2
-
-    def __init__(self, value):
-        self.value = value
-        self.color = (0.5, random(), random()) #rgb
-
-    # 造数据
-    @classmethod
-    def create(cls):
-        return [[Data(randint(1, cls.data_count)) for _ in range(cls.data_count)]
-                for frame_i in range(cls.frames_count)]
-```
-
-绘制动画：`animation.FuncAnimation`函数的回调函数的参数`fi`表示第几帧，注意要调用`axs.cla()`清除上一帧。
-
-```python
-def draw_chart():
-    fig = plt.figure(1, figsize=(16, 9))
-    axs = fig.add_subplot(111)
-    axs.set_xticks([])
-    axs.set_yticks([])
-
-    # 生成数据
-    frames = Data.create()
-
-    def animate(fi):
-        axs.cla()  # clear last frame
-        axs.set_xticks([])
-        axs.set_yticks([])
-        return axs.bar(list(range(Data.data_count)),        # X
-                       [d.value for d in frames[fi]],       # Y
-                       1,                                   # width
-                       color=[d.color for d in frames[fi]]  # color
-                       )
-    # 动画展示
-    anim = animation.FuncAnimation(fig, animate, frames=len(frames))
-    plt.show()
-
-
-draw_chart()
-```
-
-#### 179 pyecharts绘图属性设置方法
-
-昨天一位读者朋友问我`pyecharts`中，y轴如何显示在右侧。先说下如何设置，同时阐述例子君是如何找到找到此属性的。
-
-这是pyecharts中一般的绘图步骤：
-```python
-from pyecharts.faker import Faker
-from pyecharts import options as opts
-from pyecharts.charts import Bar
-from pyecharts.commons.utils import JsCode
-
-def bar_base() -> Bar:
-    c = (
-        Bar()
-        .add_xaxis(Faker.choose())
-        .add_yaxis("商家A", Faker.values())
-        .set_global_opts(title_opts=opts.TitleOpts(title="Bar-基本示例", subtitle="我是副标题"))
-    )
-    return c
-
-bar_base().render('./bar.html')
-```
-那么，如何设置y轴显示在右侧，添加一行代码：
-```python
-.set_global_opts(yaxis_opts=opts.AxisOpts(position='right'))
-```
-也就是：
-```python
-c = (
-        Bar()
-        .add_xaxis(Faker.choose())
-        .add_yaxis("商家A", Faker.values())
-        .set_global_opts(title_opts=opts.TitleOpts(title="Bar-基本示例", subtitle="我是副标题"))
-        .set_global_opts(yaxis_opts=opts.AxisOpts(position='right'))
-    )
-```
-
-如何锁定这个属性，首先应该在set_global_opts函数的参数中找，它一共有以下`11`个设置参数，它们位于模块`charts.py`:
-```python
-title_opts: types.Title = opts.TitleOpts(),
-legend_opts: types.Legend = opts.LegendOpts(),
-tooltip_opts: types.Tooltip = None,
-toolbox_opts: types.Toolbox = None,
-brush_opts: types.Brush = None,
-xaxis_opts: types.Axis = None,
-yaxis_opts: types.Axis = None,
-visualmap_opts: types.VisualMap = None,
-datazoom_opts: types.DataZoom = None,
-graphic_opts: types.Graphic = None,
-axispointer_opts: types.AxisPointer = None,
-```
-因为是设置y轴显示在右侧，自然想到设置参数`yaxis_opts`，因为其类型为`types.Axis`，所以再进入`types.py`，同时定位到`Axis`：
-```python
-Axis = Union[opts.AxisOpts, dict, None]
-```
-Union是pyecharts中可容纳多个类型的并集列表，也就是Axis可能为`opts.AxisOpt`, `dict`, 或`None`三种类型。查看第一个`opts.AxisOpt`类，它共定义以下`25`个参数：
-```python
-type_: Optional[str] = None,
-name: Optional[str] = None,
-is_show: bool = True,
-is_scale: bool = False,
-is_inverse: bool = False,
-name_location: str = "end",
-name_gap: Numeric = 15,
-name_rotate: Optional[Numeric] = None,
-interval: Optional[Numeric] = None,
-grid_index: Numeric = 0,
-position: Optional[str] = None,
-offset: Numeric = 0,
-split_number: Numeric = 5,
-boundary_gap: Union[str, bool, None] = None,
-min_: Union[Numeric, str, None] = None,
-max_: Union[Numeric, str, None] = None,
-min_interval: Numeric = 0,
-max_interval: Optional[Numeric] = None,
-axisline_opts: Union[AxisLineOpts, dict, None] = None,
-axistick_opts: Union[AxisTickOpts, dict, None] = None,
-axislabel_opts: Union[LabelOpts, dict, None] = None,
-axispointer_opts: Union[AxisPointerOpts, dict, None] = None,
-name_textstyle_opts: Union[TextStyleOpts, dict, None] = None,
-splitarea_opts: Union[SplitAreaOpts, dict, None] = None,
-splitline_opts: Union[SplitLineOpts, dict] = SplitLineOpts(),
-```
-观察后尝试参数`position`，结合官档：`https://pyecharts.org/#/zh-cn/global_options?id=axisopts%ef%bc%9a%e5%9d%90%e6%a0%87%e8%bd%b4%e9%85%8d%e7%bd%ae%e9%a1%b9`，介绍x轴设置position时有bottom, top, 所以y轴设置很可能就是left,right.
-
-OK！
-
-#### 180 pyecharts绘图属性设置方法(下)
-
-<img src="./img/pyecharts-bar.png" width="50%"/>
-
-**分步讲解如何配置为上图**
-
-1)柱状图显示效果动画对应控制代码：
-
-```python
-animation_opts=opts.AnimationOpts(
-                    animation_delay=500, animation_easing="cubicOut"
-                )
-```
-2)柱状图显示主题对应控制代码：
-```python
-theme=ThemeType.MACARONS
-```
-3)添加x轴对应的控制代码：
-```python
-add_xaxis( ["草莓", "芒果", "葡萄", "雪梨", "西瓜", "柠檬", "车厘子"]
-```
-4)添加y轴对应的控制代码：
-```python
-add_yaxis("A", Faker.values(),
-```
-5)修改柱间距对应的控制代码：
-```python
-category_gap="50%"
-```
-
-6)A系列柱子是否显示对应的控制代码：
-```python
-is_selected=True
-```
-
-7)A系列柱子颜色渐变对应的控制代码：
-```python
-itemstyle_opts={
-            "normal": {
-                "color": JsCode("""new echarts.graphic.LinearGradient(0, 0, 0, 1, [{
-                    offset: 0,
-                    color: 'rgba(0, 244, 255, 1)'
-                }, {
-                    offset: 1,
-                    color: 'rgba(0, 77, 167, 1)'
-                }], false)"""),
-                "barBorderRadius": [6, 6, 6, 6],
-                "shadowColor": 'rgb(0, 160, 221)',
-            }}
-```
-8)A系列柱子最大和最小值`标记点`对应的控制代码：
-```python
-markpoint_opts=opts.MarkPointOpts(
-                data=[
-                    opts.MarkPointItem(type_="max", name="最大值"),
-                    opts.MarkPointItem(type_="min", name="最小值"),
-                ]
-            )
-```
-9)A系列柱子最大和最小值`标记线`对应的控制代码：
-```python
-markline_opts=opts.MarkLineOpts(
-                data=[
-                    opts.MarkLineItem(type_="min", name="最小值"),
-                    opts.MarkLineItem(type_="max", name="最大值")
-                ]
-            )
-```
-10)柱状图标题对应的控制代码：
-```python
-title_opts=opts.TitleOpts(title="Bar-参数使用例子"
-```
-11)柱状图非常有用的toolbox显示对应的控制代码：
-```python
-toolbox_opts=opts.ToolboxOpts()
-```
-
-12)Y轴显示在右侧对应的控制代码：
-```python
-yaxis_opts=opts.AxisOpts(position="right")
-```
-13)Y轴名称对应的控制代码：
-```python
-yaxis_opts=opts.AxisOpts(,name="Y轴")
-```
-14)数据轴区域放大缩小设置对应的控制代码：
-```python
-datazoom_opts=opts.DataZoomOpts()
-```
-
-**完整代码**
-
-```python
-def bar_border_radius():
-    c = (
-        Bar(init_opts=opts.InitOpts(
-                animation_opts=opts.AnimationOpts(
-                    animation_delay=500, animation_easing="cubicOut"
-                ),
-                theme=ThemeType.MACARONS))
-        .add_xaxis( ["草莓", "芒果", "葡萄", "雪梨", "西瓜", "柠檬", "车厘子"])
-        .add_yaxis("A", Faker.values(),category_gap="50%",markpoint_opts=opts.MarkPointOpts(),is_selected=True)
-        .set_series_opts(itemstyle_opts={
-            "normal": {
-                "color": JsCode("""new echarts.graphic.LinearGradient(0, 0, 0, 1, [{
-                    offset: 0,
-                    color: 'rgba(0, 244, 255, 1)'
-                }, {
-                    offset: 1,
-                    color: 'rgba(0, 77, 167, 1)'
-                }], false)"""),
-                "barBorderRadius": [6, 6, 6, 6],
-                "shadowColor": 'rgb(0, 160, 221)',
-            }}, markpoint_opts=opts.MarkPointOpts(
-                data=[
-                    opts.MarkPointItem(type_="max", name="最大值"),
-                    opts.MarkPointItem(type_="min", name="最小值"),
-                ]
-            ),markline_opts=opts.MarkLineOpts(
-                data=[
-                    opts.MarkLineItem(type_="min", name="最小值"),
-                    opts.MarkLineItem(type_="max", name="最大值")
-                ]
-            ))
-        .set_global_opts(title_opts=opts.TitleOpts(title="Bar-参数使用例子"), toolbox_opts=opts.ToolboxOpts(),yaxis_opts=opts.AxisOpts(position="right",name="Y轴"),datazoom_opts=opts.DataZoomOpts(),)
-        
-    )
-
-    return c
-
-bar_border_radius().render()
-```
-
-#### 181 pyecharts原来可以这样快速入门(上)
-
-最近两天，翻看下`pyecharts`的源码，感叹这个框架写的真棒，思路清晰，设计简洁，通俗易懂，推荐读者们有空也阅读下。
-
-bee君是被pyecharts官档介绍-五个特性所吸引：
-
-1)简洁的 API 设计，使用如丝滑般流畅，支持链式调用;
-
-2)囊括了 30+ 种常见图表，应有尽有;
-
-3)支持主流 Notebook 环境，Jupyter Notebook 和 JupyterLab;
-
-4)可轻松集成至 Flask，Django 等主流 Web 框架;
-
-5)高度灵活的配置项，可轻松搭配出精美的图表
-
-pyecharts 确实也如上面五个特性介绍那样，使用起来非常方便。那么，有些读者不禁好奇会问，pyecharts 是如何做到的？
-
-我们不妨从pyecharts官档`5分钟入门pyecharts`章节开始，由表(最高层函数)及里(底层函数也就是所谓的`源码`)，一探究竟。
-
-
-
-**官方第一个例子**
-
-不妨从官档给出的第一个例子说起，
-
-```python
-from pyecharts.charts import Bar
-
-bar = Bar()
-bar.add_xaxis(["衬衫", "羊毛衫", "雪纺衫", "裤子", "高跟鞋", "袜子"])
-bar.add_yaxis("商家A", [5, 20, 36, 10, 75, 90])
-# render 会生成本地 HTML 文件，默认会在当前目录生成 render.html 文件
-# 也可以传入路径参数，如 bar.render("mycharts.html")
-bar.render()
-```
-
-第一行代码：`from pyecharts.charts import Bar`，先上一张源码中`包的结构图`：
-
-![](./img/pyecharts1.jpg)
-
-`bar.py`模块中定义了类`Bar(RectChart)`，如下所示：
-
-```python
-class Bar(RectChart):
-    """
-    <<< Bar Chart >>>
-
-    Bar chart presents categorical data with rectangular bars
-    with heights or lengths proportional to the values that they represent.
-    """
-```
-
-
-
-这里有读者可能会有以下两个问题：
-
-1)为什么根据图1中的包结构，为什么不这么写：`from pyecharts.charts.basic_charts import Bar`
-
-![](./img/pyechart2.jpg)
-
-
-
-答：请看图2中`__init__.py`模块，文件内容如下，看到导入`charts`包，而非`charts.basic_charts`
-
-```python
-from pyecharts import charts, commons, components, datasets, options, render, scaffold
-from pyecharts._version import __author__, __version__
-```
-
-2)`Bar(RectChart)`是什么意思
-
-答：RectChart是Bar的子类
-
-下面4行代码，很好理解，没有特殊性。
-
-pyecharts主要两个大版本,0.5基版本和1.0基版本，从1.0基版本开始全面支持`链式调用`，bee君也很喜爱这种链式调用模式，代码看起来更加紧凑：
-
-```python
-from pyecharts.charts import Bar
-
-bar = (
-    Bar()
-    .add_xaxis(["衬衫", "羊毛衫", "雪纺衫", "裤子", "高跟鞋", "袜子"])
-    .add_yaxis("商家A", [5, 20, 36, 10, 75, 90])
-)
-bar.render()
-```
-
-实现`链式调用`也没有多难，保证返回类本身`self`即可，如果非要有其他返回对象，那么要提到类内以便被全局共享，
-
-add_xaxis函数返回`self`
-
-```python
-    def add_xaxis(self, xaxis_data: Sequence):
-        self.options["xAxis"][0].update(data=xaxis_data)
-        self._xaxis_data = xaxis_data
-        return self
-```
-
-add_yaxis函数同样返回`self`.
-
-#### 182 pyecharts原来可以这样快速入门(中)
-
-**一切皆options**
-
-pyecharts用起来很爽的另一个重要原因，`参数配置项`封装的非常nice，通过定义一些列基础的配置组件，比如`global_options.py`模块中定义的配置对象有以下`27`个
-
-```python
-    AngleAxisItem,
-    AngleAxisOpts,
-    AnimationOpts,
-    Axis3DOpts,
-    AxisLineOpts,
-    AxisOpts,
-    AxisPointerOpts,
-    AxisTickOpts,
-    BrushOpts,
-    CalendarOpts,
-    DataZoomOpts,
-    Grid3DOpts,
-    GridOpts,
-    InitOpts,
-    LegendOpts,
-    ParallelAxisOpts,
-    ParallelOpts,
-    PolarOpts,
-    RadarIndicatorItem,
-    RadiusAxisItem,
-    RadiusAxisOpts,
-    SingleAxisOpts,
-    TitleOpts,
-    ToolBoxFeatureOpts,
-    ToolboxOpts,
-    TooltipOpts,
-    VisualMapOpts,
-```
-
-#### 183 pyecharts原来可以这样快速入门(下)
-
-**第二个例子**
-
-了解上面的配置对象后，再看官档给出的第二个例子，与第一个例子相比，增加了一行代码：`set_global_opts`函数
-
-```python
-from pyecharts.charts import Bar
-from pyecharts import options as opts
-
-# V1 版本开始支持链式调用
-# 你所看到的格式其实是 `black` 格式化以后的效果
-# 可以执行 `pip install black` 下载使用
-bar = (
-    Bar()
-    .add_xaxis(["衬衫", "羊毛衫", "雪纺衫", "裤子", "高跟鞋", "袜子"])
-    .add_yaxis("商家A", [5, 20, 36, 10, 75, 90])
-    .set_global_opts(title_opts=opts.TitleOpts(title="主标题", subtitle="副标题"))
-    
-bar.render()
-```
-
-`set_global_opts`函数在pyecharts中被高频使用，它定义在底层基础模块`Chart.py`中，它是前面说到的`RectChart`的子类，`Bar`类的孙子类。
-
-浏览下函数的参数：
-
-```python
-def set_global_opts(
-        self,
-        title_opts: types.Title = opts.TitleOpts(),
-        legend_opts: types.Legend = opts.LegendOpts(),
-        tooltip_opts: types.Tooltip = None,
-        toolbox_opts: types.Toolbox = None,
-        brush_opts: types.Brush = None,
-        xaxis_opts: types.Axis = None,
-        yaxis_opts: types.Axis = None,
-        visualmap_opts: types.VisualMap = None,
-        datazoom_opts: types.DataZoom = None,
-        graphic_opts: types.Graphic = None,
-        axispointer_opts: types.AxisPointer = None,
-    ):
-```
-
-以第二个参数`title_opts`为例，说明`pyecharts`中参数赋值的风格。
-
-首先，`title_opts`是`默认参数`，默认值为`opts.TitleOpts()`，这个对象在上一节中，我们提到过，是`global_options.py`模块中定义的`27`个配置对象种的一个。
-
-其次，pyecharts中为了增强代码可读性，参数的类型都显示的给出。此处它的类型为：`types.Title`. 这是什么类型？它的类型不是`TitleOpts`吗？不急，看看Title这个类型的定义：
-
-```python
-Title = Union[opts.TitleOpts, dict]
-```
-
-原来`Title`可能是`opts.TitleOpts`, 也可能是python原生的`dict`. 通过`Union`实现的就是这种`类型效果`。所以这就解释了官档中为什么说也可以使用字典配置参数的问题，如下官档：
-
-```python
-  # 或者直接使用字典参数
-    # .set_global_opts(title_opts={"text": "主标题", "subtext": "副标题"})
-)
-```
-
-最后，真正的关于图表的标题相关的属性都被封装到TitleOpts类中，比如`title`,`subtitle`属性，查看源码，TitleOpts对象还有更多属性：
-
-```python
-class TitleOpts(BasicOpts):
-    def __init__(
-        self,
-        title: Optional[str] = None,
-        title_link: Optional[str] = None,
-        title_target: Optional[str] = None,
-        subtitle: Optional[str] = None,
-        subtitle_link: Optional[str] = None,
-        subtitle_target: Optional[str] = None,
-        pos_left: Optional[str] = None,
-        pos_right: Optional[str] = None,
-        pos_top: Optional[str] = None,
-        pos_bottom: Optional[str] = None,
-        padding: Union[Sequence, Numeric] = 5,
-        item_gap: Numeric = 10,
-        title_textstyle_opts: Union[TextStyleOpts, dict, None] = None,
-        subtitle_textstyle_opts: Union[TextStyleOpts, dict, None] = None,
-    ):
-```
-
-OK. 到此跟随5分钟入门的官档，结合两个例子实现的背后源码，探讨了：
-
-1)与包结构组织相关的`__init__.py`；
-
-2)类的继承关系:Bar->RectChart->Chart；
-
-3)链式调用；
-
-4)重要的参数配置包`options`，以TitleOpts类为例，`set_global_opts`将它装载到Bar类中实现属性自定义。
-
-#### 184 1 分钟学会画 pairplot 图
-
-seaborn 绘图库，基于 matplotlib 开发，提供更高层绘图接口。
-
-学习使用 seaborn 绘制 `pairplot` 图
-
-`pairplot` 图能直观的反映出两两特征间的关系，帮助我们对数据集建立初步印象，更好的完成分类和聚类任务。
-
-使用 skearn 导入经典的 Iris 数据集，共有 150 条记录，4 个特征，target 有三种不同值。如下所示：
-
-```markdown
-     sepal_length  sepal_width  petal_length  petal_width    species
-0             5.1          3.5           1.4          0.2     setosa
-1             4.9          3.0           1.4          0.2     setosa
-2             4.7          3.2           1.3          0.2     setosa
-3             4.6          3.1           1.5          0.2     setosa
-4             5.0          3.6           1.4          0.2     setosa
-..            ...          ...           ...          ...        ...
-145           6.7          3.0           5.2          2.3  virginica
-146           6.3          2.5           5.0          1.9  virginica
-147           6.5          3.0           5.2          2.0  virginica
-148           6.2          3.4           5.4          2.3  virginica
-149           5.9          3.0           5.1          1.8  virginica
-```
-
-使用 seaborn 绘制 `sepal_length`, `petal_length` 两个特征间的关系矩阵：
-
-```python
-from sklearn.datasets import load_iris
-import matplotlib.pyplot as plt
-import seaborn as sns
-from sklearn import tree
-
-sns.set(style="ticks")
-
-df02 = df.iloc[:,[0,2,4]] # 选择一对特征
-sns.pairplot(df02)
-plt.show()
-```
-
-<img src="./img/pairplot2.png" width="30%"/>
-
-设置颜色多显：
-
-```
-sns.pairplot(df02, hue="species")
-plt.show()
-```
-
-绘制所有特征散点矩阵：
-
-```
-sns.pairplot(df, hue="species")
-plt.show()
-```
-
-<img src="./img/pairplot.png" width="50%"/>
-
-### 六、 Python 坑点和工具
-
-#### 185 含单个元素的元组
-
-Python中有些函数的参数类型为元组，其内有1个元素，这样创建是错误的：
-
-```python
-c = (5) # NO!
-```
-
-它实际创建一个整型元素5，必须要在元素后加一个`逗号`:
-
-```python
-c = (5,) # YES!
-```
-
-#### 186 默认参数设为空
-
-含有默认参数的函数，如果类型为容器，且设置为空：
-
-```python
-def f(a,b=[]):  # NO!
-    print(b)
-    return b
-
-ret = f(1)
-ret.append(1)
-ret.append(2)
-# 当再调用f(1)时，预计打印为 []
-f(1)
-# 但是却为 [1,2]
-```
-
-这是可变类型的默认参数之坑，请务必设置此类默认参数为None：
-
-```python
-def f(a,b=None): # YES!
-    pass
-```
-
-#### 187 共享变量未绑定之坑
-
-有时想要多个函数共享一个全局变量，但却在某个函数内试图修改它为局部变量：
-
-```python
-i = 1
-def f():
-    i+=1 #NO!
-    
-def g():
-    print(i)
-```
-
-应该在f函数内显示声明`i`为global变量：
-
-```python
-i = 1
-def f():
-    global i # YES!
-    i+=1
-```
-
-#### 188 lambda自由参数之坑
-
-排序和分组的key函数常使用lambda，表达更加简洁，但是有个坑新手容易掉进去：
-
-```python
-a = [lambda x: x+i for i in range(3)] # NO!
-for f in a:
-    print(f(1))
-# 你可能期望输出：1,2,3
-```
-
-但是实际却输出: 3,3,3. 定义lambda使用的`i`被称为自由参数，它只在调用lambda函数时，值才被真正确定下来，这就犹如下面打印出2，你肯定确信无疑吧。
-
-```python
-a = 0
-a = 1
-a = 2
-def f(a):
-    print(a)
-```
-
-正确做法是转化`自由参数`为lambda函数的`默认参数`：
-
-```python
-a = [lambda x,i=i: x+i for i in range(3)] # YES!
-```
-
-#### 189 各种参数使用之坑
-
-Python强大多变，原因之一在于函数参数类型的多样化。方便的同时，也为使用者带来更多的约束规则。如果不了解这些规则，调用函数时，可能会出现如下一些语法异常：
-
-*(1) SyntaxError: positional argument follows keyword argument*
-
-
-*(2) TypeError: f() missing 1 required keyword-only argument: 'b'*
-
-
-*(3) SyntaxError: keyword argument repeated*
-
-*(4) TypeError: f() missing 1 required positional argument: 'b'*
-
-*(5) TypeError: f() got an unexpected keyword argument 'a'*
-
-*(6) TypeError: f() takes 0 positional arguments but 1 was given*
-
-
-总结主要的参数使用规则
-
-位置参数
-
-`位置参数`的定义：`函数调用`时根据函数定义的参数位（形参）置来传递参数，是最常见的参数类型。
-
-```python
-def f(a):
-  return a
-
-f(1) # 位置参数 
-```
-位置参数不能缺少：
-```python
-def f(a,b):
-  pass
-
-f(1) # TypeError: f() missing 1 required positional argument: 'b'
-```
-
-**规则1：位置参数必须一一对应，缺一不可**
-
-关键字参数
-
-在函数调用时，通过‘键--值’方式为函数形参传值，不用按照位置为函数形参传值。
-
-```python
-def f(a):
-  print(f'a:{a}')
-```
-这么调用，`a`就是关键字参数：
-```python
-f(a=1)
-```
-但是下面调用就不OK:
-```python
-f(a=1,20.) # SyntaxError: positional argument follows keyword argument
-```
-
-**规则2：关键字参数必须在位置参数右边**
-
-
-下面调用也不OK:
-```python
-f(1,width=20.,width=30.) #SyntaxError: keyword argument repeated
-
-```
-
-**规则3：对同一个形参不能重复传值**
-
-
-默认参数
-
-在定义函数时，可以为形参提供默认值。对于有默认值的形参，调用函数时如果为该参数传值，则使用传入的值，否则使用默认值。如下`b`是默认参数：
-```python
-def f(a,b=1):
-  print(f'a:{a}, b:{b}')
-
-```
-
-
-**规则4：无论是函数的定义还是调用，默认参数的定义应该在位置形参右面**
-
-只在定义时赋值一次；默认参数通常应该定义成不可变类型
-
-
-可变位置参数
-
-如下定义的参数a为可变位置参数：
-```python
-def f(*a):
-  print(a)
-```
-调用方法：
-```python
-f(1) #打印结果为元组： (1,)
-f(1,2,3) #打印结果：(1, 2, 3)
-```
-
-但是，不能这么调用：
-```python
-f(a=1) # TypeError: f() got an unexpected keyword argument 'a'
-```
-
-
-可变关键字参数
-
-如下`a`是可变关键字参数：
-```python
-def f(**a):
-  print(a)
-```
-调用方法：
-```python
-f(a=1) #打印结果为字典：{'a': 1}
-f(a=1,b=2,width=3) #打印结果：{'a': 1, 'b': 2, 'width': 3}
-```
-
-但是，不能这么调用：
-```python
-f(1) TypeError: f() takes 0 positional arguments but 1 was given
-```
-
-接下来，单独推送分析一个小例子，综合以上各种参数类型的函数及调用方法，敬请关注。
-
-#### 190 列表删除之坑
-
-删除一个列表中的元素，此元素可能在列表中重复多次：
-
-```python
-def del_item(lst,e):
-    return [lst.remove(i) for i in e if i==e] # NO!
-```
-
-考虑删除这个序列[1,3,3,3,5]中的元素3，结果发现只删除其中两个：
-
-```python
-del_item([1,3,3,3,5],3) # 结果：[1,3,5]
-```
-
-正确做法：
-
-```python
-def del_item(lst,e):
-    d = dict(zip(range(len(lst)),lst)) # YES! 构造字典
-    return [v for k,v in d.items() if v!=e]
-
-```
-
-#### 191 列表快速复制之坑
-
-在python中`*`与列表操作，实现快速元素复制：
-
-```python
-a = [1,3,5] * 3 # [1,3,5,1,3,5,1,3,5]
-a[0] = 10 # [10, 2, 3, 1, 2, 3, 1, 2, 3]
-```
-
-如果列表元素为列表或字典等复合类型：
-
-```python
-a = [[1,3,5],[2,4]] * 3 # [[1, 3, 5], [2, 4], [1, 3, 5], [2, 4], [1, 3, 5], [2, 4]]
-
-a[0][0] = 10 #  
-```
-
-结果可能出乎你的意料，其他`a[1[0]`等也被修改为10
-
-```python
-[[10, 3, 5], [2, 4], [10, 3, 5], [2, 4], [10, 3, 5], [2, 4]]
-```
-
-这是因为*复制的复合对象都是浅引用，也就是说id(a[0])与id(a[2])门牌号是相等的。如果想要实现深复制效果，这么做：
-
-```python
-a = [[] for _ in range(3)]
-```
-
-#### 192 字符串驻留
-```python
-In [1]: a = 'something'
-    ...: b = 'some'+'thing'
-    ...: id(a)==id(b)
-Out[1]: True
-```
-如果上面例子返回`True`，但是下面例子为什么是`False`:
-```python
-In [1]: a = '@zglg.com'
-
-In [2]: b = '@zglg'+'.com'
-
-In [3]: id(a)==id(b)
-Out[3]: False
-```
-这与Cpython 编译优化相关，行为称为`字符串驻留`，但驻留的字符串中只包含字母，数字或下划线。
-
-#### 193 相同值的不可变对象
-```python
-In [5]: d = {}
-    ...: d[1] = 'java'
-    ...: d[1.0] = 'python'
-
-In [6]: d
-Out[6]: {1: 'python'}
-
-### key=1,value=java的键值对神奇消失了
-In [7]: d[1]
-Out[7]: 'python'
-In [8]: d[1.0]
-Out[8]: 'python'
-```
-这是因为具有相同值的不可变对象在Python中始终具有`相同的哈希值`
-
-由于存在`哈希冲突`，不同值的对象也可能具有相同的哈希值。
-
-#### 194 对象销毁顺序
-创建一个类`SE`:
-```python
-class SE(object):
-  def __init__(self):
-    print('init')
-  def __del__(self):
-    print('del')
-```
-创建两个SE实例，使用`is`判断：
-```python
-In [63]: SE() is SE()
-init
-init
-del
-del
-Out[63]: False
-
-```
-创建两个SE实例，使用`id`判断：
-```python
-In [64]: id(SE()) == id(SE())
-init
-del
-init
-del
-Out[64]: True
-```
-
-调用`id`函数, Python 创建一个 SE 类的实例，并使用`id`函数获得内存地址后，销毁内存丢弃这个对象。
-
-当连续两次进行此操作, Python会将相同的内存地址分配给第二个对象，所以两个对象的id值是相同的.
-
-
-但是is行为却与之不同，通过打印顺序就可以看到。
-
-#### 195 充分认识for
-```python
-In [65]: for i in range(5):
-    ...:   print(i)
-    ...:   i = 10
-0
-1
-2
-3
-4
-```
-为什么不是执行一次就退出？
-
-按照for在Python中的工作方式, i = 10 并不会影响循环。range(5)生成的下一个元素就被解包，并赋值给目标列表的变量`i`.
-
-#### 196 认识执行时机
-
-```python
-array = [1, 3, 5]
-g = (x for x in array if array.count(x) > 0)
-```
-`g`为生成器，list(g)后返回`[1,3,5]`，因为每个元素肯定至少都出现一次。所以这个结果这不足为奇。但是，请看下例：
-```python
-array = [1, 3, 5]
-g = (x for x in array if array.count(x) > 0)
-array = [5, 7, 9]
-```
-请问,list(g)等于多少？这不是和上面那个例子结果一样吗，结果也是`[1,3,5]`，但是：
-```python
-In [74]: list(g)
-Out[74]: [5]
-```
-
-这有些不可思议~~ 原因在于：
-
-生成器表达式中, in 子句在声明时执行, 而条件子句则是在运行时执行。
-
-
-所以代码：
-```python
-array = [1, 3, 5]
-g = (x for x in array if array.count(x) > 0)
-array = [5, 7, 9]
-```
-
-等价于：
-```python
-g = (x for x in [1,3,5] if [5,7,9].count(x) > 0)
-```
-
-#### 197 创建空集合错误
-
-这是Python的一个集合：`{1,3,5}`，它里面没有重复元素，在去重等场景有重要应用。下面这样创建空集合是错误的：
-
-```python
-empty = {} #NO!
-```
-
-cpython会解释它为字典
-
-使用内置函数`set()`创建空集合：
-
-```python
-empty = set() #YES!
-```
-
-#### 198 pyecharts传入Numpy数据绘图失败
-
-echarts使用广泛，echarts+python结合后的包：pyecharts，同样可很好用，但是传入Numpy的数据，像下面这样绘图会失败：
-
-```python
-from pyecharts.charts import Bar
-import pyecharts.options as opts
-import numpy as np
-c = (
-    Bar()
-    .add_xaxis([1, 2, 3, 4, 5])
-    # 传入Numpy数据绘图失败！
-    .add_yaxis("商家A", np.array([0.1, 0.2, 0.3, 0.4, 0.5]))
-)
-
-c.render()
-```
-
-<img src="./img/image-20200129164119080.png" width="50%"/>
-
-由此可见pyecharts对Numpy数据绘图不支持，传入原生Python的list:
-
-```python
-from pyecharts.charts import Bar
-import pyecharts.options as opts
-import numpy as np
-c = (
-    Bar()
-    .add_xaxis([1, 2, 3, 4, 5])
-    # 传入Python原生list
-    .add_yaxis("商家A", np.array([0.1, 0.2, 0.3, 0.4, 0.5]).tolist())
-)
-
-c.render()
-```
-
-<img src="./img/image-20200129164339971.png" width="50%"/>
-
-#### 199 优化代码异常输出包
-
-一行代码优化输出的异常信息
-```python
-pip install pretty-errors
-```
-
-写一个函数测试：
-
-```python
-def divided_zero():
-    for i in range(10, -1, -1):
-        print(10/i)
-
-
-divided_zero()
-```
-
-在没有import这个`pretty-errors`前，输出的错误信息有些冗余：
-
-```python
-Traceback (most recent call last):
-  File "c:\Users\HUAWEI\.vscode\extensions\ms-python.python-2019.11.50794\pythonFiles\ptvsd_launcher.py", line 43, in <module>
-    main(ptvsdArgs)
-  File "c:\Users\HUAWEI\.vscode\extensions\ms-python.python-2019.11.50794\pythonFiles\lib\python\old_ptvsd\ptvsd\__main__.py",
-line 432, in main
-    run()
-  File "c:\Users\HUAWEI\.vscode\extensions\ms-python.python-2019.11.50794\pythonFiles\lib\python\old_ptvsd\ptvsd\__main__.py",
-line 316, in run_file
-    runpy.run_path(target, run_name='__main__')
-  File "D:\anaconda3\lib\runpy.py", line 263, in run_path
-    pkg_name=pkg_name, script_name=fname)
-  File "D:\anaconda3\lib\runpy.py", line 96, in _run_module_code
-    mod_name, mod_spec, pkg_name, script_name)
-  File "D:\anaconda3\lib\runpy.py", line 85, in _run_code
-    exec(code, run_globals)
-  File "d:\source\sorting-visualizer-master\sorting\debug_test.py", line 6, in <module>
-    divided_zero()
-  File "d:\source\sorting-visualizer-master\sorting\debug_test.py", line 3, in divided_zero
-    print(10/i)
-ZeroDivisionError: division by zero
-```
-
-我们使用刚安装的`pretty_errors`，`import`下:
-
-```python
-import pretty_errors
-
-def divided_zero():
-    for i in range(10, -1, -1):
-        print(10/i)
-
-divided_zero()
-```
-
-此时看看输出的错误信息，非常精简只有2行，去那些冗余信息：
-
-```python
-ZeroDivisionError:
-division by zero
-```
-
-完整的输出信息如下图片所示：
-
-<img src="./img/errors.png" width="50%"/>
-
-#### 200 图像处理包pillow
-
-两行代码实现旋转和缩放图像
-
-首先安装pillow:
-
-```python
-pip install pillow
-```
-
-旋转图像下面图像45度：
-
-<img src="./img/plotly2.png" width="40%"/>
-
-```python
-In [1]: from PIL import Image
-In [2]: im = Image.open('./img/plotly2.png')
-In [4]: im.rotate(45).show()
-```
-
-旋转45度后的效果图
-
-<img src="./img/image-20200105085120611.png" width="40%"/>
-
-等比例缩放图像：
-
-```python
-im.thumbnail((128,72),Image.ANTIALIAS)
-```
-
-缩放后的效果图：
-
-![](./img/pillow_suofang.png)
-
-
-
-过滤图像后的效果图：
-
-```python
-from PIL import ImageFilter
-im.filter(ImageFilter.CONTOUR).show()
-```
-
-<img src="./img/pillow_filter.png" width="40%"/>
-
-#### 201 一行代码找到编码
-
-兴高采烈地，从网页上抓取一段 `content`
-
-但是，一 `print ` 就不那么兴高采烈了，结果看到一串这个：
-
-```markdown
-b'\xc8\xcb\xc9\xfa\xbf\xe0\xb6\xcc\xa3\xac\xce\xd2\xd3\xc3Python'
-```
-
-这是啥？ 又 x 又 c 的！
-
-再一看，哦，原来是十六进制字节串 (`bytes`)，`\x` 表示十六进制
-
-接下来，你一定想转化为人类能看懂的语言，想到 `decode`：
-
-```python
-In [3]: b'\xc8\xcb\xc9\xfa\xbf\xe0\xb6\xcc\xa3\xac\xce\xd2\xd3\xc3Python'.decode()
-UnicodeDecodeError                        Traceback (most recent call last)
-<ipython-input-3-7d0ea6148880> in <module>
-UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc8 in position 0: invalid continuation byte
-```
-
-马上，一盆冷水泼头上，抛异常了。。。。。
-
-根据提示，`UnicodeDecodeError`，这是 unicode 解码错误。
-
-原来，`decode` 默认的编码方法：`utf-8` 
-
-所以排除  b'\xc8\xcb\xc9\xfa\xbf\xe0\xb6\xcc\xa3\xac\xce\xd2\xd3\xc3Python' 使用 `utf-8` 的编码方式
-
-可是，这不是四选一选择题啊，逐个排除不正确的！
-
-编码方式几十种，不可能逐个排除吧。
-
-那就猜吧！！！！！！！！！！！！！
-
-**人生苦短，我用Python**
-
-**Python， 怎忍心让你受累呢~**
-
-尽量三行代码解决问题
-
-**第一步，安装 chardet**  它是 char detect 的缩写。
-
-**第二步，pip install chardet**
-
-**第三步，出结果**
-
-```python
-In [6]: chardet.detect(b'\xc8\xcb\xc9\xfa\xbf\xe0\xb6\xcc\xa3\xac\xce\xd2\xd3\xc3Python')
-Out[6]: {'encoding': 'GB2312', 'confidence': 0.99, 'language': 'Chinese'}
-```
-
-编码方法：gb2312
-
-解密字节串：
-
-```python
-In [7]: b'\xc8\xcb\xc9\xfa\xbf\xe0\xb6\xcc\xa3\xac\xce\xd2\xd3\xc3Python'.decode('gb2312')
-Out[7]: '人生苦短，我用Python'
-```
-
-目前，`chardet` 包支持的检测编码几十种。
-
-### 八、Python 实战
-
-
-#### 219 环境搭建
-
-区分几个小白容易混淆的概念：pycharm，python解释器，conda安装，pip安装，总结来说：
-
-- `pycharm`是python开发的集成开发环境(Integrated Development Environment，简称IDE)，它本身无法执行Python代码
-- `python解释器`才是真正执行代码的工具，pycharm里可设置Python解释器，一般去python官网下载python3.7或python3.8版本；如果安装过`anaconda`，它里面必然也包括一个某版本的Python解释器；pycharm配置python解释器选择哪一个都可以。
-- anaconda是python常用包的合集，并提供给我们使用`conda`命令非常方便的安装各种Python包。
-- `conda安装`：我们安装过anaconda软件后，就能够使用conda命令下载anaconda源里(比如中科大镜像源)的包
-- `pip安装`：类似于conda安装的python安装包的方法，更加全面
-
-**修改镜像源**
-
-在使用安装`conda` 安装某些包会出现慢或安装失败问题，最有效方法是修改镜像源为国内镜像源。之前都选用清华镜像源，但是2019年后已停止服务。推荐选用中科大镜像源。
-
-先查看已经安装过的镜像源，cmd窗口执行命令：
-
-```python
-conda config --show
-```
-
-查看配置项`channels`，如果显示带有`tsinghua`，则说明已安装过清华镜像。
-
-```python
-channels:
-- https://mirrors.tuna.tsinghua.edu.cn/tensorflow/linux/cpu/
-- https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/msys2/
-- https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/conda-forge/
-- https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/free/
-- https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/pytorch/
-```
-
-下一步，使用`conda config --remove channels url地址 `删除清华镜像，如下命令删除第一个。然后，依次删除所有镜像源
-
-```python
-conda config --remove channels https://mirrors.tuna.tsinghua.edu.cn/tensorflow/linux/cpu/
-```
-
-添加目前可用的中科大镜像源：
-
-```
-conda config --add channels https://mirrors.ustc.edu.cn/anaconda/pkgs/free/
-```
-
-并设置搜索时显示通道地址：
-
-```python
-conda config --set show_channel_urls yes
-```
-
-确认是否安装镜像源成功，执行`conda config --show`，找到`channels`值为如下：
-
-```
-channels:
-  - https://mirrors.ustc.edu.cn/anaconda/pkgs/free/
-  - defaults
-```
-
-Done~
-
-#### 220 pytorch慢到无法安装，怎么办？
-
-**1 安装慢到装不上**
-
-最近几天，后台几个小伙伴问我，无论pip还是conda安装`pytorch`都太慢了，都是安装官方文档去做的，就是超时装不上，无法开展下一步，卡脖子的感觉太不好受。
-
-这些小伙伴按照pytorch官档提示，选择好后，完整复制上面命令`conda install pytorch torchvision cudatoolkit=10.1 -c pytorch`到cmd中，系统是windows.
-
-<img  src="./img/image-20200128182456390.png" width="50%"/>
-
-接下来提示，conda需要安装的包，他们操作选择`y`，继续安装，但是在安装时，发现进度条几乎一动不动。
-
-反复尝试，就是这样，有些无奈，还感叹怎么深度学习的路一开始就TMD的这么难！
-
-**2 这样能正常安装**
-
-无论是安装`cpu`版还是`cuda`版，网上关于这些的参考资料太多了，无非就是cuda硬件和cuda开发包的版本要对应，python版本要对应等，这些bee君觉得都不是事。
-
-就像几位读者朋友遇到的问题，关键还是如何解决`慢到无法装`的问题。
-
-最有效方法是添加镜像源，常见的清华或中科大。
-
-先查看是否已经安装相关镜像源，windows系统在`cmd`窗口中执行命令：
-
-```python
-conda config --show
-```
-
-bee君这里显示：
-```python
-channels:
-  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/pytorch/
-  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/menpo/
-  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/bioconda/
-  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/msys2/
-  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/conda-forge/
-  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main/
-  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/free/
-```
-说明已经安装好清华的镜像源。如果没有安装，请参考下面命令安装源：
-```python
-conda config --add channels https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/pytorch/
-```
-依次安装上面所有的源。
-
-并设置搜索时显示通道地址，执行下面命令：
-
-```python
-conda config --set show_channel_urls yes
-```
-
-**3 最关键一步**
-
-有的读者问我，他们已经都安装好镜像源，但是为什么安装还是龟速？问他们，是用哪个命令，他们回复：`conda install pytorch torchvision cudatoolkit=10.1 -c pytorch`
-
-好吧，执行上面命令，因为命令最后是`-c pytorch`，所以默认还是从conda源下载，新安装的清华等源没有用上。
-
-正确命令：`conda install pytorch torchvision cudatoolkit=10.1`，也就是去掉`-c pytorch`
-
-并且在安装时，也能看到使用了清华源。并且安装速度直线提升，顺利done
-
-**4 测试是否安装成功**
-
-结合官档，执行下面代码，`torch.cuda.is_available()`返回`True`，说明安装cuda成功。
-
-```python
-In [1]: import torch
-
-In [2]: torch.cuda
-Out[2]: <module 'torch.cuda' from 'D:\\Programs\\anaconda\\lib\\site-packages\\torch\\cuda\\__init__.py'>
-
-In [3]: torch.cuda.is_available()
-Out[3]: True
-
-In [4]: from __future__ import print_function
-
-In [5]: x = torch.rand(5,3)
-
-In [6]: print(x)
-tensor([[0.0604, 0.1135, 0.2656],
-        [0.5353, 0.9246, 0.3004],
-        [0.4872, 0.9592, 0.2215],
-        [0.2598, 0.5031, 0.6093],
-        [0.2986, 0.1599, 0.5862]])
-```
-
-这篇文章主要讨论安装`pytorch`慢到不能装的问题及方案，希望对读者朋友们有帮助。
-
-#### 221 自动群发邮件
-
-Python自动群发邮件
-
-```python
-import smtplib
-from email import (header)
-from email.mime import (text, application, multipart)
-import time
-
-def sender_mail():
-    smt_p = smtplib.SMTP()
-    smt_p.connect(host='smtp.qq.com', port=25)
-    sender, password = '113097485@qq.com', "**************"
-    smt_p.login(sender, password)
-    receiver_addresses, count_num = [
-        'guozhennianhua@163.com', 'xiaoxiazi99@163.com'], 1
-    for email_address in receiver_addresses:
-        try:
-            msg = multipart.MIMEMultipart()
-            msg['From'] = "zhenguo"
-            msg['To'] = email_address
-            msg['subject'] = header.Header('这是邮件主题通知', 'utf-8')
-            msg.attach(text.MIMEText(
-                '这是一封测试邮件，请勿回复本邮件~', 'plain', 'utf-8'))
-            smt_p.sendmail(sender, email_address, msg.as_string())
-            time.sleep(10)
-            print('第%d次发送给%s' % (count_num, email_address))
-            count_num = count_num + 1
-        except Exception as e:
-            print('第%d次给%s发送邮件异常' % (count_num, email_address))
-            continue
-    smt_p.quit()
-
-sender_mail()
-```
-
-
-
-注意：
-发送邮箱是qq邮箱，所以要在qq邮箱中设置开启SMTP服务，设置完成时会生成一个授权码，将这个授权码赋值给文中的`password`变量
-
-#### 222 二分搜索
-
-二分搜索是程序员必备的算法，无论什么场合，都要非常熟练地写出来。
-
-小例子描述：
-在**有序数组**`arr`中，指定区间`[left,right]`范围内，查找元素`x`
-如果不存在，返回`-1`
-
-二分搜索`binarySearch`实现的主逻辑
-
-```python
-def binarySearch(arr, left, right, x):
-    while left <= right:
-
-        mid = int(left + (right - left) / 2); # 找到中间位置。求中点写成(left+right)/2更容易溢出，所以不建议这样写
-
-        # 检查x是否出现在位置mid
-        if arr[mid] == x:
-            print('found %d 在索引位置%d 处' %(x,mid))
-            return mid
-
-            # 假如x更大，则不可能出现在左半部分
-        elif arr[mid] < x:
-            left = mid + 1 #搜索区间变为[mid+1,right]
-            print('区间缩小为[%d,%d]' %(mid+1,right))
-
-        # 同理，假如x更小，则不可能出现在右半部分
-        elif x<arr[mid]:
-            right = mid - 1 #搜索区间变为[left,mid-1]
-            print('区间缩小为[%d,%d]' %(left,mid-1))
-
-    # 假如搜索到这里，表明x未出现在[left,right]中
-    return -1
-```
-
-在`Ipython`交互界面中，调用`binarySearch`的小Demo:
-
-```python
-In [8]: binarySearch([4,5,6,7,10,20,100],0,6,5)
-区间缩小为[0,2]
-found 5 at 1
-Out[8]: 1
-
-In [9]: binarySearch([4,5,6,7,10,20,100],0,6,4)
-区间缩小为[0,2]
-区间缩小为[0,0]
-found 4 at 0
-Out[9]: 0
-
-In [10]: binarySearch([4,5,6,7,10,20,100],0,6,20)
-区间缩小为[4,6]
-found 20 at 5
-Out[10]: 5
-
-In [11]: binarySearch([4,5,6,7,10,20,100],0,6,100)
-区间缩小为[4,6]
-区间缩小为[6,6]
-found 100 at 6
-Out[11]: 6
-```
-
-#### 223 爬取天气数据并解析温度值
-
-爬取天气数据并解析温度值
-
-素材来自朋友袁绍，感谢！
-
-爬取的html 结构
-
-<img src="./img/1.png" width="50%"/>
-
-```python
-import requests
-from lxml import etree
-import pandas as pd
-import re
-
-url = 'http://www.weather.com.cn/weather1d/101010100.shtml#input'
-with requests.get(url) as res:
-    content = res.content
-    html = etree.HTML(content)
-```
-
-
-
-通过lxml模块提取值
-
-lxml比beautifulsoup解析在某些场合更高效
-
-```python
-location = html.xpath('//*[@id="around"]//a[@target="_blank"]/span/text()')
-temperature = html.xpath('//*[@id="around"]/div/ul/li/a/i/text()')
-```
-
-结果：
-
-```python
-['香河', '涿州', '唐山', '沧州', '天津', '廊坊', '太原', '石家庄', '涿鹿', '张家口', '保定', '三河', '北京孔庙', '北京国子监', '中国地质博物馆', '月坛公
-园', '明城墙遗址公园', '北京市规划展览馆', '什刹海', '南锣鼓巷', '天坛公园', '北海公园', '景山公园', '北京海洋馆']
-
-['11/-5°C', '14/-5°C', '12/-6°C', '12/-5°C', '11/-1°C', '11/-5°C', '8/-7°C', '13/-2°C', '8/-6°C', '5/-9°C', '14/-6°C', '11/-4°C', '13/-3°C'
-, '13/-3°C', '12/-3°C', '12/-3°C', '13/-3°C', '12/-2°C', '12/-3°C', '13/-3°C', '12/-2°C', '12/-2°C', '12/-2°C', '12/-3°C']
-```
-
-
-构造DataFrame对象
-
-```python
-df = pd.DataFrame({'location':location, 'temperature':temperature})
-print('温度列')
-print(df['temperature'])
-```
-
-正则解析温度值
-
-```python
-df['high'] = df['temperature'].apply(lambda x: int(re.match('(-?[0-9]*?)/-?[0-9]*?°C', x).group(1) ) )
-df['low'] = df['temperature'].apply(lambda x: int(re.match('-?[0-9]*?/(-?[0-9]*?)°C', x).group(1) ) )
-print(df)
-```
-
-详细说明子字符创捕获
-
-除了简单地判断是否匹配之外，正则表达式还有提取子串的强大功能。用`()`表示的就是要提取的分组（group）。比如：`^(\d{3})-(\d{3,8})$`分别定义了两个组，可以直接从匹配的字符串中提取出区号和本地号码
-
-```python
-m = re.match(r'^(\d{3})-(\d{3,8})$', '010-12345')
-print(m.group(0))
-print(m.group(1))
-print(m.group(2))
-
-# 010-12345
-# 010
-# 12345
-```
-
-如果正则表达式中定义了组，就可以在`Match`对象上用`group()`方法提取出子串来。
-
-注意到`group(0)`永远是原始字符串，`group(1)`、`group(2)`……表示第1、2、……个子串。
-
-
-最终结果
-
-```kepython
-Name: temperature, dtype: object
-    location temperature  high  low
-0         香河     11/-5°C    11   -5
-1         涿州     14/-5°C    14   -5
-2         唐山     12/-6°C    12   -6
-3         沧州     12/-5°C    12   -5
-4         天津     11/-1°C    11   -1
-5         廊坊     11/-5°C    11   -5
-6         太原      8/-7°C     8   -7
-7        石家庄     13/-2°C    13   -2
-8         涿鹿      8/-6°C     8   -6
-9        张家口      5/-9°C     5   -9
-10        保定     14/-6°C    14   -6
-11        三河     11/-4°C    11   -4
-12      北京孔庙     13/-3°C    13   -3
-13     北京国子监     13/-3°C    13   -3
-14   中国地质博物馆     12/-3°C    12   -3
-15      月坛公园     12/-3°C    12   -3
-16   明城墙遗址公园     13/-3°C    13   -3
-17  北京市规划展览馆     12/-2°C    12   -2
-18       什刹海     12/-3°C    12   -3
-19      南锣鼓巷     13/-3°C    13   -3
-20      天坛公园     12/-2°C    12   -2
-21      北海公园     12/-2°C    12   -2
-22      景山公园     12/-2°C    12   -2
-23     北京海洋馆     12/-3°C    12   -3
-```
-
-### 十、数据分析
-
-本项目基于Kaggle电影影评数据集，通过这个系列，你将学到如何进行数据探索性分析(EDA)，学会使用数据分析利器`pandas`，会用绘图包`pyecharts`，以及EDA时可能遇到的各种实际问题及一些处理技巧。
-
-
-
-本项目需要导入的包：
-
-```python
-import pandas as pd
-import numpy as np
-import matplotlib.pyplot as plt
-from pyecharts.charts import Bar,Grid,Line
-import pyecharts.options as opts
-from pyecharts.globals import ThemeType
-```
-
-#### 1 创建DataFrame
-pandas中一个dataFrame实例：
-```python
-Out[89]:
-        a  val
-0  apple1  1.0
-1  apple2  2.0
-2  apple3  3.0
-3  apple4  4.0
-4  apple5  5.0
-```
-
-我们的**目标**是变为如下结构：
-```python
-a  apple1  apple2  apple3  apple4  apple5
-0     1.0     2.0     3.0     4.0     5.0
-```
-
-乍看可使用`pivot`，但很难一步到位。
-
-所以另辟蹊径，提供一种简单且好理解的方法：
-
-```python
-In [113]: pd.DataFrame(index=[0],columns=df.a,data=dict(zip(df.a,df.val)))
-Out[113]:
-a  apple1  apple2  apple3  apple4  apple5
-0     1.0     2.0     3.0     4.0     5.0
-```
-以上方法是重新创建一个DataFrame,直接把`df.a`所有可能取值作为新dataframe的列，index调整为`[0]`，注意类型必须是数组类型(array-like 或者 Index)，两个轴确定后，`data`填充数据域。
-
-```python
-In [116]: dict(zip(df.a,df.val))
-Out[116]: {'apple1': 1.0, 'apple2': 2.0, 'apple3': 3.0, 'apple4': 4.0, 'apple5': 5.0}
-```
-
-
-
-#### 2 导入数据
-数据来自kaggle，共包括三个文件：
-
-1. movies.dat
-2. ratings.dat
-3. users.dat
-
-`movies.dat`包括三个字段：['Movie ID', 'Movie Title', 'Genre']
-
-使用pandas导入此文件：
-
-```python
-import pandas as pd
-
-movies = pd.read_csv('./data/movietweetings/movies.dat', delimiter='::', engine='python', header=None, names = ['Movie ID', 'Movie Title', 'Genre'])
-```
-
-导入后，显示前5行：
-
-```python
-   Movie ID                                        Movie Title  \
-0         8      Edison Kinetoscopic Record of a Sneeze (1894)   
-1        10                La sortie des usines Lumi猫re (1895)   
-2        12                      The Arrival of a Train (1896)   
-3        25  The Oxford and Cambridge University Boat Race ...   
-4        91                         Le manoir du diable (1896)   
-5       131                           Une nuit terrible (1896)   
-6       417                      Le voyage dans la lune (1902)   
-7       439                     The Great Train Robbery (1903)   
-8       443        Hiawatha, the Messiah of the Ojibway (1903)   
-9       628                    The Adventures of Dollie (1908)  
-                                          Genre  
-0                             Documentary|Short  
-1                             Documentary|Short  
-2                             Documentary|Short  
-3                                           NaN  
-4                                  Short|Horror  
-5                           Short|Comedy|Horror  
-6  Short|Action|Adventure|Comedy|Fantasy|Sci-Fi  
-7                    Short|Action|Crime|Western  
-8                                           NaN  
-9                                  Action|Short  
-```
-
-
-
-次导入其他两个数据文件
-
-`users.dat`:
-
-```python
-users = pd.read_csv('./data/movietweetings/users.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Twitter ID'])
-print(users.head())
-```
-
-结果：
-
-```python
-   User ID  Twitter ID
-0        1   397291295
-1        2    40501255
-2        3   417333257
-3        4   138805259
-4        5  2452094989
-5        6   391774225
-6        7    47317010
-7        8    84541461
-8        9  2445803544
-9       10   995885060
-```
-
-
-
-`rating.data`:
-
-```python
-ratings = pd.read_csv('./data/movietweetings/ratings.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Movie ID', 'Rating', 'Rating Timestamp'])
-print(ratings.head())
-```
-
-结果：
-
-```python
-   User ID  Movie ID  Rating  Rating Timestamp
-0        1    111161      10        1373234211
-1        1    117060       7        1373415231
-2        1    120755       6        1373424360
-3        1    317919       6        1373495763
-4        1    454876      10        1373621125
-5        1    790724       8        1374641320
-6        1    882977       8        1372898763
-7        1   1229238       9        1373506523
-8        1   1288558       5        1373154354
-9        1   1300854       8        1377165712
-```
-
- **read_csv 使用说明**
-
-说明，本次导入`dat`文件使用`pandas.read_csv`函数。
-
-第一个位置参数`./data/movietweetings/ratings.dat` 表示文件的相对路径
-
-第二个关键字参数：`delimiter='::'`，表示文件分隔符使用`::`
-
-后面几个关键字参数分别代表使用的引擎，文件没有表头，所以`header`为`None;`
-
-导入后dataframe的列名使用`names`关键字设置，这个参数大家可以记住，比较有用。
-
-
-
-Kaggle电影数据集第一节，我们使用数据处理利器 `pandas`， 函数`read_csv` 导入给定的三个数据文件。
-
-```python
-import pandas as pd
-
-movies = pd.read_csv('./data/movietweetings/movies.dat', delimiter='::', engine='python', header=None, names = ['Movie ID', 'Movie Title', 'Genre'])
-users = pd.read_csv('./data/movietweetings/users.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Twitter ID'])
-ratings = pd.read_csv('./data/movietweetings/ratings.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Movie ID', 'Rating', 'Rating Timestamp'])
-```
-
-用到的`read_csv`，某些重要的参数，如何使用在上一节也有所提到。下面开始数据探索分析(EDA)
-
-> 找出得分前10喜剧(comedy)
-
-
-
-#### 3 处理组合值
-
-表`movies`字段`Genre`表示电影的类型，可能有多个值，分隔符为`|`，取值也可能为`None`.
-
-针对这类字段取值，可使用Pandas中Series提供的`str`做一步转化，**注意它是向量级的**，下一步，如Python原生的`str`类似，使用`contains`判断是否含有`comedy`字符串：
-
-```python
-mask = movies.Genre.str.contains('comedy',case=False,na=False)
-```
-
-注意使用的两个参数：`case`, `na`
-
-case为 False，表示对大小写不敏感；
-na Genre列某个单元格为`NaN`时，我们使用的充填值，此处填充为`False`
-
-返回的`mask`是一维的`Series`，结构与 movies.Genre相同，取值为True 或 False.
-
-观察结果：
-
-```python
-0    False
-1    False
-2    False
-3    False
-4    False
-5     True
-6     True
-7    False
-8    False
-9    False
-Name: Genre, dtype: bool
-
-```
-
-
- #### 4 访问某列
-
-得到掩码mask后，pandas非常方便地能提取出目标记录：
-
-```python
-comedy = movies[mask]
-comdey_ids = comedy['Movie ID']
-
-```
-
-以上，在pandas中被最频率使用，不再解释。看结果`comedy_ids.head()`：
-
-```python
-5      131
-6      417
-15    2354
-18    3863
-19    4099
-20    4100
-21    4101
-22    4210
-23    4395
-25    4518
-Name: Movie ID, dtype: int64
-
-```
-
-
-
-1-4介绍`数据读入`，`处理组合值`，`索引数据`等, pandas中使用较多的函数，基于Kaggle真实电影影评数据集，最后得到所有`喜剧 ID`：
-
-```python
-5      131
-6      417
-15    2354
-18    3863
-19    4099
-20    4100
-21    4101
-22    4210
-23    4395
-25    4518
-Name: Movie ID, dtype: int64
-
-```
-
-下面继续数据探索之旅~
-
-#### 5 连接两个表
-
-拿到所有喜剧的ID后，要想找出其中平均得分最高的前10喜剧，需要关联另一张表：`ratings`:
-
-再回顾下ratings表结构：
-
-```python
-   User ID  Movie ID  Rating  Rating Timestamp
-0        1    111161      10        1373234211
-1        1    117060       7        1373415231
-2        1    120755       6        1373424360
-3        1    317919       6        1373495763
-4        1    454876      10        1373621125
-5        1    790724       8        1374641320
-6        1    882977       8        1372898763
-7        1   1229238       9        1373506523
-8        1   1288558       5        1373154354
-9        1   1300854       8        1377165712
-
-```
-
-
-pandas 中使用`join`关联两张表，连接字段是`Movie ID`，如果顺其自然这么使用`join`：
-
-```python
-combine = ratings.join(comedy, on='Movie ID', rsuffix='2')
-
-```
-
-左右滑动，查看完整代码
-
-大家可验证这种写法，仔细一看，会发现结果非常诡异。
-
-究其原因，这是pandas join函数使用的一个算是坑点，它在官档中介绍，连接右表时，此处右表是`comedy`，它的`index`要求是连接字段，也就是 `Movie ID`. 
-
-左表的index不要求，但是要在参数 `on`中给定。
-
-**以上是要注意的一点**
-
-修改为：
-
-```python
-combine = ratings.join(comedy.set_index('Movie ID'), on='Movie ID')
-print(combine.head(10))
-
-```
-
-以上是OK的写法
-
-观察结果：
-
-```python
-   User ID  Movie ID  Rating  Rating Timestamp Movie Title Genre
-0        1    111161      10        1373234211         NaN   NaN
-1        1    117060       7        1373415231         NaN   NaN
-2        1    120755       6        1373424360         NaN   NaN
-3        1    317919       6        1373495763         NaN   NaN
-4        1    454876      10        1373621125         NaN   NaN
-5        1    790724       8        1374641320         NaN   NaN
-6        1    882977       8        1372898763         NaN   NaN
-7        1   1229238       9        1373506523         NaN   NaN
-8        1   1288558       5        1373154354         NaN   NaN
-9        1   1300854       8        1377165712         NaN   NaN
-
-```
-
-Genre列为`NaN`表明，这不是喜剧。需要筛选出此列不为`NaN` 的记录。
-
-#### 6 按列筛选
-
-pandas最方便的地方，就是向量化运算，尽可能减少了for循环的嵌套。
-
-按列筛选这种常见需求，自然可以轻松应对。
-
-为了照顾初次接触 pandas 的朋友，分两步去写：
-
-```python
-mask = pd.notnull(combine['Genre'])
-
-```
-
-结果是一列只含`True 或 False`的值
-
-```python
-result = combine[mask]
-print(result.head())
-
-```
-
-结果中，Genre字段中至少含有一个Comedy字符串，表明验证了我们以上操作是OK的。
-
-```python
-    User ID  Movie ID  Rating  Rating Timestamp             Movie Title  \
-12        1   1588173       9        1372821281      Warm Bodies (2013)   
-13        1   1711425       3        1372604878        21 & Over (2013)   
-14        1   2024432       8        1372703553   Identity Thief (2013)   
-17        1   2101441       1        1372633473  Spring Breakers (2012)   
-28        2   1431045       7        1457733508         Deadpool (2016)   
-
-                             Genre  
-12           Comedy|Horror|Romance  
-13                          Comedy  
-14    Adventure|Comedy|Crime|Drama  
-17              Comedy|Crime|Drama  
-28  Action|Adventure|Comedy|Sci-Fi  
-
-
-```
-
-
-
-截止目前已经求出所有喜剧电影`result`，前5行如下，Genre中都含有`Comedy`字符串：
-```python
-    User ID  Movie ID  Rating  Rating Timestamp             Movie Title  \
-12        1   1588173       9        1372821281      Warm Bodies (2013)   
-13        1   1711425       3        1372604878        21 & Over (2013)   
-14        1   2024432       8        1372703553   Identity Thief (2013)   
-17        1   2101441       1        1372633473  Spring Breakers (2012)   
-28        2   1431045       7        1457733508         Deadpool (2016)   
-
-                             Genre  
-12           Comedy|Horror|Romance  
-13                          Comedy  
-14    Adventure|Comedy|Crime|Drama  
-17              Comedy|Crime|Drama  
-28  Action|Adventure|Comedy|Sci-Fi  
-```
-
-
-
-#### 7 按照Movie ID 分组
-
-result中会有很多观众对同一部电影的打分，所以要求得分前10的喜剧，先按照`Movie ID`分组，然后求出平均值：
-```python
-score_as_movie = result.groupby('Movie ID').mean()
-```
-
-前5行显示如下：
-```python
-               User ID  Rating  Rating Timestamp
-Movie ID                                        
-131       34861.000000     7.0      1.540639e+09
-417       34121.409091     8.5      1.458680e+09
-2354       6264.000000     8.0      1.456343e+09
-3863      43803.000000    10.0      1.430439e+09
-4099      25084.500000     7.0      1.450323e+09
-```
-
-#### 8 按照电影得分排序
-
-```python
-score_as_movie.sort_values(by='Rating', ascending = False,inplace=True)
-score_as_movie
-```
-前5行显示如下：
-```python
-	User ID	Rating	Rating Timestamp
-Movie ID			
-7134690	30110.0	10.0	1.524974e+09
-416889	1319.0	10.0	1.543320e+09
-57840	23589.0	10.0	1.396802e+09
-5693562	50266.0	10.0	1.511024e+09
-5074	43803.0	10.0	1.428352e+09
-```
-都是满分？这有点奇怪，会不会这些电影都只有几个人评分，甚至只有1个？评分样本个数太少，显然最终的平均分数不具有太强的说服力。
-
-所以，下面要进行每部电影的评分人数统计
-
-#### 9 分组后使用聚合函数
-
-根据`Movie ID`分组后，使用`count`函数统计`每组个数`，只保留count列，最后得到`watchs2`:
-
-```python
-watchs = result.groupby('Movie ID').agg(['count'])
-watchs2 = watchs['Rating']['count']
-```
-打印前20行：
-```python
-print(watchs2.head(20))
-```
-结果：
-```python
-Movie ID
-131      1
-417     22
-2354     1
-3863     1
-4099     2
-4100     1
-4101     1
-4210     1
-4395     1
-4518     1
-4546     2
-4936     2
-5074     1
-5571     1
-6177     1
-6414     3
-6684     1
-6689     1
-7145     1
-7162     2
-Name: count, dtype: int64
-```
-果然，竟然有这么多电影的评论数只有1次！样本个数太少，评论的平均值也就没有什么说服力。
-
-查看`watchs2`一些重要统计量：
-```python
-watchs2.describe()
-```
-结果：
-```python
-count    10740.000000
-mean        20.192086
-std         86.251411
-min          1.000000
-25%          1.000000
-50%          2.000000
-75%          7.000000
-max       1843.000000
-Name: count, dtype: float64
-```
-共有10740部**喜剧**电影被评分，平均打分次数20次，标准差86，75%的电影样本打分次数小于7次，最小1次，最多1843次。
-
-#### 10 频率分布直方图
-
-绘制评论数的频率分布直方图，便于更直观的观察电影被评论的分布情况。上面分析到，75%的电影打分次数小于7次，所以绘制打分次数小于20次的直方图：
-
-```python
-fig = plt.figure(figsize=(12,8))
-histn = plt.hist(watchs2[watchs2 <=19],19,histtype='step')
-plt.scatter([i+1 for i in range(len(histn[0]))],histn[0])
-```
-
-![](./img/20200131094927.jpg)
-
-`histn`元祖表示个数和对应的被分割的区间，查看`histn[0]`:
-```python
-array([4383., 1507.,  787.,  541.,  356.,  279.,  209.,  163.,  158.,
-        118.,  114.,   90.,  104.,   81.,   80.,   73.,   62.,   65.,
-         52.])
-```
-```python
-sum(histn[0]) # 9222
-```
-看到电影评论次数1到19次的喜剧电影9222部，共有10740部喜剧电影，大约`86%`的喜剧电影评论次数`小于20次`，有`1518`部电影评论数不小于20次。
-
-我们肯定希望挑选出被评论次数尽可能多的电影，因为难免会有水军和滥竽充数等`异常评论`行为。那么，如何准确的量化最小抽样量呢？
-
-
-
-#### 11 最小抽样量
-
-根据统计学的知识，最小抽样量和Z值、样本方差和样本误差相关，下面给出具体的求解最小样本量的计算方法。
-
-采用如下计算公式：
-
-$$ n = \frac{Z^2\sigma^2}{E^2} $$
-
-
-此处，$Z$ 值取为95%的置信度对应的Z值也就是1.96，样本误差取为均值的2.5%.
-
-根据以上公式，编写下面代码：
-
-```python
-n3 = result.groupby('Movie ID').agg(['count','mean','std'])
-n3r = n3[n3['Rating']['count']>=20]['Rating']
-```
-只计算影评超过20次，且满足最小样本量的电影。计算得到的`n3r`前5行：
-```python
-	count	mean	std
-Movie ID			
-417	22	8.500000	1.263027
-12349	68	8.485294	1.227698
-15324	20	8.350000	1.039990
-15864	51	8.431373	1.374844
-17925	44	8.636364	1.259216
-```
-进一步求出最小样本量：
-```python
-nmin = (1.96**2*n3r['std']**2) / ( (n3r['mean']*0.025)**2 )
-```
-`nmin`前5行：
-```python
-Movie ID
-417         135.712480
-12349       128.671290
-15324        95.349276
-15864       163.434005
-17925       130.668350
-```
-
-筛选出满足最小抽样量的喜剧电影：
-
-```python
-n3s = n3r[ n3r['count'] >= nmin ]
-```
-结果显示如下，因此共有`173`部电影满足最小样本抽样量。
-
-```python
-
-count	mean	std
-Movie ID			
-53604	129	8.635659	1.230714
-57012	207	8.449275	1.537899
-70735	224	8.839286	1.190799
-75686	209	8.095694	1.358885
-88763	296	8.945946	1.026984
-...	...	...	...
-6320628	860	7.966279	1.469924
-6412452	276	7.510870	1.389529
-6662050	22	10.000000	0.000000
-6966692	907	8.673649	1.286455
-7131622	1102	7.851180	1.751500
-173 rows × 3 columns
-```
-
-#### 12 去重和连表
-
-按照平均得分从大到小排序：
-```python
-n3s_sort = n3s.sort_values(by='mean',ascending=False)
-```
-结果：
-```python
-	count	mean	std
-Movie ID			
-6662050	22	10.000000	0.000000
-4921860	48	10.000000	0.000000
-5262972	28	10.000000	0.000000
-5512872	353	9.985836	0.266123
-3863552	199	9.010050	1.163372
-...	...	...	...
-1291150	647	6.327666	1.785968
-2557490	546	6.307692	1.858434
-1478839	120	6.200000	0.728761
-2177771	485	6.150515	1.523922
-1951261	1091	6.083410	1.736127
-173 rows × 3 columns
-```
-仅靠`Movie ID`还是不知道哪些电影，连接`movies`表：
-```python
-ms = movies.drop_duplicates(subset=['Movie ID'])
-ms = ms.set_index('Movie ID')
-n3s_final = n3s_drops.join(ms,on='Movie ID')
-```
-
-#### 13 结果分析
-
-喜剧榜单前50名：
-```python
-Movie Title
-Five Minutes (2017)
-MSG 2 the Messenger (2015)
-Avengers: Age of Ultron Parody (2015)
-Be Somebody (2016)
-Bajrangi Bhaijaan (2015)
-Back to the Future (1985)
-La vita 鐚?bella (1997)
-The Intouchables (2011)
-The Sting (1973)
-Coco (2017)
-Toy Story 3 (2010)
-3 Idiots (2009)
-Green Book (2018)
-Dead Poets Society (1989)
-The Apartment (1960)
-P.K. (2014)
-The Truman Show (1998)
-Am鑼卨ie (2001)
-Inside Out (2015)
-Toy Story 4 (2019)
-Toy Story (1995)
-Finding Nemo (2003)
-Dr. Strangelove or: How I Learned to Stop Worrying and Love the Bomb (1964)
-Home Alone (1990)
-Zootopia (2016)
-Up (2009)
-Monsters, Inc. (2001)
-La La Land (2016)
-Relatos salvajes (2014)
-En man som heter Ove (2015)
-Snatch (2000)
-Lock, Stock and Two Smoking Barrels (1998)
-How to Train Your Dragon 2 (2014)
-As Good as It Gets (1997)
-Guardians of the Galaxy (2014)
-The Grand Budapest Hotel (2014)
-Fantastic Mr. Fox (2009)
-Silver Linings Playbook (2012)
-Sing Street (2016)
-Deadpool (2016)
-Annie Hall (1977)
-Pride (2014)
-In Bruges (2008)
-Big Hero 6 (2014)
-Groundhog Day (1993)
-The Breakfast Club (1985)
-Little Miss Sunshine (2006)
-Deadpool 2 (2018)
-The Terminal (2004)
-```
-
-前10名评论数图：
-
-![](./img/2020013109495711.jpg)
-
-代码：
-```python
-x = n3s_final['Movie Title'][:10].tolist()[::-1]
-y = n3s_final['count'][:10].tolist()[::-1]
-bar = (
-    Bar()
-    .add_xaxis(x)
-    .add_yaxis('评论数',y,category_gap='50%')
-    .reversal_axis()
-    .set_global_opts(title_opts=opts.TitleOpts(title="喜剧电影被评论次数"),
-                    toolbox_opts=opts.ToolboxOpts(),)
-)
-grid = (
-    Grid(init_opts=opts.InitOpts(theme=ThemeType.LIGHT))
-    .add(bar, grid_opts=opts.GridOpts(pos_left="30%"))
-)
-grid.render_notebook()
-```
-
-前10名得分图：
-
-![](./img/2020013109500812.jpg)
-
-代码：
-```python
-x = n3s_final['Movie Title'][:10].tolist()[::-1]
-y = n3s_final['mean'][:10].round(3).tolist()[::-1]
-bar = (
-    Bar()
-    .add_xaxis(x)
-    .add_yaxis('平均得分',y,category_gap='50%')
-    .reversal_axis()
-    .set_global_opts(title_opts=opts.TitleOpts(title="喜剧电影平均得分"),
-                    xaxis_opts=opts.AxisOpts(min_=8.0,name='平均得分'),
-                    toolbox_opts=opts.ToolboxOpts(),)
-)
-grid = (
-    Grid(init_opts=opts.InitOpts(theme=ThemeType.MACARONS))
-    .add(bar, grid_opts=opts.GridOpts(pos_left="30%"))
-)
-grid.render_notebook()
-```
-
-
-
-#### 14 生成哑变量
-
-分类变量的数值化，是指将枚举类变量转化为indicator变量或称dummy变量。
-
-那么什么是`indicator变量`，看看如下例子，A变量解析为：`[1,0,0]`, B解析为：`[0,1,0]`, C解析为：`[0,0,1]`
-```python
-In [8]: s = pd.Series(list('ABCA'))
-In [9]: pd.get_dummies(s)
-Out[9]:
-   A  B  C
-0  1  0  0
-1  0  1  0
-2  0  0  1
-3  1  0  0
-```
-
-如果输入的字符有4个唯一值，看到字符a被解析为[1,0,0,0]，向量长度为4.
-
-```python
-In [5]: s = pd.Series(list('abaccd'))
-In [6]: pd.get_dummies(s)
-Out[6]:
-   a  b  c  d
-0  1  0  0  0
-1  0  1  0  0
-2  1  0  0  0
-3  0  0  1  0
-4  0  0  1  0
-5  0  0  0  1
-```
-
-也就是说dummy向量的长度等于输入字符串中，唯一字符的个数。
-
-#### 15 讨厌的SettingWithCopyWarning！！！
-
-Pandas 处理数据，太好用了，谁用谁知道！
-
-使用过 Pandas 的，几乎都会遇到一个警告：
-
-*SettingWithCopyWarning*
-
-非常烦人！
-
-尤其是刚接触 Pandas 的，完全不理解为什么弹出这么一串：
-
-```python
-d:\source\test\settingwithcopy.py:9: SettingWithCopyWarning:
-A value is trying to be set on a copy of a slice from a DataFrame.
-Try using .loc[row_indexer,col_indexer] = value instead
-
-See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy 
-```
-
-归根结底，是因为代码中出现`链式操作`...
-
-有人就问了，什么是`链式操作`?
-
-这样的：
-
-```python
-tmp = df[df.a<4]
-tmp['c'] = 200
-```
-
-先记住这个最典型的情况，即可！
-
-有的人就问了：出现这个 Warning, 需要理会它吗？ 
-
-如果结果不对，当然要理会；如果结果对，不care.
-
-举个例子~~
-
-```python
-import pandas as  pd
-
-df = pd.DataFrame({'a':[1,3,5],'b':[4,2,7]},index=['a','b','c'])
-df.loc[df.a<4,'c'] = 100
-print(df)
-print('it\'s ok')
-
-tmp = df[df.a<4]
-tmp['c'] = 200
-print('-----tmp------')
-print(tmp)
-print('-----df-------')
-print(df)
-```
-
-输出结果：
-```python
-   a  b      c
-a  1  4  100.0
-b  3  2  100.0
-c  5  7    NaN
-it's ok
-d:\source\test\settingwithcopy.py:9: SettingWithCopyWarning:
-A value is trying to be set on a copy of a slice from a DataFrame.
-Try using .loc[row_indexer,col_indexer] = value instead
-
-See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy     
-  tmp['c'] = 200
------tmp------
-   a  b    c
-a  1  4  200
-b  3  2  200
------df-------
-   a  b      c
-a  1  4  100.0
-b  3  2  100.0
-c  5  7    NaN
-```
-
-it's ok 行后面的发生链式赋值，导致结果错误。因为 tmp 变了，df 没赋上值啊，所以必须理会。
-
-it's ok 行前的是正解。
-
-以上，链式操作尽量避免，如何避免？多使用 `.loc[row_indexer,col_indexer]`，提示告诉我们的~
-
-#### 16 NumPy 数据归一化、分布可视化
-
-仅使用 `NumPy`，下载数据，归一化，使用 `seaborn` 展示数据分布。
-
-**下载数据**
-
-```python
-import numpy as np
-
-url = 'https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data'
-wid = np.genfromtxt(url, delimiter=',', dtype='float', usecols=[1])
-```
-仅提取 `iris` 数据集的第二列 `usecols = [1]`
-
-**展示数据**
-
-```python
-array([3.5, 3. , 3.2, 3.1, 3.6, 3.9, 3.4, 3.4, 2.9, 3.1, 3.7, 3.4, 3. ,
-       3. , 4. , 4.4, 3.9, 3.5, 3.8, 3.8, 3.4, 3.7, 3.6, 3.3, 3.4, 3. ,
-       3.4, 3.5, 3.4, 3.2, 3.1, 3.4, 4.1, 4.2, 3.1, 3.2, 3.5, 3.1, 3. ,
-       3.4, 3.5, 2.3, 3.2, 3.5, 3.8, 3. , 3.8, 3.2, 3.7, 3.3, 3.2, 3.2,
-       3.1, 2.3, 2.8, 2.8, 3.3, 2.4, 2.9, 2.7, 2. , 3. , 2.2, 2.9, 2.9,
-       3.1, 3. , 2.7, 2.2, 2.5, 3.2, 2.8, 2.5, 2.8, 2.9, 3. , 2.8, 3. ,
-       2.9, 2.6, 2.4, 2.4, 2.7, 2.7, 3. , 3.4, 3.1, 2.3, 3. , 2.5, 2.6,
-       3. , 2.6, 2.3, 2.7, 3. , 2.9, 2.9, 2.5, 2.8, 3.3, 2.7, 3. , 2.9,
-       3. , 3. , 2.5, 2.9, 2.5, 3.6, 3.2, 2.7, 3. , 2.5, 2.8, 3.2, 3. ,
-       3.8, 2.6, 2.2, 3.2, 2.8, 2.8, 2.7, 3.3, 3.2, 2.8, 3. , 2.8, 3. ,
-       2.8, 3.8, 2.8, 2.8, 2.6, 3. , 3.4, 3.1, 3. , 3.1, 3.1, 3.1, 2.7,
-       3.2, 3.3, 3. , 2.5, 3. , 3.4, 3. ])
-      
-```
-
-这是单变量(univariate)长度为 150 的一维 NumPy 数组。
-
-**归一化**
-
-求出最大值、最小值
-```python
-smax = np.max(wid)
-smin = np.min(wid)
-
-In [51]: smax,smin
-Out[51]: (4.4, 2.0)
-````
-归一化公式：
-```python
-s = (wid - smin) / (smax - smin)
-```
-只打印小数点后三位设置：
-```python
-np.set_printoptions(precision=3)  
-```
-
-归一化结果：
-```markdown
-array([0.625, 0.417, 0.5  , 0.458, 0.667, 0.792, 0.583, 0.583, 0.375,
-       0.458, 0.708, 0.583, 0.417, 0.417, 0.833, 1.   , 0.792, 0.625,
-       0.75 , 0.75 , 0.583, 0.708, 0.667, 0.542, 0.583, 0.417, 0.583,
-       0.625, 0.583, 0.5  , 0.458, 0.583, 0.875, 0.917, 0.458, 0.5  ,
-       0.625, 0.458, 0.417, 0.583, 0.625, 0.125, 0.5  , 0.625, 0.75 ,
-       0.417, 0.75 , 0.5  , 0.708, 0.542, 0.5  , 0.5  , 0.458, 0.125,
-       0.333, 0.333, 0.542, 0.167, 0.375, 0.292, 0.   , 0.417, 0.083,
-       0.375, 0.375, 0.458, 0.417, 0.292, 0.083, 0.208, 0.5  , 0.333,
-       0.208, 0.333, 0.375, 0.417, 0.333, 0.417, 0.375, 0.25 , 0.167,
-       0.167, 0.292, 0.292, 0.417, 0.583, 0.458, 0.125, 0.417, 0.208,
-       0.25 , 0.417, 0.25 , 0.125, 0.292, 0.417, 0.375, 0.375, 0.208,
-       0.333, 0.542, 0.292, 0.417, 0.375, 0.417, 0.417, 0.208, 0.375,
-       0.208, 0.667, 0.5  , 0.292, 0.417, 0.208, 0.333, 0.5  , 0.417,
-       0.75 , 0.25 , 0.083, 0.5  , 0.333, 0.333, 0.292, 0.542, 0.5  ,
-       0.333, 0.417, 0.333, 0.417, 0.333, 0.75 , 0.333, 0.333, 0.25 ,
-       0.417, 0.583, 0.458, 0.417, 0.458, 0.458, 0.458, 0.292, 0.5  ,
-       0.542, 0.417, 0.208, 0.417, 0.583, 0.417])
-```
-
-**分布可视化**
-
-```python
-import seaborn as sns
-sns.distplot(s,kde=False,rug=True)
-```
-频率分布直方图：
-
-
-![](https://imgkr.cn-bj.ufileos.com/49bf5190-429c-4172-a53c-e3f6b66d4e64.png)
-
-
-```python
-sns.distplot(s,hist=True,kde=True,rug=True)
-```
-带高斯密度核函数的直方图：
-
-![](https://imgkr.cn-bj.ufileos.com/4e4a72a5-8f59-4893-b435-e4b57e22a18e.png)
-
-
-
-**分布 fit 图**
-
-拿 `gamma` 分布去 fit ：
-```python
-from scipy import stats
-sns.distplot(s, kde=False, fit = stats.gamma)
-```
-
-
-
-![](https://imgkr.cn-bj.ufileos.com/89446755-7420-4f96-97fe-c4e45d0d3dec.png)
-
-
-拿双 `gamma` 去 fit：
-```python
-from scipy import stats
-sns.distplot(s, kde=False, fit = stats.dgamma)
-```
-
-![](https://imgkr.cn-bj.ufileos.com/f2c2a660-5433-4b4f-ad7b-d01da4121319.png)
-
-#### 17 Pandas 使用技巧
-
-对于动辄就几十或几百个 G 的数据，在读取的这么大数据的时候，我们有没有办法随机选取一小部分数据，然后读入内存，快速了解数据和开展 EDA ？
-
-使用 Pandas 的 skiprows 和 概率知识，就能做到。
-
-下面解释具体怎么做。
-
-如下所示，读取某 100 G 大小的 big_data.csv 数据
-
-1) 使用 skiprows 参数，
-
-2) x > 0 确保首行读入， 
-
-3) np.random.rand() > 0.01 表示 99% 的数据都会被随机过滤掉
-
-言外之意，只有全部数据的 1% 才有机会选入内存中。
-
-```python
-import pandas as pd
-import numpy as np
-
-df = pd.read_csv("big_data.csv", 
-skiprows = 
-lambda x: x>0 and np.random.rand() > 0.01)
-
-print("The shape of the df is {}. 
-It has been reduced 100 times!".format(df.shape))
-```
-
-使用这种方法，读取的数据量迅速缩减到原来的 1% ，对于迅速展开数据分析有一定的帮助。
-
-### 十一、一步一步掌握Flask web开发
-
-#### 1 Flask版 hello world
-
-Flask是Python轻量级web框架，容易上手，被广大Python开发者所喜爱。
-
-今天我们先从hello world开始，一步一步掌握Flask web开发。例子君是Flask框架的小白，接下来与读者朋友们，一起学习这个对我而言的新框架，大家多多指导。
-
-首先`pip install Flask`,安装Flask，然后import Flask，同时创建一个 `app`
-```python
-from flask import Flask
-
-App = Flask(__name__)
-```
-
-写一个index页的入口函数，返回hello world.
-
-通过装饰器：App.route('/')创建index页的路由或地址，一个`/`表示index页，也就是主页。
-
-```python
-@App.route('/')
-def index():
-    return "hello world"
-```
-
-调用 `index`函数:
-```python
-if __name__ == "__main__":
-    App.run(debug=True)
-```
-
-然后启动，会在console下看到如下启动信息，表明`服务启动成功`。
-```python
-* Debug mode: on
- * Restarting with stat
- * Debugger is active!
- * Debugger PIN: 663-788-611
- * Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)
-```
-
- 接下来，打开一个网页，相当于启动客户端，并在Url栏中输入：`http://127.0.0.1:5000/`，看到页面上答应出`hello world`，证明服务访问成功。
-
- 同时在服务端后台看到如下信息，表示处理一次来自客户端的`get`请求。
- ```python
- 27.0.0.1 - - [03/Feb/2020 21:26:50] "GET / HTTP/1.1" 200 -
- ```
-
- 以上就是flask的hello world 版
-
-#### 2 Flask之数据入库操作
-
-数据持久化就是将数据写入到数据库存储的过程。
-
-本例子使用`sqlite3`数据库。
-
-1)导入`sqlite3`，未安装前使用命令`pip install sqlite3`
-
-创建一个`py`文件：`sqlite3_started.py`，并写下第一行代码：
-```python
-import sqlite3
-```
-2)手动创建一个数据库实例`db`, 命名`test.db`
-
-3)创建与数据库实例`test.db`的连接:
-```python
-conn = sqlite3.connect("test.db")
-```
-
-4)拿到连接`conn`的cursor
-```python
-c = conn.cursor()
-```
-
-5)创建第一张表`books`
-
-共有四个字段：`id`,`sort`,`name`,`price`，类型分别为：`int`,`int`,`text`,`real`. 其中`id`为`primary key`. 主键的取值必须是唯一的(`unique`)，否则会报错。
-
-
-```python
-c.execute('''CREATE TABLE books
-      (id int primary key,
-       sort int,
-       name text,
-       price real)''')
-```
-第一次执行上面语句，表`books`创建完成。当再次执行时，就会报`重复建表`的错误。需要优化脚本，检查表是否存在`IF NOT EXISTS books`，不存在再创建：
-```python
-c.execute('''CREATE TABLE IF NOT EXISTS books
-      (id int primary key,
-       sort int,
-       name text,
-       price real)''')
-```
-
-6)插入一行记录
-
-共为4个字段赋值
-
-```python
-c.execute('''INSERT INTO books VALUES
-       (1, 
-       1, 
-       'computer science',
-       39.0)''')
-```
-
-7)一次插入多行记录
-
-先创建一个list:`books`，使用`executemany`一次插入多行。
-```python
-books = [(2, 2, 'Cook book', 68),
-         (3, 2, 'Python intro', 89),
-         (4, 3, 'machine learning', 59),
-         ]
-
-
-c.executemany('INSERT INTO books VALUES (?, ?, ?, ?)', books)
-```
-
-8)提交
-
-提交后才会真正生效，写入到数据库
-
-```python
-conn.commit()
-```
-
-9)关闭期初建立的连接conn
-
-务必记住手动关闭，否则会出现内存泄漏
-```python
-conn.close()
-print('Done')
-```
-
-10)查看结果
-例子君使用`vs code`，在扩展库中选择：`SQLite`安装。
-
-![image-20200208211721377](./img/image-20200208211721377.png)
-
-新建一个`sq`文件：`a.sql`，内容如下：
-
-```sql
-SELECT * from books 
-```
-右键`run query`，得到表`books`插入的4行记录可视化图：
-
-![image-20200208211806853](./img/image-20200208211806853.png)
-
-以上十步就是sqlite3写入数据库的主要步骤，作为Flask系列的第二篇，为后面的前端讲解打下基础。
-
-#### 3 Flask各层调用关系
-
-这篇介绍Flask和B/S模式，即浏览器/服务器模式，是接下来快速理解Flask代码的关键理论篇：**理解Views、models和渲染模板层的调用关系**。
-
-1) 发出请求
-
-当我们在浏览器地址栏中输入某个地址，按回车后，完成第一步。
-
-2) 视图层 views接收1)步发出的请求，Flask中使用解释器的方式处理这个求情，实例代码如下，它通常涉及到调用models层和模板文件层
-
-```python
-@main_blue.route('/', methods=['GET', 'POST'])
-def index():
-    form = TestForm()
-    print('test')
-```
-
-3) models层会负责创建数据模型，执行CRUD操作
-
-4) 模板文件层处理html模板
-
-5) 组合后返回html
-
-6) models层和html模板组合后返回给views层
-
-7）最后views层响应并渲染到浏览器页面，我们就能看到请求的页面。
-
-完整过程图如下所示：
-
-![image-20200211152007983](./img/web1.png)
-
-读者朋友们，如果你和例子君一样都是初学Flask编程，需要好好理解上面的过程。理解这些对于接下来的编程会有一定的理论指导，方向性指导价值。
-
-### Python 问答
-
-#### 2：Python 如何生成二维码？
-
-
-
-
-
-## qrcode
-
-今天先来解答如何生成二维码。Python的`qrcode`包支持生成二维码。
-
-用法也很简单：
-
-```python
-import qrcode
-
-# 二维码内容
-data = "http://www.zglg.work/wp-content/uploads/2020/10/image-3.png"
-# 生成二维码
-img = qrcode.make(data=data)
-# 直接显示二维码
-img.show()
-# 保存二维码为文件
-img.save("我的微信.jpg")
-```
-
-生成的二维码如下：
-
-![](https://imgkr2.cn-bj.ufileos.com/f0b08c53-0107-483b-bbe5-072bebc58e8d.png?UCloudPublicKey=TOKEN_8d8b72be-579a-4e83-bfd0-5f6ce1546f13&Signature=rVtaeBWhzLPPq%252BFCVtiOv6rS0tI%253D&Expires=1603544615)
-
-
-大家微信扫描后，会出现我的二维码。
-
-另外，还可以设置二维码的颜色等样式：
-
-```python
-import qrcode
-
-# 实例化二维码生成类
-qr = qrcode.QRCode(border=2)
-# 设置二维码数据
-data = "http://www.zglg.work/wp-content/uploads/2020/10/image-3.png"
-qr.add_data(data=data)
-# 启用二维码颜色设置
-qr.make(fit=True)
-img = qr.make_image(fill_color="orange", back_color="white")
-
-# 显示二维码
-img.show()
-```
-
-生成一个orange的二维码：
-
-![](https://imgkr2.cn-bj.ufileos.com/cbd26fd8-27cf-4630-935f-6896822ce483.png?UCloudPublicKey=TOKEN_8d8b72be-579a-4e83-bfd0-5f6ce1546f13&Signature=uy1r24x%252Fp5QpI5Wy10Ebdaz%252BpLM%253D&Expires=1603544681)
-
-更多样式，大家可以自己去玩耍。
-
diff --git a/_config.yml b/_config.yml
deleted file mode 100644
index ddeb671b..00000000
--- a/_config.yml
+++ /dev/null
@@ -1 +0,0 @@
-theme: jekyll-theme-time-machine
\ No newline at end of file
diff --git a/dev/python-dev.md b/dev/python-dev.md
new file mode 100644
index 00000000..1c5c04b8
--- /dev/null
+++ b/dev/python-dev.md
@@ -0,0 +1,1481 @@
+### Python 实战
+
+
+#### 221 自动群发邮件
+
+Python自动群发邮件
+
+```python
+import smtplib
+from email import (header)
+from email.mime import (text, application, multipart)
+import time
+
+def sender_mail():
+    smt_p = smtplib.SMTP()
+    smt_p.connect(host='smtp.qq.com', port=25)
+    sender, password = '113097485@qq.com', "**************"
+    smt_p.login(sender, password)
+    receiver_addresses, count_num = [
+        'guozhennianhua@163.com', 'xiaoxiazi99@163.com'], 1
+    for email_address in receiver_addresses:
+        try:
+            msg = multipart.MIMEMultipart()
+            msg['From'] = "zhenguo"
+            msg['To'] = email_address
+            msg['subject'] = header.Header('这是邮件主题通知', 'utf-8')
+            msg.attach(text.MIMEText(
+                '这是一封测试邮件，请勿回复本邮件~', 'plain', 'utf-8'))
+            smt_p.sendmail(sender, email_address, msg.as_string())
+            time.sleep(10)
+            print('第%d次发送给%s' % (count_num, email_address))
+            count_num = count_num + 1
+        except Exception as e:
+            print('第%d次给%s发送邮件异常' % (count_num, email_address))
+            continue
+    smt_p.quit()
+
+sender_mail()
+```
+
+
+
+注意：
+发送邮箱是qq邮箱，所以要在qq邮箱中设置开启SMTP服务，设置完成时会生成一个授权码，将这个授权码赋值给文中的`password`变量
+
+#### 222 二分搜索
+
+二分搜索是程序员必备的算法，无论什么场合，都要非常熟练地写出来。
+
+小例子描述：
+在**有序数组**`arr`中，指定区间`[left,right]`范围内，查找元素`x`
+如果不存在，返回`-1`
+
+二分搜索`binarySearch`实现的主逻辑
+
+```python
+def binarySearch(arr, left, right, x):
+    while left <= right:
+
+        mid = int(left + (right - left) / 2); # 找到中间位置。求中点写成(left+right)/2更容易溢出，所以不建议这样写
+
+        # 检查x是否出现在位置mid
+        if arr[mid] == x:
+            print('found %d 在索引位置%d 处' %(x,mid))
+            return mid
+
+            # 假如x更大，则不可能出现在左半部分
+        elif arr[mid] < x:
+            left = mid + 1 #搜索区间变为[mid+1,right]
+            print('区间缩小为[%d,%d]' %(mid+1,right))
+
+        # 同理，假如x更小，则不可能出现在右半部分
+        elif x<arr[mid]:
+            right = mid - 1 #搜索区间变为[left,mid-1]
+            print('区间缩小为[%d,%d]' %(left,mid-1))
+
+    # 假如搜索到这里，表明x未出现在[left,right]中
+    return -1
+```
+
+在`Ipython`交互界面中，调用`binarySearch`的小Demo:
+
+```python
+In [8]: binarySearch([4,5,6,7,10,20,100],0,6,5)
+区间缩小为[0,2]
+found 5 at 1
+Out[8]: 1
+
+In [9]: binarySearch([4,5,6,7,10,20,100],0,6,4)
+区间缩小为[0,2]
+区间缩小为[0,0]
+found 4 at 0
+Out[9]: 0
+
+In [10]: binarySearch([4,5,6,7,10,20,100],0,6,20)
+区间缩小为[4,6]
+found 20 at 5
+Out[10]: 5
+
+In [11]: binarySearch([4,5,6,7,10,20,100],0,6,100)
+区间缩小为[4,6]
+区间缩小为[6,6]
+found 100 at 6
+Out[11]: 6
+```
+
+#### 223 爬取天气数据并解析温度值
+
+爬取天气数据并解析温度值
+
+素材来自朋友袁绍，感谢！
+
+爬取的html 结构
+
+<img src="./img/1.png" width="50%"/>
+
+```python
+import requests
+from lxml import etree
+import pandas as pd
+import re
+
+url = 'http://www.weather.com.cn/weather1d/101010100.shtml#input'
+with requests.get(url) as res:
+    content = res.content
+    html = etree.HTML(content)
+```
+
+
+
+通过lxml模块提取值
+
+lxml比beautifulsoup解析在某些场合更高效
+
+```python
+location = html.xpath('//*[@id="around"]//a[@target="_blank"]/span/text()')
+temperature = html.xpath('//*[@id="around"]/div/ul/li/a/i/text()')
+```
+
+结果：
+
+```python
+['香河', '涿州', '唐山', '沧州', '天津', '廊坊', '太原', '石家庄', '涿鹿', '张家口', '保定', '三河', '北京孔庙', '北京国子监', '中国地质博物馆', '月坛公
+园', '明城墙遗址公园', '北京市规划展览馆', '什刹海', '南锣鼓巷', '天坛公园', '北海公园', '景山公园', '北京海洋馆']
+
+['11/-5°C', '14/-5°C', '12/-6°C', '12/-5°C', '11/-1°C', '11/-5°C', '8/-7°C', '13/-2°C', '8/-6°C', '5/-9°C', '14/-6°C', '11/-4°C', '13/-3°C'
+, '13/-3°C', '12/-3°C', '12/-3°C', '13/-3°C', '12/-2°C', '12/-3°C', '13/-3°C', '12/-2°C', '12/-2°C', '12/-2°C', '12/-3°C']
+```
+
+
+构造DataFrame对象
+
+```python
+df = pd.DataFrame({'location':location, 'temperature':temperature})
+print('温度列')
+print(df['temperature'])
+```
+
+正则解析温度值
+
+```python
+df['high'] = df['temperature'].apply(lambda x: int(re.match('(-?[0-9]*?)/-?[0-9]*?°C', x).group(1) ) )
+df['low'] = df['temperature'].apply(lambda x: int(re.match('-?[0-9]*?/(-?[0-9]*?)°C', x).group(1) ) )
+print(df)
+```
+
+详细说明子字符创捕获
+
+除了简单地判断是否匹配之外，正则表达式还有提取子串的强大功能。用`()`表示的就是要提取的分组（group）。比如：`^(\d{3})-(\d{3,8})$`分别定义了两个组，可以直接从匹配的字符串中提取出区号和本地号码
+
+```python
+m = re.match(r'^(\d{3})-(\d{3,8})$', '010-12345')
+print(m.group(0))
+print(m.group(1))
+print(m.group(2))
+
+# 010-12345
+# 010
+# 12345
+```
+
+如果正则表达式中定义了组，就可以在`Match`对象上用`group()`方法提取出子串来。
+
+注意到`group(0)`永远是原始字符串，`group(1)`、`group(2)`……表示第1、2、……个子串。
+
+
+最终结果
+
+```kepython
+Name: temperature, dtype: object
+    location temperature  high  low
+0         香河     11/-5°C    11   -5
+1         涿州     14/-5°C    14   -5
+2         唐山     12/-6°C    12   -6
+3         沧州     12/-5°C    12   -5
+4         天津     11/-1°C    11   -1
+5         廊坊     11/-5°C    11   -5
+6         太原      8/-7°C     8   -7
+7        石家庄     13/-2°C    13   -2
+8         涿鹿      8/-6°C     8   -6
+9        张家口      5/-9°C     5   -9
+10        保定     14/-6°C    14   -6
+11        三河     11/-4°C    11   -4
+12      北京孔庙     13/-3°C    13   -3
+13     北京国子监     13/-3°C    13   -3
+14   中国地质博物馆     12/-3°C    12   -3
+15      月坛公园     12/-3°C    12   -3
+16   明城墙遗址公园     13/-3°C    13   -3
+17  北京市规划展览馆     12/-2°C    12   -2
+18       什刹海     12/-3°C    12   -3
+19      南锣鼓巷     13/-3°C    13   -3
+20      天坛公园     12/-2°C    12   -2
+21      北海公园     12/-2°C    12   -2
+22      景山公园     12/-2°C    12   -2
+23     北京海洋馆     12/-3°C    12   -3
+```
+
+### 十、数据分析
+
+本项目基于Kaggle电影影评数据集，通过这个系列，你将学到如何进行数据探索性分析(EDA)，学会使用数据分析利器`pandas`，会用绘图包`pyecharts`，以及EDA时可能遇到的各种实际问题及一些处理技巧。
+
+
+
+本项目需要导入的包：
+
+```python
+import pandas as pd
+import numpy as np
+import matplotlib.pyplot as plt
+from pyecharts.charts import Bar,Grid,Line
+import pyecharts.options as opts
+from pyecharts.globals import ThemeType
+```
+
+#### 1 创建DataFrame
+pandas中一个dataFrame实例：
+```python
+Out[89]:
+        a  val
+0  apple1  1.0
+1  apple2  2.0
+2  apple3  3.0
+3  apple4  4.0
+4  apple5  5.0
+```
+
+我们的**目标**是变为如下结构：
+```python
+a  apple1  apple2  apple3  apple4  apple5
+0     1.0     2.0     3.0     4.0     5.0
+```
+
+乍看可使用`pivot`，但很难一步到位。
+
+所以另辟蹊径，提供一种简单且好理解的方法：
+
+```python
+In [113]: pd.DataFrame(index=[0],columns=df.a,data=dict(zip(df.a,df.val)))
+Out[113]:
+a  apple1  apple2  apple3  apple4  apple5
+0     1.0     2.0     3.0     4.0     5.0
+```
+以上方法是重新创建一个DataFrame,直接把`df.a`所有可能取值作为新dataframe的列，index调整为`[0]`，注意类型必须是数组类型(array-like 或者 Index)，两个轴确定后，`data`填充数据域。
+
+```python
+In [116]: dict(zip(df.a,df.val))
+Out[116]: {'apple1': 1.0, 'apple2': 2.0, 'apple3': 3.0, 'apple4': 4.0, 'apple5': 5.0}
+```
+
+
+
+#### 2 导入数据
+数据来自kaggle，共包括三个文件：
+
+1. movies.dat
+2. ratings.dat
+3. users.dat
+
+`movies.dat`包括三个字段：['Movie ID', 'Movie Title', 'Genre']
+
+使用pandas导入此文件：
+
+```python
+import pandas as pd
+
+movies = pd.read_csv('./data/movietweetings/movies.dat', delimiter='::', engine='python', header=None, names = ['Movie ID', 'Movie Title', 'Genre'])
+```
+
+导入后，显示前5行：
+
+```python
+   Movie ID                                        Movie Title  \
+0         8      Edison Kinetoscopic Record of a Sneeze (1894)   
+1        10                La sortie des usines Lumi猫re (1895)   
+2        12                      The Arrival of a Train (1896)   
+3        25  The Oxford and Cambridge University Boat Race ...   
+4        91                         Le manoir du diable (1896)   
+5       131                           Une nuit terrible (1896)   
+6       417                      Le voyage dans la lune (1902)   
+7       439                     The Great Train Robbery (1903)   
+8       443        Hiawatha, the Messiah of the Ojibway (1903)   
+9       628                    The Adventures of Dollie (1908)  
+                                          Genre  
+0                             Documentary|Short  
+1                             Documentary|Short  
+2                             Documentary|Short  
+3                                           NaN  
+4                                  Short|Horror  
+5                           Short|Comedy|Horror  
+6  Short|Action|Adventure|Comedy|Fantasy|Sci-Fi  
+7                    Short|Action|Crime|Western  
+8                                           NaN  
+9                                  Action|Short  
+```
+
+
+
+次导入其他两个数据文件
+
+`users.dat`:
+
+```python
+users = pd.read_csv('./data/movietweetings/users.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Twitter ID'])
+print(users.head())
+```
+
+结果：
+
+```python
+   User ID  Twitter ID
+0        1   397291295
+1        2    40501255
+2        3   417333257
+3        4   138805259
+4        5  2452094989
+5        6   391774225
+6        7    47317010
+7        8    84541461
+8        9  2445803544
+9       10   995885060
+```
+
+
+
+`rating.data`:
+
+```python
+ratings = pd.read_csv('./data/movietweetings/ratings.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Movie ID', 'Rating', 'Rating Timestamp'])
+print(ratings.head())
+```
+
+结果：
+
+```python
+   User ID  Movie ID  Rating  Rating Timestamp
+0        1    111161      10        1373234211
+1        1    117060       7        1373415231
+2        1    120755       6        1373424360
+3        1    317919       6        1373495763
+4        1    454876      10        1373621125
+5        1    790724       8        1374641320
+6        1    882977       8        1372898763
+7        1   1229238       9        1373506523
+8        1   1288558       5        1373154354
+9        1   1300854       8        1377165712
+```
+
+ **read_csv 使用说明**
+
+说明，本次导入`dat`文件使用`pandas.read_csv`函数。
+
+第一个位置参数`./data/movietweetings/ratings.dat` 表示文件的相对路径
+
+第二个关键字参数：`delimiter='::'`，表示文件分隔符使用`::`
+
+后面几个关键字参数分别代表使用的引擎，文件没有表头，所以`header`为`None;`
+
+导入后dataframe的列名使用`names`关键字设置，这个参数大家可以记住，比较有用。
+
+
+
+Kaggle电影数据集第一节，我们使用数据处理利器 `pandas`， 函数`read_csv` 导入给定的三个数据文件。
+
+```python
+import pandas as pd
+
+movies = pd.read_csv('./data/movietweetings/movies.dat', delimiter='::', engine='python', header=None, names = ['Movie ID', 'Movie Title', 'Genre'])
+users = pd.read_csv('./data/movietweetings/users.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Twitter ID'])
+ratings = pd.read_csv('./data/movietweetings/ratings.dat', delimiter='::', engine='python', header=None, names = ['User ID', 'Movie ID', 'Rating', 'Rating Timestamp'])
+```
+
+用到的`read_csv`，某些重要的参数，如何使用在上一节也有所提到。下面开始数据探索分析(EDA)
+
+> 找出得分前10喜剧(comedy)
+
+
+
+#### 3 处理组合值
+
+表`movies`字段`Genre`表示电影的类型，可能有多个值，分隔符为`|`，取值也可能为`None`.
+
+针对这类字段取值，可使用Pandas中Series提供的`str`做一步转化，**注意它是向量级的**，下一步，如Python原生的`str`类似，使用`contains`判断是否含有`comedy`字符串：
+
+```python
+mask = movies.Genre.str.contains('comedy',case=False,na=False)
+```
+
+注意使用的两个参数：`case`, `na`
+
+case为 False，表示对大小写不敏感；
+na Genre列某个单元格为`NaN`时，我们使用的充填值，此处填充为`False`
+
+返回的`mask`是一维的`Series`，结构与 movies.Genre相同，取值为True 或 False.
+
+观察结果：
+
+```python
+0    False
+1    False
+2    False
+3    False
+4    False
+5     True
+6     True
+7    False
+8    False
+9    False
+Name: Genre, dtype: bool
+
+```
+
+
+ #### 4 访问某列
+
+得到掩码mask后，pandas非常方便地能提取出目标记录：
+
+```python
+comedy = movies[mask]
+comdey_ids = comedy['Movie ID']
+
+```
+
+以上，在pandas中被最频率使用，不再解释。看结果`comedy_ids.head()`：
+
+```python
+5      131
+6      417
+15    2354
+18    3863
+19    4099
+20    4100
+21    4101
+22    4210
+23    4395
+25    4518
+Name: Movie ID, dtype: int64
+
+```
+
+
+
+1-4介绍`数据读入`，`处理组合值`，`索引数据`等, pandas中使用较多的函数，基于Kaggle真实电影影评数据集，最后得到所有`喜剧 ID`：
+
+```python
+5      131
+6      417
+15    2354
+18    3863
+19    4099
+20    4100
+21    4101
+22    4210
+23    4395
+25    4518
+Name: Movie ID, dtype: int64
+
+```
+
+下面继续数据探索之旅~
+
+#### 5 连接两个表
+
+拿到所有喜剧的ID后，要想找出其中平均得分最高的前10喜剧，需要关联另一张表：`ratings`:
+
+再回顾下ratings表结构：
+
+```python
+   User ID  Movie ID  Rating  Rating Timestamp
+0        1    111161      10        1373234211
+1        1    117060       7        1373415231
+2        1    120755       6        1373424360
+3        1    317919       6        1373495763
+4        1    454876      10        1373621125
+5        1    790724       8        1374641320
+6        1    882977       8        1372898763
+7        1   1229238       9        1373506523
+8        1   1288558       5        1373154354
+9        1   1300854       8        1377165712
+
+```
+
+
+pandas 中使用`join`关联两张表，连接字段是`Movie ID`，如果顺其自然这么使用`join`：
+
+```python
+combine = ratings.join(comedy, on='Movie ID', rsuffix='2')
+
+```
+
+左右滑动，查看完整代码
+
+大家可验证这种写法，仔细一看，会发现结果非常诡异。
+
+究其原因，这是pandas join函数使用的一个算是坑点，它在官档中介绍，连接右表时，此处右表是`comedy`，它的`index`要求是连接字段，也就是 `Movie ID`. 
+
+左表的index不要求，但是要在参数 `on`中给定。
+
+**以上是要注意的一点**
+
+修改为：
+
+```python
+combine = ratings.join(comedy.set_index('Movie ID'), on='Movie ID')
+print(combine.head(10))
+
+```
+
+以上是OK的写法
+
+观察结果：
+
+```python
+   User ID  Movie ID  Rating  Rating Timestamp Movie Title Genre
+0        1    111161      10        1373234211         NaN   NaN
+1        1    117060       7        1373415231         NaN   NaN
+2        1    120755       6        1373424360         NaN   NaN
+3        1    317919       6        1373495763         NaN   NaN
+4        1    454876      10        1373621125         NaN   NaN
+5        1    790724       8        1374641320         NaN   NaN
+6        1    882977       8        1372898763         NaN   NaN
+7        1   1229238       9        1373506523         NaN   NaN
+8        1   1288558       5        1373154354         NaN   NaN
+9        1   1300854       8        1377165712         NaN   NaN
+
+```
+
+Genre列为`NaN`表明，这不是喜剧。需要筛选出此列不为`NaN` 的记录。
+
+#### 6 按列筛选
+
+pandas最方便的地方，就是向量化运算，尽可能减少了for循环的嵌套。
+
+按列筛选这种常见需求，自然可以轻松应对。
+
+为了照顾初次接触 pandas 的朋友，分两步去写：
+
+```python
+mask = pd.notnull(combine['Genre'])
+
+```
+
+结果是一列只含`True 或 False`的值
+
+```python
+result = combine[mask]
+print(result.head())
+
+```
+
+结果中，Genre字段中至少含有一个Comedy字符串，表明验证了我们以上操作是OK的。
+
+```python
+    User ID  Movie ID  Rating  Rating Timestamp             Movie Title  \
+12        1   1588173       9        1372821281      Warm Bodies (2013)   
+13        1   1711425       3        1372604878        21 & Over (2013)   
+14        1   2024432       8        1372703553   Identity Thief (2013)   
+17        1   2101441       1        1372633473  Spring Breakers (2012)   
+28        2   1431045       7        1457733508         Deadpool (2016)   
+
+                             Genre  
+12           Comedy|Horror|Romance  
+13                          Comedy  
+14    Adventure|Comedy|Crime|Drama  
+17              Comedy|Crime|Drama  
+28  Action|Adventure|Comedy|Sci-Fi  
+
+
+```
+
+
+
+截止目前已经求出所有喜剧电影`result`，前5行如下，Genre中都含有`Comedy`字符串：
+```python
+    User ID  Movie ID  Rating  Rating Timestamp             Movie Title  \
+12        1   1588173       9        1372821281      Warm Bodies (2013)   
+13        1   1711425       3        1372604878        21 & Over (2013)   
+14        1   2024432       8        1372703553   Identity Thief (2013)   
+17        1   2101441       1        1372633473  Spring Breakers (2012)   
+28        2   1431045       7        1457733508         Deadpool (2016)   
+
+                             Genre  
+12           Comedy|Horror|Romance  
+13                          Comedy  
+14    Adventure|Comedy|Crime|Drama  
+17              Comedy|Crime|Drama  
+28  Action|Adventure|Comedy|Sci-Fi  
+```
+
+
+
+#### 7 按照Movie ID 分组
+
+result中会有很多观众对同一部电影的打分，所以要求得分前10的喜剧，先按照`Movie ID`分组，然后求出平均值：
+```python
+score_as_movie = result.groupby('Movie ID').mean()
+```
+
+前5行显示如下：
+```python
+               User ID  Rating  Rating Timestamp
+Movie ID                                        
+131       34861.000000     7.0      1.540639e+09
+417       34121.409091     8.5      1.458680e+09
+2354       6264.000000     8.0      1.456343e+09
+3863      43803.000000    10.0      1.430439e+09
+4099      25084.500000     7.0      1.450323e+09
+```
+
+#### 8 按照电影得分排序
+
+```python
+score_as_movie.sort_values(by='Rating', ascending = False,inplace=True)
+score_as_movie
+```
+前5行显示如下：
+```python
+	User ID	Rating	Rating Timestamp
+Movie ID			
+7134690	30110.0	10.0	1.524974e+09
+416889	1319.0	10.0	1.543320e+09
+57840	23589.0	10.0	1.396802e+09
+5693562	50266.0	10.0	1.511024e+09
+5074	43803.0	10.0	1.428352e+09
+```
+都是满分？这有点奇怪，会不会这些电影都只有几个人评分，甚至只有1个？评分样本个数太少，显然最终的平均分数不具有太强的说服力。
+
+所以，下面要进行每部电影的评分人数统计
+
+#### 9 分组后使用聚合函数
+
+根据`Movie ID`分组后，使用`count`函数统计`每组个数`，只保留count列，最后得到`watchs2`:
+
+```python
+watchs = result.groupby('Movie ID').agg(['count'])
+watchs2 = watchs['Rating']['count']
+```
+打印前20行：
+```python
+print(watchs2.head(20))
+```
+结果：
+```python
+Movie ID
+131      1
+417     22
+2354     1
+3863     1
+4099     2
+4100     1
+4101     1
+4210     1
+4395     1
+4518     1
+4546     2
+4936     2
+5074     1
+5571     1
+6177     1
+6414     3
+6684     1
+6689     1
+7145     1
+7162     2
+Name: count, dtype: int64
+```
+果然，竟然有这么多电影的评论数只有1次！样本个数太少，评论的平均值也就没有什么说服力。
+
+查看`watchs2`一些重要统计量：
+```python
+watchs2.describe()
+```
+结果：
+```python
+count    10740.000000
+mean        20.192086
+std         86.251411
+min          1.000000
+25%          1.000000
+50%          2.000000
+75%          7.000000
+max       1843.000000
+Name: count, dtype: float64
+```
+共有10740部**喜剧**电影被评分，平均打分次数20次，标准差86，75%的电影样本打分次数小于7次，最小1次，最多1843次。
+
+#### 10 频率分布直方图
+
+绘制评论数的频率分布直方图，便于更直观的观察电影被评论的分布情况。上面分析到，75%的电影打分次数小于7次，所以绘制打分次数小于20次的直方图：
+
+```python
+fig = plt.figure(figsize=(12,8))
+histn = plt.hist(watchs2[watchs2 <=19],19,histtype='step')
+plt.scatter([i+1 for i in range(len(histn[0]))],histn[0])
+```
+
+![](./img/20200131094927.jpg)
+
+`histn`元祖表示个数和对应的被分割的区间，查看`histn[0]`:
+```python
+array([4383., 1507.,  787.,  541.,  356.,  279.,  209.,  163.,  158.,
+        118.,  114.,   90.,  104.,   81.,   80.,   73.,   62.,   65.,
+         52.])
+```
+```python
+sum(histn[0]) # 9222
+```
+看到电影评论次数1到19次的喜剧电影9222部，共有10740部喜剧电影，大约`86%`的喜剧电影评论次数`小于20次`，有`1518`部电影评论数不小于20次。
+
+我们肯定希望挑选出被评论次数尽可能多的电影，因为难免会有水军和滥竽充数等`异常评论`行为。那么，如何准确的量化最小抽样量呢？
+
+
+
+#### 11 最小抽样量
+
+根据统计学的知识，最小抽样量和Z值、样本方差和样本误差相关，下面给出具体的求解最小样本量的计算方法。
+
+采用如下计算公式：
+
+$$ n = \frac{Z^2\sigma^2}{E^2} $$
+
+
+此处，$Z$ 值取为95%的置信度对应的Z值也就是1.96，样本误差取为均值的2.5%.
+
+根据以上公式，编写下面代码：
+
+```python
+n3 = result.groupby('Movie ID').agg(['count','mean','std'])
+n3r = n3[n3['Rating']['count']>=20]['Rating']
+```
+只计算影评超过20次，且满足最小样本量的电影。计算得到的`n3r`前5行：
+```python
+	count	mean	std
+Movie ID			
+417	22	8.500000	1.263027
+12349	68	8.485294	1.227698
+15324	20	8.350000	1.039990
+15864	51	8.431373	1.374844
+17925	44	8.636364	1.259216
+```
+进一步求出最小样本量：
+```python
+nmin = (1.96**2*n3r['std']**2) / ( (n3r['mean']*0.025)**2 )
+```
+`nmin`前5行：
+```python
+Movie ID
+417         135.712480
+12349       128.671290
+15324        95.349276
+15864       163.434005
+17925       130.668350
+```
+
+筛选出满足最小抽样量的喜剧电影：
+
+```python
+n3s = n3r[ n3r['count'] >= nmin ]
+```
+结果显示如下，因此共有`173`部电影满足最小样本抽样量。
+
+```python
+
+count	mean	std
+Movie ID			
+53604	129	8.635659	1.230714
+57012	207	8.449275	1.537899
+70735	224	8.839286	1.190799
+75686	209	8.095694	1.358885
+88763	296	8.945946	1.026984
+...	...	...	...
+6320628	860	7.966279	1.469924
+6412452	276	7.510870	1.389529
+6662050	22	10.000000	0.000000
+6966692	907	8.673649	1.286455
+7131622	1102	7.851180	1.751500
+173 rows × 3 columns
+```
+
+#### 12 去重和连表
+
+按照平均得分从大到小排序：
+```python
+n3s_sort = n3s.sort_values(by='mean',ascending=False)
+```
+结果：
+```python
+	count	mean	std
+Movie ID			
+6662050	22	10.000000	0.000000
+4921860	48	10.000000	0.000000
+5262972	28	10.000000	0.000000
+5512872	353	9.985836	0.266123
+3863552	199	9.010050	1.163372
+...	...	...	...
+1291150	647	6.327666	1.785968
+2557490	546	6.307692	1.858434
+1478839	120	6.200000	0.728761
+2177771	485	6.150515	1.523922
+1951261	1091	6.083410	1.736127
+173 rows × 3 columns
+```
+仅靠`Movie ID`还是不知道哪些电影，连接`movies`表：
+```python
+ms = movies.drop_duplicates(subset=['Movie ID'])
+ms = ms.set_index('Movie ID')
+n3s_final = n3s_drops.join(ms,on='Movie ID')
+```
+
+#### 13 结果分析
+
+喜剧榜单前50名：
+```python
+Movie Title
+Five Minutes (2017)
+MSG 2 the Messenger (2015)
+Avengers: Age of Ultron Parody (2015)
+Be Somebody (2016)
+Bajrangi Bhaijaan (2015)
+Back to the Future (1985)
+La vita 鐚?bella (1997)
+The Intouchables (2011)
+The Sting (1973)
+Coco (2017)
+Toy Story 3 (2010)
+3 Idiots (2009)
+Green Book (2018)
+Dead Poets Society (1989)
+The Apartment (1960)
+P.K. (2014)
+The Truman Show (1998)
+Am鑼卨ie (2001)
+Inside Out (2015)
+Toy Story 4 (2019)
+Toy Story (1995)
+Finding Nemo (2003)
+Dr. Strangelove or: How I Learned to Stop Worrying and Love the Bomb (1964)
+Home Alone (1990)
+Zootopia (2016)
+Up (2009)
+Monsters, Inc. (2001)
+La La Land (2016)
+Relatos salvajes (2014)
+En man som heter Ove (2015)
+Snatch (2000)
+Lock, Stock and Two Smoking Barrels (1998)
+How to Train Your Dragon 2 (2014)
+As Good as It Gets (1997)
+Guardians of the Galaxy (2014)
+The Grand Budapest Hotel (2014)
+Fantastic Mr. Fox (2009)
+Silver Linings Playbook (2012)
+Sing Street (2016)
+Deadpool (2016)
+Annie Hall (1977)
+Pride (2014)
+In Bruges (2008)
+Big Hero 6 (2014)
+Groundhog Day (1993)
+The Breakfast Club (1985)
+Little Miss Sunshine (2006)
+Deadpool 2 (2018)
+The Terminal (2004)
+```
+
+前10名评论数图：
+
+![](./img/2020013109495711.jpg)
+
+代码：
+```python
+x = n3s_final['Movie Title'][:10].tolist()[::-1]
+y = n3s_final['count'][:10].tolist()[::-1]
+bar = (
+    Bar()
+    .add_xaxis(x)
+    .add_yaxis('评论数',y,category_gap='50%')
+    .reversal_axis()
+    .set_global_opts(title_opts=opts.TitleOpts(title="喜剧电影被评论次数"),
+                    toolbox_opts=opts.ToolboxOpts(),)
+)
+grid = (
+    Grid(init_opts=opts.InitOpts(theme=ThemeType.LIGHT))
+    .add(bar, grid_opts=opts.GridOpts(pos_left="30%"))
+)
+grid.render_notebook()
+```
+
+前10名得分图：
+
+![](./img/2020013109500812.jpg)
+
+代码：
+```python
+x = n3s_final['Movie Title'][:10].tolist()[::-1]
+y = n3s_final['mean'][:10].round(3).tolist()[::-1]
+bar = (
+    Bar()
+    .add_xaxis(x)
+    .add_yaxis('平均得分',y,category_gap='50%')
+    .reversal_axis()
+    .set_global_opts(title_opts=opts.TitleOpts(title="喜剧电影平均得分"),
+                    xaxis_opts=opts.AxisOpts(min_=8.0,name='平均得分'),
+                    toolbox_opts=opts.ToolboxOpts(),)
+)
+grid = (
+    Grid(init_opts=opts.InitOpts(theme=ThemeType.MACARONS))
+    .add(bar, grid_opts=opts.GridOpts(pos_left="30%"))
+)
+grid.render_notebook()
+```
+
+
+
+#### 14 生成哑变量
+
+分类变量的数值化，是指将枚举类变量转化为indicator变量或称dummy变量。
+
+那么什么是`indicator变量`，看看如下例子，A变量解析为：`[1,0,0]`, B解析为：`[0,1,0]`, C解析为：`[0,0,1]`
+```python
+In [8]: s = pd.Series(list('ABCA'))
+In [9]: pd.get_dummies(s)
+Out[9]:
+   A  B  C
+0  1  0  0
+1  0  1  0
+2  0  0  1
+3  1  0  0
+```
+
+如果输入的字符有4个唯一值，看到字符a被解析为[1,0,0,0]，向量长度为4.
+
+```python
+In [5]: s = pd.Series(list('abaccd'))
+In [6]: pd.get_dummies(s)
+Out[6]:
+   a  b  c  d
+0  1  0  0  0
+1  0  1  0  0
+2  1  0  0  0
+3  0  0  1  0
+4  0  0  1  0
+5  0  0  0  1
+```
+
+也就是说dummy向量的长度等于输入字符串中，唯一字符的个数。
+
+#### 15 讨厌的SettingWithCopyWarning！！！
+
+Pandas 处理数据，太好用了，谁用谁知道！
+
+使用过 Pandas 的，几乎都会遇到一个警告：
+
+*SettingWithCopyWarning*
+
+非常烦人！
+
+尤其是刚接触 Pandas 的，完全不理解为什么弹出这么一串：
+
+```python
+d:\source\test\settingwithcopy.py:9: SettingWithCopyWarning:
+A value is trying to be set on a copy of a slice from a DataFrame.
+Try using .loc[row_indexer,col_indexer] = value instead
+
+See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy 
+```
+
+归根结底，是因为代码中出现`链式操作`...
+
+有人就问了，什么是`链式操作`?
+
+这样的：
+
+```python
+tmp = df[df.a<4]
+tmp['c'] = 200
+```
+
+先记住这个最典型的情况，即可！
+
+有的人就问了：出现这个 Warning, 需要理会它吗？ 
+
+如果结果不对，当然要理会；如果结果对，不care.
+
+举个例子~~
+
+```python
+import pandas as  pd
+
+df = pd.DataFrame({'a':[1,3,5],'b':[4,2,7]},index=['a','b','c'])
+df.loc[df.a<4,'c'] = 100
+print(df)
+print('it\'s ok')
+
+tmp = df[df.a<4]
+tmp['c'] = 200
+print('-----tmp------')
+print(tmp)
+print('-----df-------')
+print(df)
+```
+
+输出结果：
+```python
+   a  b      c
+a  1  4  100.0
+b  3  2  100.0
+c  5  7    NaN
+it's ok
+d:\source\test\settingwithcopy.py:9: SettingWithCopyWarning:
+A value is trying to be set on a copy of a slice from a DataFrame.
+Try using .loc[row_indexer,col_indexer] = value instead
+
+See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy     
+  tmp['c'] = 200
+-----tmp------
+   a  b    c
+a  1  4  200
+b  3  2  200
+-----df-------
+   a  b      c
+a  1  4  100.0
+b  3  2  100.0
+c  5  7    NaN
+```
+
+it's ok 行后面的发生链式赋值，导致结果错误。因为 tmp 变了，df 没赋上值啊，所以必须理会。
+
+it's ok 行前的是正解。
+
+以上，链式操作尽量避免，如何避免？多使用 `.loc[row_indexer,col_indexer]`，提示告诉我们的~
+
+#### 16 NumPy 数据归一化、分布可视化
+
+仅使用 `NumPy`，下载数据，归一化，使用 `seaborn` 展示数据分布。
+
+**下载数据**
+
+```python
+import numpy as np
+
+url = 'https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data'
+wid = np.genfromtxt(url, delimiter=',', dtype='float', usecols=[1])
+```
+仅提取 `iris` 数据集的第二列 `usecols = [1]`
+
+**展示数据**
+
+```python
+array([3.5, 3. , 3.2, 3.1, 3.6, 3.9, 3.4, 3.4, 2.9, 3.1, 3.7, 3.4, 3. ,
+       3. , 4. , 4.4, 3.9, 3.5, 3.8, 3.8, 3.4, 3.7, 3.6, 3.3, 3.4, 3. ,
+       3.4, 3.5, 3.4, 3.2, 3.1, 3.4, 4.1, 4.2, 3.1, 3.2, 3.5, 3.1, 3. ,
+       3.4, 3.5, 2.3, 3.2, 3.5, 3.8, 3. , 3.8, 3.2, 3.7, 3.3, 3.2, 3.2,
+       3.1, 2.3, 2.8, 2.8, 3.3, 2.4, 2.9, 2.7, 2. , 3. , 2.2, 2.9, 2.9,
+       3.1, 3. , 2.7, 2.2, 2.5, 3.2, 2.8, 2.5, 2.8, 2.9, 3. , 2.8, 3. ,
+       2.9, 2.6, 2.4, 2.4, 2.7, 2.7, 3. , 3.4, 3.1, 2.3, 3. , 2.5, 2.6,
+       3. , 2.6, 2.3, 2.7, 3. , 2.9, 2.9, 2.5, 2.8, 3.3, 2.7, 3. , 2.9,
+       3. , 3. , 2.5, 2.9, 2.5, 3.6, 3.2, 2.7, 3. , 2.5, 2.8, 3.2, 3. ,
+       3.8, 2.6, 2.2, 3.2, 2.8, 2.8, 2.7, 3.3, 3.2, 2.8, 3. , 2.8, 3. ,
+       2.8, 3.8, 2.8, 2.8, 2.6, 3. , 3.4, 3.1, 3. , 3.1, 3.1, 3.1, 2.7,
+       3.2, 3.3, 3. , 2.5, 3. , 3.4, 3. ])
+      
+```
+
+这是单变量(univariate)长度为 150 的一维 NumPy 数组。
+
+**归一化**
+
+求出最大值、最小值
+```python
+smax = np.max(wid)
+smin = np.min(wid)
+
+In [51]: smax,smin
+Out[51]: (4.4, 2.0)
+````
+归一化公式：
+```python
+s = (wid - smin) / (smax - smin)
+```
+只打印小数点后三位设置：
+```python
+np.set_printoptions(precision=3)  
+```
+
+归一化结果：
+```markdown
+array([0.625, 0.417, 0.5  , 0.458, 0.667, 0.792, 0.583, 0.583, 0.375,
+       0.458, 0.708, 0.583, 0.417, 0.417, 0.833, 1.   , 0.792, 0.625,
+       0.75 , 0.75 , 0.583, 0.708, 0.667, 0.542, 0.583, 0.417, 0.583,
+       0.625, 0.583, 0.5  , 0.458, 0.583, 0.875, 0.917, 0.458, 0.5  ,
+       0.625, 0.458, 0.417, 0.583, 0.625, 0.125, 0.5  , 0.625, 0.75 ,
+       0.417, 0.75 , 0.5  , 0.708, 0.542, 0.5  , 0.5  , 0.458, 0.125,
+       0.333, 0.333, 0.542, 0.167, 0.375, 0.292, 0.   , 0.417, 0.083,
+       0.375, 0.375, 0.458, 0.417, 0.292, 0.083, 0.208, 0.5  , 0.333,
+       0.208, 0.333, 0.375, 0.417, 0.333, 0.417, 0.375, 0.25 , 0.167,
+       0.167, 0.292, 0.292, 0.417, 0.583, 0.458, 0.125, 0.417, 0.208,
+       0.25 , 0.417, 0.25 , 0.125, 0.292, 0.417, 0.375, 0.375, 0.208,
+       0.333, 0.542, 0.292, 0.417, 0.375, 0.417, 0.417, 0.208, 0.375,
+       0.208, 0.667, 0.5  , 0.292, 0.417, 0.208, 0.333, 0.5  , 0.417,
+       0.75 , 0.25 , 0.083, 0.5  , 0.333, 0.333, 0.292, 0.542, 0.5  ,
+       0.333, 0.417, 0.333, 0.417, 0.333, 0.75 , 0.333, 0.333, 0.25 ,
+       0.417, 0.583, 0.458, 0.417, 0.458, 0.458, 0.458, 0.292, 0.5  ,
+       0.542, 0.417, 0.208, 0.417, 0.583, 0.417])
+```
+
+**分布可视化**
+
+```python
+import seaborn as sns
+sns.distplot(s,kde=False,rug=True)
+```
+频率分布直方图：
+
+
+![](https://imgkr.cn-bj.ufileos.com/49bf5190-429c-4172-a53c-e3f6b66d4e64.png)
+
+
+```python
+sns.distplot(s,hist=True,kde=True,rug=True)
+```
+带高斯密度核函数的直方图：
+
+![](https://imgkr.cn-bj.ufileos.com/4e4a72a5-8f59-4893-b435-e4b57e22a18e.png)
+
+
+
+**分布 fit 图**
+
+拿 `gamma` 分布去 fit ：
+```python
+from scipy import stats
+sns.distplot(s, kde=False, fit = stats.gamma)
+```
+
+
+
+![](https://imgkr.cn-bj.ufileos.com/89446755-7420-4f96-97fe-c4e45d0d3dec.png)
+
+
+拿双 `gamma` 去 fit：
+```python
+from scipy import stats
+sns.distplot(s, kde=False, fit = stats.dgamma)
+```
+
+![](https://imgkr.cn-bj.ufileos.com/f2c2a660-5433-4b4f-ad7b-d01da4121319.png)
+
+#### 17 Pandas 使用技巧
+
+对于动辄就几十或几百个 G 的数据，在读取的这么大数据的时候，我们有没有办法随机选取一小部分数据，然后读入内存，快速了解数据和开展 EDA ？
+
+使用 Pandas 的 skiprows 和 概率知识，就能做到。
+
+下面解释具体怎么做。
+
+如下所示，读取某 100 G 大小的 big_data.csv 数据
+
+1) 使用 skiprows 参数，
+
+2) x > 0 确保首行读入， 
+
+3) np.random.rand() > 0.01 表示 99% 的数据都会被随机过滤掉
+
+言外之意，只有全部数据的 1% 才有机会选入内存中。
+
+```python
+import pandas as pd
+import numpy as np
+
+df = pd.read_csv("big_data.csv", 
+skiprows = 
+lambda x: x>0 and np.random.rand() > 0.01)
+
+print("The shape of the df is {}. 
+It has been reduced 100 times!".format(df.shape))
+```
+
+使用这种方法，读取的数据量迅速缩减到原来的 1% ，对于迅速展开数据分析有一定的帮助。
+
+### 十一、一步一步掌握Flask web开发
+
+#### 1 Flask版 hello world
+
+Flask是Python轻量级web框架，容易上手，被广大Python开发者所喜爱。
+
+今天我们先从hello world开始，一步一步掌握Flask web开发。例子君是Flask框架的小白，接下来与读者朋友们，一起学习这个对我而言的新框架，大家多多指导。
+
+首先`pip install Flask`,安装Flask，然后import Flask，同时创建一个 `app`
+```python
+from flask import Flask
+
+App = Flask(__name__)
+```
+
+写一个index页的入口函数，返回hello world.
+
+通过装饰器：App.route('/')创建index页的路由或地址，一个`/`表示index页，也就是主页。
+
+```python
+@App.route('/')
+def index():
+    return "hello world"
+```
+
+调用 `index`函数:
+```python
+if __name__ == "__main__":
+    App.run(debug=True)
+```
+
+然后启动，会在console下看到如下启动信息，表明`服务启动成功`。
+```python
+* Debug mode: on
+ * Restarting with stat
+ * Debugger is active!
+ * Debugger PIN: 663-788-611
+ * Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)
+```
+
+ 接下来，打开一个网页，相当于启动客户端，并在Url栏中输入：`http://127.0.0.1:5000/`，看到页面上答应出`hello world`，证明服务访问成功。
+
+ 同时在服务端后台看到如下信息，表示处理一次来自客户端的`get`请求。
+ ```python
+ 27.0.0.1 - - [03/Feb/2020 21:26:50] "GET / HTTP/1.1" 200 -
+ ```
+
+ 以上就是flask的hello world 版
+
+#### 2 Flask之数据入库操作
+
+数据持久化就是将数据写入到数据库存储的过程。
+
+本例子使用`sqlite3`数据库。
+
+1)导入`sqlite3`，未安装前使用命令`pip install sqlite3`
+
+创建一个`py`文件：`sqlite3_started.py`，并写下第一行代码：
+```python
+import sqlite3
+```
+2)手动创建一个数据库实例`db`, 命名`test.db`
+
+3)创建与数据库实例`test.db`的连接:
+```python
+conn = sqlite3.connect("test.db")
+```
+
+4)拿到连接`conn`的cursor
+```python
+c = conn.cursor()
+```
+
+5)创建第一张表`books`
+
+共有四个字段：`id`,`sort`,`name`,`price`，类型分别为：`int`,`int`,`text`,`real`. 其中`id`为`primary key`. 主键的取值必须是唯一的(`unique`)，否则会报错。
+
+
+```python
+c.execute('''CREATE TABLE books
+      (id int primary key,
+       sort int,
+       name text,
+       price real)''')
+```
+第一次执行上面语句，表`books`创建完成。当再次执行时，就会报`重复建表`的错误。需要优化脚本，检查表是否存在`IF NOT EXISTS books`，不存在再创建：
+```python
+c.execute('''CREATE TABLE IF NOT EXISTS books
+      (id int primary key,
+       sort int,
+       name text,
+       price real)''')
+```
+
+6)插入一行记录
+
+共为4个字段赋值
+
+```python
+c.execute('''INSERT INTO books VALUES
+       (1, 
+       1, 
+       'computer science',
+       39.0)''')
+```
+
+7)一次插入多行记录
+
+先创建一个list:`books`，使用`executemany`一次插入多行。
+```python
+books = [(2, 2, 'Cook book', 68),
+         (3, 2, 'Python intro', 89),
+         (4, 3, 'machine learning', 59),
+         ]
+
+
+c.executemany('INSERT INTO books VALUES (?, ?, ?, ?)', books)
+```
+
+8)提交
+
+提交后才会真正生效，写入到数据库
+
+```python
+conn.commit()
+```
+
+9)关闭期初建立的连接conn
+
+务必记住手动关闭，否则会出现内存泄漏
+```python
+conn.close()
+print('Done')
+```
+
+10)查看结果
+例子君使用`vs code`，在扩展库中选择：`SQLite`安装。
+
+![image-20200208211721377](./img/image-20200208211721377.png)
+
+新建一个`sq`文件：`a.sql`，内容如下：
+
+```sql
+SELECT * from books 
+```
+右键`run query`，得到表`books`插入的4行记录可视化图：
+
+![image-20200208211806853](./img/image-20200208211806853.png)
+
+以上十步就是sqlite3写入数据库的主要步骤，作为Flask系列的第二篇，为后面的前端讲解打下基础。
+
+#### 3 Flask各层调用关系
+
+这篇介绍Flask和B/S模式，即浏览器/服务器模式，是接下来快速理解Flask代码的关键理论篇：**理解Views、models和渲染模板层的调用关系**。
+
+1) 发出请求
+
+当我们在浏览器地址栏中输入某个地址，按回车后，完成第一步。
+
+2) 视图层 views接收1)步发出的请求，Flask中使用解释器的方式处理这个求情，实例代码如下，它通常涉及到调用models层和模板文件层
+
+```python
+@main_blue.route('/', methods=['GET', 'POST'])
+def index():
+    form = TestForm()
+    print('test')
+```
+
+3) models层会负责创建数据模型，执行CRUD操作
+
+4) 模板文件层处理html模板
+
+5) 组合后返回html
+
+6) models层和html模板组合后返回给views层
+
+7）最后views层响应并渲染到浏览器页面，我们就能看到请求的页面。
+
+完整过程图如下所示：
+
+![image-20200211152007983](./img/web1.png)
+
+读者朋友们，如果你和例子君一样都是初学Flask编程，需要好好理解上面的过程。理解这些对于接下来的编程会有一定的理论指导，方向性指导价值。
+
+### Python 问答
+
+#### Python 如何生成二维码？
+
+
+
+
+
+## qrcode
+
+今天先来解答如何生成二维码。Python的`qrcode`包支持生成二维码。
+
+用法也很简单：
+
+```python
+import qrcode
+
+# 二维码内容
+data = "http://www.zglg.work/wp-content/uploads/2020/10/image-3.png"
+# 生成二维码
+img = qrcode.make(data=data)
+# 直接显示二维码
+img.show()
+# 保存二维码为文件
+img.save("我的微信.jpg")
+```
+
+生成的二维码如下：
+
+![](https://imgkr2.cn-bj.ufileos.com/f0b08c53-0107-483b-bbe5-072bebc58e8d.png?UCloudPublicKey=TOKEN_8d8b72be-579a-4e83-bfd0-5f6ce1546f13&Signature=rVtaeBWhzLPPq%252BFCVtiOv6rS0tI%253D&Expires=1603544615)
+
+
+大家微信扫描后，会出现我的二维码。
+
+另外，还可以设置二维码的颜色等样式：
+
+```python
+import qrcode
+
+# 实例化二维码生成类
+qr = qrcode.QRCode(border=2)
+# 设置二维码数据
+data = "http://www.zglg.work/wp-content/uploads/2020/10/image-3.png"
+qr.add_data(data=data)
+# 启用二维码颜色设置
+qr.make(fit=True)
+img = qr.make_image(fill_color="orange", back_color="white")
+
+# 显示二维码
+img.show()
+```
+
+生成一个orange的二维码：
+
+![](https://imgkr2.cn-bj.ufileos.com/cbd26fd8-27cf-4630-935f-6896822ce483.png?UCloudPublicKey=TOKEN_8d8b72be-579a-4e83-bfd0-5f6ce1546f13&Signature=uy1r24x%252Fp5QpI5Wy10Ebdaz%252BpLM%253D&Expires=1603544681)
+
+更多样式，大家可以自己去玩耍。
+
+## Python小项目：句子KWIC显示
+
+上下文关键字（KWIC, Key Word In Context）是最常见的多行协调显示格式。
+
+此小项目描述：输入一系列句子，给定一个给定单词，每个句子中至少会出现一次给定单词。目标输出，给定单词按照KWIC显示，KWIC显示的基本要求：待查询单词居中，前面`pre`序列右对齐，后面`post`序列左对齐，待查询单词前和后长度相等，若输入句子无法满足要求，用空格填充。
+
+输入参数：输入句子sentences, 待查询单词selword, 滑动窗口长度`window_len`
+
+举例，输入如下六个句子，给定单词`secure`，输出如下字符串：
+
+```python
+               pre keyword    post 
+
+     welfare , and secure  the blessings of
+     nations , and secured immortal glory with 
+       , and shall secure  to you the 
+    cherished . To secure  us against these 
+     defense as to secure  our cities and 
+          I can to secure  economy and fidelity 
+```
+
+请补充实现下面函数：
+
+```python
+def kwic(sentences: List[str], selword: str, window_len: int) -> str:
+    """
+    :type: sentences: input sentences
+    :type: selword: selected word
+    :type: window_len: window length
+    """
+```
+
+更多KWIC显示参考如下：
+
+http://dep.chs.nihon-u.ac.jp/english_lang/tukamoto/kwic_e.html
+
+完整代码已经公布在：http://www.zglg.work/Python-20-topics/python-project1-kwic/
+![image](https://user-images.githubusercontent.com/20391209/123213609-c494dc00-d4f8-11eb-84d6-4d8caabb44f7.png)
+![image](https://user-images.githubusercontent.com/20391209/123213901-26eddc80-d4f9-11eb-96cd-d3518005c4df.png)
+
diff --git a/md/1.md b/md/1.md
index f3dc1de3..817a25ea 100644
--- a/md/1.md
+++ b/md/1.md
@@ -25,3 +25,6 @@ relu(5) # 5
 relu(-1) # 0
 ```
 
+
+
+<center>[下一个例子](2.md)</center>
diff --git a/md/10.md b/md/10.md
index 5ac085b2..1fdec254 100644
--- a/md/10.md
+++ b/md/10.md
@@ -6,9 +6,39 @@
 
 #### 10  转为整型　　
 
-int(x, base =10) , x可能为字符串或数值，将x 转换为一个普通整数。如果参数是字符串，那么它可能包含符号和小数点。如果超出了普通整数的表示范围，一个长整数被返回。  
+int(x, base =10) , x可能为字符串或数值，将 x 转换为一个普通整数。
+
+参数base指定x进制数，常见的2，8，10，16分别表示二进制、八进制、十进制、十六进制的数字
+
+如果参数是字符串，必须为整数型字符串，如果是浮点数字符串会抛出异常。
+
+如果x是浮点数，int后截去小数点，只保留整数部分。
 
 ```python
-In [1]: int('12',16)
-Out[1]: 18
+In [2]: int('0110',2)
+
+Out[2]: 6
+
+In [3]: int('0732',8)
+Out[3]: 474
+
+In [4]: int('12',16)
+Out[4]: 18
+
+In [5]: int('12',10)
+Out[5]: 12
+
+In [6]: int(1.45)
+Out[6]: 1
+
+In [7]: int('1.45')
+---------------------------------------------------------------------------
+ValueError                                Traceback (most recent call last)
+<ipython-input-7-6cf4b951408f> in <module>
+----> 1 int('1.45')
+
+ValueError: invalid literal for int() with base 10: '1.45'
 ```
+
+
+<center>[上一个例子](9.md)    [下一个例子](11.md)</center>
diff --git a/md/100.md b/md/100.md
index 1dc596f0..3e7fdf39 100644
--- a/md/100.md
+++ b/md/100.md
@@ -46,4 +46,6 @@ print(r)  # False
 r = is_permutation('work', 'woo')
 print(r)  # False
 
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](99.md)    [下一个例子](101.md)</center>
\ No newline at end of file
diff --git a/md/101.md b/md/101.md
index 639c4df6..5c78429e 100644
--- a/md/101.md
+++ b/md/101.md
@@ -32,4 +32,6 @@ print(r)  # True
 
 r = is_rotation('greatman', 'maneatgr')
 print(r)  # False
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](100.md)    [下一个例子](102.md)</center>
\ No newline at end of file
diff --git a/md/102.md b/md/102.md
index ff19dd37..56130bf5 100644
--- a/md/102.md
+++ b/md/102.md
@@ -59,4 +59,6 @@ Out[88]: <re.Match object; span=(0, 5), match='000.2'>
 
 两个式子连接起来就是最终的结果：
 
-`^[1-9]\d*\.\d*|0\.\d*[1-9]\d*$`
\ No newline at end of file
+`^[1-9]\d*\.\d*|0\.\d*[1-9]\d*$`
+
+<center>[上一个例子](101.md)    [下一个例子](103.md)</center>
\ No newline at end of file
diff --git a/md/103.md b/md/103.md
index 4dd2b710..3b039296 100644
--- a/md/103.md
+++ b/md/103.md
@@ -15,4 +15,6 @@ Out[5]: './data/py/test'
 
 In [6]: ext
 Out[6]: '.py'
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](102.md)    [下一个例子](104.md)</center>
\ No newline at end of file
diff --git a/md/104.md b/md/104.md
index 57e79708..a7d32078 100644
--- a/md/104.md
+++ b/md/104.md
@@ -17,4 +17,6 @@ Out[12]: './data/py'
 
 In [13]: ifile
 Out[13]: 'test.py'
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](103.md)    [下一个例子](105.md)</center>
\ No newline at end of file
diff --git a/md/105.md b/md/105.md
index 9c6d75ee..a0d5d5d7 100644
--- a/md/105.md
+++ b/md/105.md
@@ -78,4 +78,6 @@ def main():
         new_ext = '.' + new_ext
 
     batch_rename(work_dir, old_ext, new_ext)
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](104.md)    [下一个例子](106.md)</center>
\ No newline at end of file
diff --git a/md/106.md b/md/106.md
index e65c908e..e3025b60 100644
--- a/md/106.md
+++ b/md/106.md
@@ -37,4 +37,6 @@ xls_to_xlsx('./data')
 # 输出结果：
 # ['cut_words.csv', 'email_list.xlsx', 'email_test.docx', 'email_test.jpg', 'email_test.xlsx', 'geo_data.png', 'geo_data.xlsx',
 'iotest.txt', 'pyside2.md', 'PySimpleGUI-4.7.1-py3-none-any.whl', 'test.txt', 'test_excel.xlsx', 'ziptest', 'ziptest.zip']
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](105.md)    [下一个例子](107.md)</center>
\ No newline at end of file
diff --git a/md/107.md b/md/107.md
index b60e8ea7..e148f023 100644
--- a/md/107.md
+++ b/md/107.md
@@ -21,4 +21,6 @@ def find_file(work_dir,extension='jpg'):
 
 r = find_file('.','md') 
 print(r) # 返回所有目录下的md文件
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](106.md)    [下一个例子](108.md)</center>
\ No newline at end of file
diff --git a/md/108.md b/md/108.md
index 92285829..530a221c 100644
--- a/md/108.md
+++ b/md/108.md
@@ -31,4 +31,6 @@ def batch_zip(start_dir):
 batch_zip('./data/ziptest')
 
 
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](107.md)    [下一个例子](109.md)</center>
\ No newline at end of file
diff --git a/md/109.md b/md/109.md
index 54638ac2..3e52f2c0 100644
--- a/md/109.md
+++ b/md/109.md
@@ -19,4 +19,6 @@ def hash_cry32(s):
 
 print(hash_cry32(1))  # c4ca4238a0b923820dcc509a6f75849b
 print(hash_cry32('hello'))  # 5d41402abc4b2a76b9719d911017c592
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](108.md)    [下一个例子](110.md)</center>
\ No newline at end of file
diff --git a/md/11.md b/md/11.md
index 527d50d9..31072399 100644
--- a/md/11.md
+++ b/md/11.md
@@ -9,6 +9,12 @@
 base为底的exp次幂，如果mod给出，取余
 
 ```python
-In [1]: pow(3, 2, 4)
+In [1]: pow(2,1.5)                                                              
+Out[1]: 2.8284271247461903
+
+In [1]: pow(3, 2, 4) # 3的2次方结果再对4取余数
 Out[1]: 1
 ```
+
+
+<center>[上一个例子](10.md)    [下一个例子](12.md)</center>
diff --git a/md/110.md b/md/110.md
index e0a26a5d..1d09c097 100644
--- a/md/110.md
+++ b/md/110.md
@@ -52,4 +52,6 @@ Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su      Mo Tu We Th Fr Sa Su
 21 22 23 24 25 26 27      18 19 20 21 22 23 24      16 17 18 19 20 21 22
 28 29 30 31               25 26 27 28 29 30         23 24 25 26 27 28 29
                                                     30 31
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](109.md)    [下一个例子](111.md)</center>
\ No newline at end of file
diff --git a/md/111.md b/md/111.md
index 139d494c..a5463347 100644
--- a/md/111.md
+++ b/md/111.md
@@ -20,4 +20,6 @@ print(print_leap_str % mydate.year)
 
 ```python
 2019年不是闰年
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](110.md)    [下一个例子](112.md)</center>
\ No newline at end of file
diff --git a/md/112.md b/md/112.md
index 0b6d20a8..e3ae7ce2 100644
--- a/md/112.md
+++ b/md/112.md
@@ -16,4 +16,6 @@ print(f'{mydate.year}年-{mydate.month}月共有{days}天\n')
 2019年-12月的第一天是那一周的第6天
 
 2019年-12月共有31天
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](111.md)    [下一个例子](113.md)</center>
\ No newline at end of file
diff --git a/md/113.md b/md/113.md
index 9e17b876..24f0cc60 100644
--- a/md/113.md
+++ b/md/113.md
@@ -11,4 +11,6 @@ print(f"当月第一天:{month_first_day}\n")
 
 ```python
 # 当月第一天:2019-12-01
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](112.md)    [下一个例子](114.md)</center>
\ No newline at end of file
diff --git a/md/114.md b/md/114.md
index 5cfabc7a..04ddee42 100644
--- a/md/114.md
+++ b/md/114.md
@@ -19,4 +19,6 @@ print(f"当月最后一天:{month_last_day}\n")
 
 ```python
 当月最后一天:2019-12-31
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](113.md)    [下一个例子](115.md)</center>
\ No newline at end of file
diff --git a/md/115.md b/md/115.md
index ad518397..ddcdf2a1 100644
--- a/md/115.md
+++ b/md/115.md
@@ -18,4 +18,6 @@ print(today_time)  # 2019-12-22 18:02:33.398894
 
 local_time = localtime()
 print(strftime("%Y-%m-%d %H:%M:%S", local_time))  # 转化为定制的格式 2019-12-22 18:13:41
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](114.md)    [下一个例子](116.md)</center>
\ No newline at end of file
diff --git a/md/116.md b/md/116.md
index 88e2e46a..356bc44e 100644
--- a/md/116.md
+++ b/md/116.md
@@ -14,4 +14,6 @@ struct_time = strptime('2019-12-22 10:10:08', "%Y-%m-%d %H:%M:%S")
 print(struct_time) # struct_time类型就是time中的一个类
 
 # time.struct_time(tm_year=2019, tm_mon=12, tm_mday=22, tm_hour=10, tm_min=10, tm_sec=8, tm_wday=6, tm_yday=356, tm_isdst=-1)
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](115.md)    [下一个例子](117.md)</center>
\ No newline at end of file
diff --git a/md/117.md b/md/117.md
index 58701ed9..40d00118 100644
--- a/md/117.md
+++ b/md/117.md
@@ -14,4 +14,6 @@ Out[2]: time.struct_time(tm_year=2019, tm_mon=12, tm_mday=22, tm_hour=18, tm_min
 
 print(strftime("%m-%d-%Y %H:%M:%S", localtime()))  # 转化为定制的格式
 # 这是字符串表示的时间：   12-22-2019 18:26:21
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](116.md)    [下一个例子](118.md)</center>
\ No newline at end of file
diff --git a/md/118.md b/md/118.md
index a86160ed..8b3eaec1 100644
--- a/md/118.md
+++ b/md/118.md
@@ -23,10 +23,12 @@ print(t) # <_MainThread(MainThread, started 139908235814720)>
 
 所以，验证了程序默认是在`MainThead`中执行。
 
-`t.getName()`获得这个线程的名字，其他常用方法，`getName()`获得线程`id`,`isAlive()`判断线程是否存活等。
+`t.getName()`获得这个线程的名字，其他常用方法，`t.ident`获得线程`id`,`isAlive()`判断线程是否存活等。
 
 ```python
 print(t.getName()) # MainThread
 print(t.ident) # 139908235814720
 print(t.isAlive()) # True
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](117.md)    [下一个例子](119.md)</center>
diff --git a/md/119.md b/md/119.md
index 30a29c41..7db86481 100644
--- a/md/119.md
+++ b/md/119.md
@@ -38,4 +38,6 @@ my_thread().start()
 打印i:1
 ```
 
-至此，多线程相关的核心知识点，已经总结完毕。但是，仅仅知道这些，还不够！光纸上谈兵，当然远远不够。
\ No newline at end of file
+至此，多线程相关的核心知识点，已经总结完毕。但是，仅仅知道这些，还不够！光纸上谈兵，当然远远不够。
+
+<center>[上一个例子](118.md)    [下一个例子](120.md)</center>
\ No newline at end of file
diff --git a/md/12.md b/md/12.md
index 73c360c0..78fe34d1 100644
--- a/md/12.md
+++ b/md/12.md
@@ -15,3 +15,6 @@ Out[11]: 10.022
 In [12]: round(10.05,1)
 Out[12]: 10.1
 ```
+
+
+<center>[上一个例子](11.md)    [下一个例子](13.md)</center>
\ No newline at end of file
diff --git a/md/120.md b/md/120.md
index c332ad22..e79a709f 100644
--- a/md/120.md
+++ b/md/120.md
@@ -49,4 +49,6 @@ threads = [threading.Thread(name='t%d'%(i,),target=print_time) for i in range(3)
 当前线程t2,打印结束时间为:2020-01-12 02:27:16.107564
 当前线程t0,打印结束时间为:2020-01-12 02:27:16.107290
 当前线程t1,打印结束时间为:2020-01-12 02:27:16.107741
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](119.md)    [下一个例子](121.md)</center>
\ No newline at end of file
diff --git a/md/121.md b/md/121.md
index b5fbbc03..34608ac0 100644
--- a/md/121.md
+++ b/md/121.md
@@ -56,4 +56,6 @@ NO！
 
 
 
-但是在本例中，`a = a + 1`这种修改操作，花费的时间太短了，短到我们无法想象。所以，线程间轮询执行时，都能get到最新的a值。所以，暴露问题的概率就变得微乎其微。
\ No newline at end of file
+但是在本例中，`a = a + 1`这种修改操作，花费的时间太短了，短到我们无法想象。所以，线程间轮询执行时，都能get到最新的a值。所以，暴露问题的概率就变得微乎其微。
+
+<center>[上一个例子](120.md)    [下一个例子](122.md)</center>
\ No newline at end of file
diff --git a/md/122.md b/md/122.md
index bb0ba278..ca307f18 100644
--- a/md/122.md
+++ b/md/122.md
@@ -58,4 +58,6 @@ t9  adds a to 1: 1
 tmp = a + 1
 time.sleep(0.2) # 延时0.2秒，模拟写入所需时间
 a = tmp
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](121.md)    [下一个例子](123.md)</center>
\ No newline at end of file
diff --git a/md/123.md b/md/123.md
index d7763476..ccff06f5 100644
--- a/md/123.md
+++ b/md/123.md
@@ -54,8 +54,10 @@ t8  adds a to 1: 9
 t9  adds a to 1: 10
 ```
 
-一起正常，其实这已经是单线程顺序执行了，就本例子而言，已经失去多线程的价值，并且还带来了因为线程创建开销，浪费时间的副作用。
+一切正常，其实这已经是单线程顺序执行了，就本例子而言，已经失去多线程的价值，并且还带来了因为线程创建开销，浪费时间的副作用。
 
 程序中只有一把锁，通过 `try...finally`还能确保不发生死锁。但是，当程序中启用多把锁，还是很容易发生死锁。
 
-注意使用场合，避免死锁，是我们在使用多线程开发时需要注意的一些问题。
\ No newline at end of file
+注意使用场合，避免死锁，是我们在使用多线程开发时需要注意的一些问题。
+
+<center>[上一个例子](122.md)    [下一个例子](124.md)</center>
diff --git a/md/124.md b/md/124.md
index dbf40795..b96dbd89 100644
--- a/md/124.md
+++ b/md/124.md
@@ -32,4 +32,6 @@ Out[69]: time.struct_time(tm_year=2020, tm_mon=2, tm_mday=22, tm_hour=11, tm_min
     %a  Locale's abbreviated weekday name.
     %A  Locale's full weekday name.
     %b  Locale's abbreviated month name.
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](123.md)    [下一个例子](125.md)</center>
\ No newline at end of file
diff --git a/md/125.md b/md/125.md
index 427d482d..4d5b486c 100644
--- a/md/125.md
+++ b/md/125.md
@@ -20,4 +20,6 @@ def search_n(s, c, n):
 
 print(search_n("fdasadfadf", "a", 3))# 结果为7，正确
 print(search_n("fdasadfadf", "a", 30))# 结果为-1，正确
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](124.md)    [下一个例子](126.md)</center>
\ No newline at end of file
diff --git a/md/126.md b/md/126.md
index 8332c1dc..7ad476fa 100644
--- a/md/126.md
+++ b/md/126.md
@@ -15,4 +15,6 @@ def fibonacci(n):
 
 
 list(fibonacci(5))  # [1, 1, 2, 3, 5]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](125.md)    [下一个例子](127.md)</center>
\ No newline at end of file
diff --git a/md/127.md b/md/127.md
index 5424f588..bfe5eefe 100644
--- a/md/127.md
+++ b/md/127.md
@@ -16,4 +16,6 @@ def find_all_duplicates(lst):
 
 
 find_all_duplicates([1, 2, 2, 3, 3, 3])  # [2,3]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](126.md)    [下一个例子](128.md)</center>
\ No newline at end of file
diff --git a/md/128.md b/md/128.md
index e94c6097..cfd3df4a 100644
--- a/md/128.md
+++ b/md/128.md
@@ -34,4 +34,6 @@ def sumc(*c):
 #Counter({'orange': 3, 'computer': 3, 'apple': 1, 'abc': 1, 'face': 1})
 sumc(a, b, ['abc'], ['face', 'computer'])
 
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](127.md)    [下一个例子](129.md)</center>
\ No newline at end of file
diff --git a/md/129.md b/md/129.md
index daa12086..60a5560f 100644
--- a/md/129.md
+++ b/md/129.md
@@ -48,4 +48,6 @@ cloud
 {'date': '2019-12-14', 'weather': 'cloud'}
 sunny
 {'date': '2019-12-13', 'weather': 'sunny'}
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](128.md)    [下一个例子](130.md)</center>
\ No newline at end of file
diff --git a/md/13.md b/md/13.md
index 6c5365a1..3cc2f3c5 100644
--- a/md/13.md
+++ b/md/13.md
@@ -6,8 +6,13 @@
 
 #### 13 链式比较
 
+Python支持这种连续不等比较，写起来更方便
+
 ```python
 i = 3
 print(1 < i < 3)  # False
 print(1 < i <= 3)  # True
 ```
+
+
+<center>[上一个例子](12.md)    [下一个例子](14.md)</center>
diff --git a/md/130.md b/md/130.md
index b5b1020e..510692f6 100644
--- a/md/130.md
+++ b/md/130.md
@@ -29,4 +29,6 @@ sunny
 {'date': '2019-12-13', 'weather': 'sunny'}
 ```
 
-注意这个结果与上面结果有些微妙不同，这个更多是我们想看到和使用更多的。
\ No newline at end of file
+注意这个结果与上面结果有些微妙不同，这个更多是我们想看到和使用更多的。
+
+<center>[上一个例子](129.md)    [下一个例子](131.md)</center>
\ No newline at end of file
diff --git a/md/131.md b/md/131.md
index 95a5ed4a..c2e00aad 100644
--- a/md/131.md
+++ b/md/131.md
@@ -30,4 +30,6 @@ cloud
 {'date': '2019-12-14', 'weather': 'cloud'}
 sunny
 {'date': '2019-12-13', 'weather': 'sunny'}
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](130.md)    [下一个例子](132.md)</center>
\ No newline at end of file
diff --git a/md/132.md b/md/132.md
index abd469c7..d9eb0078 100644
--- a/md/132.md
+++ b/md/132.md
@@ -29,4 +29,6 @@ In [8]:(i+1 for i in a)
 OUT [8]:<generator object <genexpr> at 0x000002AC7FFA8CF0>
 ```
 
-生成器每迭代一步吐出(`yield`)一个元素并计算和聚合后，进入下一次迭代，直到终点。
\ No newline at end of file
+生成器每迭代一步吐出(`yield`)一个元素并计算和聚合后，进入下一次迭代，直到终点。
+
+<center>[上一个例子](131.md)    [下一个例子](133.md)</center>
\ No newline at end of file
diff --git a/md/133.md b/md/133.md
index 613ea14b..39366435 100644
--- a/md/133.md
+++ b/md/133.md
@@ -43,3 +43,6 @@ In [3]: for day in getEverydaySince(2020,2,1):
 2020-02-10
 ```
 
+
+
+<center>[上一个例子](132.md)    [下一个例子](134.md)</center>
\ No newline at end of file
diff --git a/md/134.md b/md/134.md
index c22be448..c30c8de1 100644
--- a/md/134.md
+++ b/md/134.md
@@ -23,4 +23,6 @@ def divide_iter(lst, n):
 
 list(divide_iter([1, 2, 3, 4, 5], 0))  # [[1, 2, 3, 4, 5]]
 list(divide_iter([1, 2, 3, 4, 5], 2))  # [[1, 2, 3], [4, 5]]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](133.md)    [下一个例子](135.md)</center>
\ No newline at end of file
diff --git a/md/135.md b/md/135.md
index 2bd9cc8b..d3fb5c77 100644
--- a/md/135.md
+++ b/md/135.md
@@ -18,4 +18,6 @@ def function(lst):
         else:
             yield i
 print(list(function(a))) # [1, 2, 3, 4, 5, 6, 7, 8, 'python', 6, 9]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](134.md)    [下一个例子](136.md)</center>
\ No newline at end of file
diff --git a/md/136.md b/md/136.md
index 76880345..a29e3915 100644
--- a/md/136.md
+++ b/md/136.md
@@ -44,3 +44,6 @@ print("append/compre:",round(a/c,3))
 #append/compre: 2.749
 ```
 
+
+
+<center>[上一个例子](135.md)    [下一个例子](137.md)</center>
\ No newline at end of file
diff --git a/md/137.md b/md/137.md
index c72ccf32..6c067f1a 100644
--- a/md/137.md
+++ b/md/137.md
@@ -94,4 +94,6 @@ spending time:1.01
 ```
 
 
-#### 
\ No newline at end of file
+#### 
+
+<center>[上一个例子](136.md)    [下一个例子](138.md)</center>
\ No newline at end of file
diff --git a/md/138.md b/md/138.md
index 4966e2f3..a6dff3a1 100644
--- a/md/138.md
+++ b/md/138.md
@@ -91,4 +91,6 @@ In [33]: myfun2()
 you're calling myfun2 function
 ```
 
-你看，这与装饰器的实现效果是一模一样的。装饰器的写法可能更加直观些，所以不用显示的这样赋值：`myfun = call_print(myfun)`，`myfun2 = call_print(myfun2)`，但是装饰器的这种封装，猛一看，有些不好理解。
\ No newline at end of file
+你看，这与装饰器的实现效果是一模一样的。装饰器的写法可能更加直观些，所以不用显示的这样赋值：`myfun = call_print(myfun)`，`myfun2 = call_print(myfun2)`，但是装饰器的这种封装，猛一看，有些不好理解。
+
+<center>[上一个例子](137.md)    [下一个例子](139.md)</center>
\ No newline at end of file
diff --git a/md/139.md b/md/139.md
index 6f038d5e..006ec56c 100644
--- a/md/139.md
+++ b/md/139.md
@@ -32,4 +32,6 @@ print(list(descend_iter))
 1 `__nex__ `名字不能变，实现定制的迭代逻辑
 
 2 `raise StopIteration`：通过 raise 中断程序，必须这样写
-    
\ No newline at end of file
+    
+
+<center>[上一个例子](138.md)    [下一个例子](140.md)</center>
\ No newline at end of file
diff --git a/md/14.md b/md/14.md
index 90ee4b5b..061a1d8d 100644
--- a/md/14.md
+++ b/md/14.md
@@ -11,6 +11,20 @@
 ```python
 In [12]: s = "apple"                                                            
 
-In [13]: bytes(s,encoding='utf-8')                                              
-Out[13]: b'apple'
+In [13]: a = bytes(s,encoding='utf-8')   
+In [14] a 
+Out[14]: b'apple'
+
+# 转化后a变为字节序列，bytes类型，
+# 并且每个字符都被转化为数值，如下所示
+In [15]: for i in a: 
+    ...:     print(i)                                                                       
+97
+112
+112
+108
+101
 ```
+
+
+<center>[上一个例子](13.md)    [下一个例子](15.md)</center>
diff --git a/md/140.md b/md/140.md
index f9abebf1..3c6fc5bf 100644
--- a/md/140.md
+++ b/md/140.md
@@ -47,4 +47,6 @@ drawCircle(90,-30,'green')
 drawCircle(30,-30,'yellow')    
 
 p.done()
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](139.md)    [下一个例子](141.md)</center>
\ No newline at end of file
diff --git a/md/141.md b/md/141.md
index ca3447e1..70c0e290 100644
--- a/md/141.md
+++ b/md/141.md
@@ -76,4 +76,6 @@ def main():
     p.mainloop()
 
 main()
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](140.md)    [下一个例子](142.md)</center>
\ No newline at end of file
diff --git a/md/142.md b/md/142.md
index 757eddab..978f8df4 100644
--- a/md/142.md
+++ b/md/142.md
@@ -44,4 +44,6 @@ wc = WordCloud(
 ```python
 x = wc.generate(words)
 x.to_file('../data/geo_data.png') 
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](141.md)    [下一个例子](143.md)</center>
\ No newline at end of file
diff --git a/md/143.md b/md/143.md
index 2e4cc805..aa633f4e 100644
--- a/md/143.md
+++ b/md/143.md
@@ -22,4 +22,6 @@ fig.add_trace(
         y=[2, 0.5, 0.7, -1.2, 0.3, 0.4]
     ))
 fig.show()
-```    
\ No newline at end of file
+```    
+
+<center>[上一个例子](142.md)    [下一个例子](144.md)</center>
\ No newline at end of file
diff --git a/md/144.md b/md/144.md
index 1f3cdfc2..5cbd7e90 100644
--- a/md/144.md
+++ b/md/144.md
@@ -23,4 +23,6 @@ print(data)
 # 绘制热力图
 heatmap_plot = sns.heatmap(data, center=0, cmap='gist_rainbow')
 plt.show()
-```    
\ No newline at end of file
+```    
+
+<center>[上一个例子](143.md)    [下一个例子](145.md)</center>
\ No newline at end of file
diff --git a/md/145.md b/md/145.md
index 0311effb..0d3e538a 100644
--- a/md/145.md
+++ b/md/145.md
@@ -18,4 +18,6 @@ gauge.render(path="./data/仪表盘.html")
 print('ok')
 ```  
 
-仪表盘中共展示三项，每项的比例为30%,70%,90%，如下图默认名称显示第一项：Python机器学习，完成比例为30%
\ No newline at end of file
+仪表盘中共展示三项，每项的比例为30%,70%,90%，如下图默认名称显示第一项：Python机器学习，完成比例为30%
+
+<center>[上一个例子](144.md)    [下一个例子](146.md)</center>
\ No newline at end of file
diff --git a/md/146.md b/md/146.md
index 29201849..e04770d8 100644
--- a/md/146.md
+++ b/md/146.md
@@ -21,4 +21,6 @@ def funnel_base() -> Funnel:
   )
   return c
 funnel_base().render('./img/car_fnnel.html')  
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](145.md)    [下一个例子](147.md)</center>
\ No newline at end of file
diff --git a/md/147.md b/md/147.md
index ef94ba8b..fd008bec 100644
--- a/md/147.md
+++ b/md/147.md
@@ -21,4 +21,6 @@ def liquid() -> Liquid:
     return c
 
 liquid().render('./img/liquid.html')
-```    
\ No newline at end of file
+```    
+
+<center>[上一个例子](146.md)    [下一个例子](148.md)</center>
\ No newline at end of file
diff --git a/md/148.md b/md/148.md
index 1ab1ab02..508ea117 100644
--- a/md/148.md
+++ b/md/148.md
@@ -23,4 +23,6 @@ def pie_base() -> Pie:
     return c
 
 pie_base().render('./img/pie_pyecharts.html')   
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](147.md)    [下一个例子](149.md)</center>
\ No newline at end of file
diff --git a/md/149.md b/md/149.md
index 0591ab0c..ae87e900 100644
--- a/md/149.md
+++ b/md/149.md
@@ -23,4 +23,6 @@ def polar_scatter0() -> Polar:
     return c
 
 polar_scatter0().render('./img/polar.html')
-```    
\ No newline at end of file
+```    
+
+<center>[上一个例子](148.md)    [下一个例子](150.md)</center>
\ No newline at end of file
diff --git a/md/15.md b/md/15.md
index 9b5d5405..b5127301 100644
--- a/md/15.md
+++ b/md/15.md
@@ -7,14 +7,30 @@
 #### 15 任意对象转为字符串　　
 
 ```python
-In [14]: i = 100                                                                
+                                                              
 
-In [15]: str(i)                                                                 
-Out[15]: '100'
+In [1]: str(100)                                                                 
+Out[1]: '100'
 
-In [16]: str([])                                                                
-Out[16]: '[]'
+In [2]: str([3,2,10])                                                           
+Out[2]: '[3, 2, 10]'
 
-In [17]: str(tuple())                                                           
-Out[17]: '()'
-```
\ No newline at end of file
+In [3]: str({'a':1, 'b':10})                                                    
+Out[3]: "{'a': 1, 'b': 10}"
+
+In [11]: from collections import defaultdict                                    
+In [12]: dd = defaultdict(int)                                                  
+
+In [14]: for i in [1,3,2,2,3,3]: 
+    ...:     dd[i] += 1 
+    ...:                                                                        
+
+In [15]: dd                                                                     
+Out[15]: defaultdict(int, {1: 1, 3: 3, 2: 2})
+
+In [16]: str(dd)                                                                
+Out[16]: "defaultdict(<class 'int'>, {1: 1, 3: 3, 2: 2})"
+
+```
+
+<center>[上一个例子](14.md)    [下一个例子](16.md)</center>
diff --git a/md/150.md b/md/150.md
index d4f75819..90d7fbbf 100644
--- a/md/150.md
+++ b/md/150.md
@@ -31,4 +31,6 @@ def wordcloud() -> WordCloud:
     return c
 
 wordcloud().render('./img/wordcloud.html')     
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](149.md)    [下一个例子](151.md)</center>
\ No newline at end of file
diff --git a/md/151.md b/md/151.md
index 0df44065..1006c27b 100644
--- a/md/151.md
+++ b/md/151.md
@@ -31,4 +31,6 @@ def heatmap_car() -> HeatMap:
 heatmap_car().render('./img/heatmap_pyecharts.html') 
 ```
 
-热力图描述的实际是三维关系，x轴表示车型，y轴表示国家，每个色块的颜色值代表销量，颜色刻度尺显示在左下角，颜色越红表示销量越大。
\ No newline at end of file
+热力图描述的实际是三维关系，x轴表示车型，y轴表示国家，每个色块的颜色值代表销量，颜色刻度尺显示在左下角，颜色越红表示销量越大。
+
+<center>[上一个例子](150.md)    [下一个例子](152.md)</center>
\ No newline at end of file
diff --git a/md/152.md b/md/152.md
index 4e7e1894..ee5530c0 100644
--- a/md/152.md
+++ b/md/152.md
@@ -67,4 +67,6 @@ def draw_chart():
 ```python
 draw_chart()
 179     
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](151.md)    [下一个例子](153.md)</center>
\ No newline at end of file
diff --git a/md/153.md b/md/153.md
index d7975e8e..b64115cd 100644
--- a/md/153.md
+++ b/md/153.md
@@ -58,4 +58,6 @@ plt.show()
 sns.pairplot(df, hue="species")
 plt.show()
 ```
-   
\ No newline at end of file
+   
+
+<center>[上一个例子](152.md)    [下一个例子](154.md)</center>
\ No newline at end of file
diff --git a/md/154.md b/md/154.md
index e8907e1d..8cdb47b1 100644
--- a/md/154.md
+++ b/md/154.md
@@ -19,4 +19,6 @@ c = (5) # NO!
 c = (5,) # YES!
 186   
 
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](153.md)    [下一个例子](155.md)</center>
\ No newline at end of file
diff --git a/md/155.md b/md/155.md
index b6a16a84..56f84e76 100644
--- a/md/155.md
+++ b/md/155.md
@@ -30,4 +30,6 @@ f(1)
 这是可变类型的默认参数之坑，请务必设置此类默认参数为None：
 
 def f(a,b=None): # YES!
-    pass
\ No newline at end of file
+    pass
+
+<center>[上一个例子](154.md)    [下一个例子](156.md)</center>
\ No newline at end of file
diff --git a/md/156.md b/md/156.md
index eabae77d..3b11562b 100644
--- a/md/156.md
+++ b/md/156.md
@@ -29,3 +29,6 @@ def f():
     i+=1
 ```
 
+
+
+<center>[上一个例子](155.md)    [下一个例子](157.md)</center>
\ No newline at end of file
diff --git a/md/157.md b/md/157.md
index 6b4cf7c5..e7353abc 100644
--- a/md/157.md
+++ b/md/157.md
@@ -29,4 +29,6 @@ def f(a):
 正确做法是转化自由参数为lambda函数的默认参数：
 ```python
 a = [lambda x,i=i: x+i for i in range(3)] # YES!    
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](156.md)    [下一个例子](158.md)</center>
\ No newline at end of file
diff --git a/md/158.md b/md/158.md
index 0219b190..338514f4 100644
--- a/md/158.md
+++ b/md/158.md
@@ -128,3 +128,6 @@ f(a=1,b=2,width=3) #打印结果：{'a': 1, 'b': 2, 'width': 3}
 ```python
 f(1) TypeError: f() takes 0 positional arguments but 1 was given
 ```
+
+
+<center>[上一个例子](157.md)    [下一个例子](159.md)</center>
\ No newline at end of file
diff --git a/md/159.md b/md/159.md
index 1406e316..98b7426f 100644
--- a/md/159.md
+++ b/md/159.md
@@ -13,7 +13,7 @@
 
 ```python
 def del_item(lst,e):
-    return [lst.remove(i) for i in e if i==e] # NO!
+    return [lst.remove(i) for i in lst if i==e] # NO!
 ```
 
 考虑删除这个序列[1,3,3,3,5]中的元素3，结果发现只删除其中两个：
@@ -29,4 +29,6 @@ def del_item(lst,e):
     d = dict(zip(range(len(lst)),lst)) # YES! 构造字典
     return [v for k,v in d.items() if v!=e]
 
-```  
\ No newline at end of file
+```  
+
+<center>[上一个例子](158.md)    [下一个例子](160.md)</center>
diff --git a/md/16.md b/md/16.md
index 7a0cb11a..5cc1a5a3 100644
--- a/md/16.md
+++ b/md/16.md
@@ -18,4 +18,18 @@ Out[3]: <code object <module> at 0x0000000005DE75D0, file "<string>", line 1>
     
 In [4]: exec(r)
 helloworld
-```
\ No newline at end of file
+
+s  = """
+def f():
+    a = 100 % 52
+    print(a)
+f()
+"""
+r = compile(s,"<string>", "exec")
+exec(r)
+```
+
+输出
+48
+
+<center>[上一个例子](15.md)    [下一个例子](17.md)</center>
diff --git a/md/160.md b/md/160.md
index ac2171a6..057584e7 100644
--- a/md/160.md
+++ b/md/160.md
@@ -33,4 +33,6 @@ a[0][0] = 10 #
 
 ```python
 a = [[] for _ in range(3)]
-```  
\ No newline at end of file
+```  
+
+<center>[上一个例子](159.md)    [下一个例子](161.md)</center>
\ No newline at end of file
diff --git a/md/161.md b/md/161.md
index a6c0ff08..fc4b8173 100644
--- a/md/161.md
+++ b/md/161.md
@@ -24,4 +24,6 @@ In [2]: b = '@zglg'+'.com'
 In [3]: id(a)==id(b)
 Out[3]: False
 ```
-这与Cpython 编译优化相关，行为称为`字符串驻留`，但驻留的字符串中只包含字母，数字或下划线。   
\ No newline at end of file
+这与Cpython 编译优化相关，行为称为`字符串驻留`，但驻留的字符串中只包含字母，数字或下划线。   
+
+<center>[上一个例子](160.md)    [下一个例子](162.md)</center>
\ No newline at end of file
diff --git a/md/162.md b/md/162.md
index 8775d7ab..023623c3 100644
--- a/md/162.md
+++ b/md/162.md
@@ -24,4 +24,6 @@ Out[8]: 'python'
 ```
 这是因为具有相同值的不可变对象在Python中始终具有`相同的哈希值`
 
-由于存在`哈希冲突`，不同值的对象也可能具有相同的哈希值。  
\ No newline at end of file
+由于存在`哈希冲突`，不同值的对象也可能具有相同的哈希值。  
+
+<center>[上一个例子](161.md)    [下一个例子](163.md)</center>
\ No newline at end of file
diff --git a/md/163.md b/md/163.md
index bcf3aed0..3803d3e4 100644
--- a/md/163.md
+++ b/md/163.md
@@ -41,4 +41,6 @@ Out[64]: True
 当连续两次进行此操作, Python会将相同的内存地址分配给第二个对象，所以两个对象的id值是相同的.
 
 
-但是is行为却与之不同，通过打印顺序就可以看到。 
\ No newline at end of file
+但是is行为却与之不同，通过打印顺序就可以看到。 
+
+<center>[上一个例子](162.md)    [下一个例子](164.md)</center>
\ No newline at end of file
diff --git a/md/164.md b/md/164.md
index 4fa085f7..b177bade 100644
--- a/md/164.md
+++ b/md/164.md
@@ -21,4 +21,6 @@ In [65]: for i in range(5):
 ```
 为什么不是执行一次就退出？
 
-按照for在Python中的工作方式, i = 10 并不会影响循环。range(5)生成的下一个元素就被解包，并赋值给目标列表的变量`i`.   
\ No newline at end of file
+按照for在Python中的工作方式, i = 10 并不会影响循环。range(5)生成的下一个元素就被解包，并赋值给目标列表的变量`i`.   
+
+<center>[上一个例子](163.md)    [下一个例子](165.md)</center>
\ No newline at end of file
diff --git a/md/165.md b/md/165.md
index 209c6253..05653ab7 100644
--- a/md/165.md
+++ b/md/165.md
@@ -40,4 +40,6 @@ array = [5, 7, 9]
 等价于：
 ```python
 g = (x for x in [1,3,5] if [5,7,9].count(x) > 0)
-``` 
\ No newline at end of file
+``` 
+
+<center>[上一个例子](164.md)    [下一个例子](166.md)</center>
\ No newline at end of file
diff --git a/md/166.md b/md/166.md
index c413d061..17892f99 100644
--- a/md/166.md
+++ b/md/166.md
@@ -21,4 +21,6 @@ cpython会解释它为字典
 
 ```python
 empty = set() #YES!
-``` 
\ No newline at end of file
+``` 
+
+<center>[上一个例子](165.md)    [下一个例子](167.md)</center>
\ No newline at end of file
diff --git a/md/167.md b/md/167.md
index 95514f54..8dc039aa 100644
--- a/md/167.md
+++ b/md/167.md
@@ -43,4 +43,6 @@ c = (
 c.render()
 ```
 
-<img src="./img/image-20200129164339971.png" width="50%"/>    
\ No newline at end of file
+<img src="./img/image-20200129164339971.png" width="50%"/>    
+
+<center>[上一个例子](166.md)    [下一个例子](168.md)</center>
\ No newline at end of file
diff --git a/md/168.md b/md/168.md
index 259cac5a..ef13d066 100644
--- a/md/168.md
+++ b/md/168.md
@@ -71,4 +71,6 @@ division by zero
 
 完整的输出信息如下图片所示：
 
-<img src="./img/errors.png" width="50%"/>    
\ No newline at end of file
+<img src="./img/errors.png" width="50%"/>    
+
+<center>[上一个例子](167.md)    [下一个例子](169.md)</center>
\ No newline at end of file
diff --git a/md/169.md b/md/169.md
index 680b4e00..39021b49 100644
--- a/md/169.md
+++ b/md/169.md
@@ -51,4 +51,6 @@ im.filter(ImageFilter.CONTOUR).show()
 ```
 
 <img src="./img/pillow_filter.png" width="40%"/>
-  
\ No newline at end of file
+  
+
+<center>[上一个例子](168.md)    [下一个例子](170.md)</center>
\ No newline at end of file
diff --git a/md/17.md b/md/17.md
index ea3e9c1d..3b3c360c 100644
--- a/md/17.md
+++ b/md/17.md
@@ -10,6 +10,24 @@
 
 ```python
 In [1]: s = "1 + 3 +5"
-    ...: eval(s)
-Out[1]: 9
-```
\ No newline at end of file
+
+In [2]: eval(s)
+Out[2]: 9
+
+s = ["{'小汽车':10, '面包车':8}", "{'面包车':5}"]
+from collections import defaultdict
+d = defaultdict(int)
+
+for item in s:
+    my_dict = eval(item)
+    print(type(my_dict))
+    for key in my_dict:
+        d[key] += my_dict[key]
+print(d)
+
+<class 'dict'>
+<class 'dict'>
+defaultdict(<class 'int'>, {'小汽车': 10, '面包车': 13})
+```
+
+<center>[上一个例子](16.md)    [下一个例子](18.md)</center>
diff --git a/md/170.md b/md/170.md
index a6333116..8c1ac757 100644
--- a/md/170.md
+++ b/md/170.md
@@ -70,4 +70,6 @@ In [7]: b'\xc8\xcb\xc9\xfa\xbf\xe0\xb6\xcc\xa3\xac\xce\xd2\xd3\xc3Python'.decode
 Out[7]: '人生苦短，我用Python'
 ```
 
-目前，`chardet` 包支持的检测编码几十种。     
\ No newline at end of file
+目前，`chardet` 包支持的检测编码几十种。     
+
+<center>[上一个例子](169.md)    [下一个例子](171.md)</center>
\ No newline at end of file
diff --git a/md/171.md b/md/171.md
index c3bcf1b2..392e8080 100644
--- a/md/171.md
+++ b/md/171.md
@@ -45,4 +45,6 @@ print(p.static_method())
 print(p.class_method())
 # 1.5
 # 2.6666666666666665
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](170.md)    [下一个例子](172.md)</center>
\ No newline at end of file
diff --git a/md/172.md b/md/172.md
index 8ec353e9..8329547a 100644
--- a/md/172.md
+++ b/md/172.md
@@ -35,4 +35,6 @@ array([[0., 0., 0., 0., 0., 0., 0., 0.],
 
 此函数在为数组充填值，卷积中有重要应用。
 
-以上就是《python-small-examples》第 172 个小例子：NumPy 的 pad 填充方法。
\ No newline at end of file
+以上就是《python-small-examples》第 172 个小例子：NumPy 的 pad 填充方法。
+
+<center>[上一个例子](171.md)    [下一个例子](173.md)</center>
\ No newline at end of file
diff --git a/md/173.md b/md/173.md
index b8e4e79d..a29c0bef 100644
--- a/md/173.md
+++ b/md/173.md
@@ -20,4 +20,6 @@ In [2]: Z = np.diag(1+np.arange(4),k=-1)
  [0 0 0 4 0]]
  ```
  
- 其中，k 参数：大于0，表示与主对角线上移k，小于0下移k
\ No newline at end of file
+ 其中，k 参数：大于0，表示与主对角线上移k，小于0下移k
+
+<center>[上一个例子](172.md)    [下一个例子](174.md)</center>
\ No newline at end of file
diff --git a/md/174.md b/md/174.md
index f4066e81..0c50e7fe 100644
--- a/md/174.md
+++ b/md/174.md
@@ -26,4 +26,6 @@ Length: 20
 Categories (4, object): [D < C < B < A]
 ```
 
-分箱后，48分对应D，22分对应D，46对应D，84分对应B，...
\ No newline at end of file
+分箱后，48分对应D，22分对应D，46对应D，84分对应B，...
+
+<center>[上一个例子](173.md)    [下一个例子](175.md)</center>
\ No newline at end of file
diff --git a/md/175.md b/md/175.md
index 9a2b6d48..59b0b9ac 100644
--- a/md/175.md
+++ b/md/175.md
@@ -25,3 +25,6 @@ df.dropna(axis=0, how='all')
 df["a"].fillna(df["a"].mean(), inplace=True)
 ```
 
+
+
+<center>[上一个例子](174.md)    [下一个例子](176.md)</center>
\ No newline at end of file
diff --git a/md/176.md b/md/176.md
index ccd97fd1..6a9fa44f 100644
--- a/md/176.md
+++ b/md/176.md
@@ -31,4 +31,6 @@ pip --defualt-timeout = 600 install scrapy
 pip --defualt-timeout = 600 install scrapy -i https://pypi.tuna.tsinghua.edu.cn/simple 
 ```
 
-后面安装你可以直接复制我这行命令，安装包的速度会快很多。
\ No newline at end of file
+后面安装你可以直接复制我这行命令，安装包的速度会快很多。
+
+<center>[上一个例子](175.md)    [下一个例子](177.md)</center>
\ No newline at end of file
diff --git a/md/177.md b/md/177.md
index 800b091a..e8e6e4ff 100644
--- a/md/177.md
+++ b/md/177.md
@@ -23,4 +23,6 @@ jupyter notebook 是运行 python 非常好用的笔记本之一，尤其作数
 
 邀请伙伴直接进入你的notebook，多人协作，开发更快：
 
-多了一种选择，调换着使用它们会很不错！
\ No newline at end of file
+多了一种选择，调换着使用它们会很不错！
+
+<center>[上一个例子](176.md)    [下一个例子](178.md)</center>
\ No newline at end of file
diff --git a/md/178.md b/md/178.md
index 7ab4eb6c..4a0c6ce8 100644
--- a/md/178.md
+++ b/md/178.md
@@ -33,4 +33,6 @@ Out[28]:
 0    cd  edc.rc
 1     3       3
 2  d ef       4
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](177.md)    [下一个例子](179.md)</center>
\ No newline at end of file
diff --git a/md/179.md b/md/179.md
index bb6cd1c4..e2dc705c 100644
--- a/md/179.md
+++ b/md/179.md
@@ -31,3 +31,6 @@ d = {"male": 0, "female": 1}
 df["gender2"] = df["gender"].map(d)
 ```
 
+
+
+<center>[上一个例子](178.md)    [下一个例子](180.md)</center>
\ No newline at end of file
diff --git a/md/18.md b/md/18.md
index 5312f424..e4348925 100644
--- a/md/18.md
+++ b/md/18.md
@@ -26,4 +26,6 @@ i am tom,age18
 | 1000000000 | {:.2e}  | 1.00e+09  | 指数记法                     |
 | 18         | {:>10d} | ' 18'     | 右对齐 (默认, 宽度为10)      |
 | 18         | {:<10d} | '18 '     | 左对齐 (宽度为10)            |
-| 18         | {:^10d} | ' 18 '    | 中间对齐 (宽度为10)          |
\ No newline at end of file
+| 18         | {:^10d} | ' 18 '    | 中间对齐 (宽度为10)          |
+
+<center>[上一个例子](17.md)    [下一个例子](19.md)</center>
\ No newline at end of file
diff --git a/md/180.md b/md/180.md
index e61fa1b8..49283079 100644
--- a/md/180.md
+++ b/md/180.md
@@ -25,4 +25,6 @@ def c2n(x):
         return 80
 
 df['a'].apply(c2n)
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](179.md)    [下一个例子](181.md)</center>
\ No newline at end of file
diff --git a/md/181.md b/md/181.md
index 986d8979..482f7094 100644
--- a/md/181.md
+++ b/md/181.md
@@ -20,4 +20,6 @@ Out[53]:
 2    1.0
 3    3.0
 4    5.0
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](180.md)    [下一个例子](182.md)</center>
\ No newline at end of file
diff --git a/md/182.md b/md/182.md
index a318168a..02bf588c 100644
--- a/md/182.md
+++ b/md/182.md
@@ -1,18 +1,12 @@
 
 ```markdown
 @author jackzhenguo
-<<<<<<< HEAD
-@desc **完成数据下采样，调整步长由小时为天**
-=======
-@desc 完成数据下采样，步长小时调整为天
->>>>>>> c71f26070eb30c7a48f537edde0083fa46adc020
+@desc 完成数据下采样，调整步长由小时为天
 @tag
 @version 
 @date 2020/03/20
-```
-
-<<<<<<< HEAD
 第 182 个小例子：**完成数据下采样，调整步长由小时为天**
+```
 
 步长为小时的时间序列数据，有没有小技巧，快速完成下采样，采集成按天的数据呢？先生成测试数据：
 
@@ -21,46 +15,21 @@ import pandas as pd
 import numpy as np
 df = pd.DataFrame(np.random.randint(1,10,size=(240,3)), \
 columns = ['商品编码','商品销量','商品库存'])
-=======
-182 如何完成数据下采样，调整步长由小时为天？
-
-步长为小时的时间序列数据，有没有小技巧，快速完成下采样，采集成按天的数据呢？
-先生成测试数据：
-```python
-import pandas as pd
-import numpy as np
-```
-
-```python
-df = pd.DataFrame(np.random.randint(1,10,size=(240,3)), \
-columns = ['商品编码','商品销量','商品库存'])
 ```
 
 ```python
->>>>>>> c71f26070eb30c7a48f537edde0083fa46adc020
 df.index = pd.util.testing.makeDateIndex(240,freq='H')
 df
-```
-
-<<<<<<< HEAD
 使用 resample 方法，合并为天(D)
-
-=======
-生成 240 行步长为小时间隔的数据：
-
-![](../img/182-1.png)
+```
 
 小技巧，使用 resample 方法，合并为天(D)
->>>>>>> c71f26070eb30c7a48f537edde0083fa46adc020
 ```python
 day_df = df.resample("D")["商品销量"].sum().to_frame()
 day_df
 ```
 
-<<<<<<< HEAD
-结果如下，10行，240小时，正好为 10 days
-=======
-结果如下，10行，240小时，正好为 10 days:
+果如下，10行，240小时，正好为 10 days:
+
 
-![](../img/182-2.png)
->>>>>>> c71f26070eb30c7a48f537edde0083fa46adc020
+<center>[上一个例子](181.md)    [下一个例子](183.md)</center>
\ No newline at end of file
diff --git a/md/183.md b/md/183.md
index 23eb43a4..b446ed36 100644
--- a/md/183.md
+++ b/md/183.md
@@ -61,4 +61,6 @@ df.index = pd.util.testing.makeDateIndex(10,freq='H')
 2000-01-01 07:00:00 300 129 474
 2000-01-01 08:00:00 966 372 835
 2000-01-01 09:00:00 687 493 910
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](182.md)    [下一个例子](184.md)</center>
\ No newline at end of file
diff --git a/md/184.md b/md/184.md
index c27c3235..fc582f4d 100644
--- a/md/184.md
+++ b/md/184.md
@@ -57,4 +57,6 @@ Age 列 177 个 null 值
 
 Cabin 列 687 个 null 值
 
-Embarked 列 2 个 null 值
\ No newline at end of file
+Embarked 列 2 个 null 值
+
+<center>[上一个例子](183.md)    [下一个例子](185.md)</center>
\ No newline at end of file
diff --git a/md/185.md b/md/185.md
index 9c798e5d..3496470d 100644
--- a/md/185.md
+++ b/md/185.md
@@ -32,4 +32,6 @@ df3 = df[cols]
 df3
 ```
 
-也能得到方法1的结果。
\ No newline at end of file
+也能得到方法1的结果。
+
+<center>[上一个例子](184.md)    [下一个例子](186.md)</center>
\ No newline at end of file
diff --git a/md/186.md b/md/186.md
index 16da4809..08ca6887 100644
--- a/md/186.md
+++ b/md/186.md
@@ -40,3 +40,6 @@ df["words_count"] = df["Title"].str.count(" ") + 1
 df[["Title","words_count"]]
 ```
 
+
+
+<center>[上一个例子](185.md)    [下一个例子](187.md)</center>
\ No newline at end of file
diff --git a/md/187.md b/md/187.md
index 4e6308b5..0be0fdb3 100644
--- a/md/187.md
+++ b/md/187.md
@@ -28,4 +28,6 @@ df['bsplit'] = df['b'].str.split(':')
 ```python
 df['amins'] = df['asplit'].apply(lambda x: int(x[0])*60 + int(x[1]))
 df['bmins'] = df['bsplit'].apply(lambda x: int(x[0])*60 + int(x[1]))
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](186.md)    [下一个例子](188.md)</center>
\ No newline at end of file
diff --git a/md/188.md b/md/188.md
index 313d8538..97057f18 100644
--- a/md/188.md
+++ b/md/188.md
@@ -66,4 +66,6 @@ zip_code variable  value
 7    131415   retail      4
 ```
 
-melt 透视数据后，因为组合多个列为1列，所以数据一定变长。
\ No newline at end of file
+melt 透视数据后，因为组合多个列为1列，所以数据一定变长。
+
+<center>[上一个例子](187.md)    [下一个例子](189.md)</center>
\ No newline at end of file
diff --git a/md/189.md b/md/189.md
index c4dcb6a0..447bd994 100644
--- a/md/189.md
+++ b/md/189.md
@@ -46,4 +46,6 @@ df_melt2.pivot(index='zip_code',columns='variable')
 
 index 设定第一个轴，为 zip_code，columns 设定哪些列或哪个列的不同取值组合为一个轴，此处设定为 variable 列，它一共有 2 种不同的取值，分别为 factory, retail，pivot 透视后变为列名，也就是 axis = 1 的轴
 
-pivot 方法没有聚合功能，它的升级版为 `pivot_table` 方法，能对数据聚合。
\ No newline at end of file
+pivot 方法没有聚合功能，它的升级版为 `pivot_table` 方法，能对数据聚合。
+
+<center>[上一个例子](188.md)    [下一个例子](190.md)</center>
\ No newline at end of file
diff --git a/md/19.md b/md/19.md
index 88c85901..776e9b44 100644
--- a/md/19.md
+++ b/md/19.md
@@ -20,4 +20,6 @@ In [4]: sorted(a,key=lambda x: x['age'],reverse=False)
 Out[4]:
 [{'name': 'xiaoming', 'age': 18, 'gender': 'male'},
  {'name': 'xiaohong', 'age': 20, 'gender': 'female'}]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](18.md)    [下一个例子](20.md)</center>
\ No newline at end of file
diff --git a/md/190.md b/md/190.md
index d76fcb7d..9b1d6139 100644
--- a/md/190.md
+++ b/md/190.md
@@ -1,9 +1,31 @@
 
 ```markdown
 @author jackzhenguo
-@desc
+@desc 随机读取文件的K行，生成N个
 @tag
 @version 
 @date 2020/03/28
 ```
-		     
\ No newline at end of file
+
+### 第190个小例子： 随机读取文件的K行，生成N个
+
+```python
+def random_lines_save(filename,gen_file_cnt=10):
+    """
+    随机选取文件的某些行并保存，想要生成这类文件的个数由参数
+    @param: gen_file_cnt 指定 
+    
+    @param: filename 读入文件的完整路径
+    @param: gen_file_cnt 想要产生的文件个数
+    """
+    df = pd.read_excel(filename)
+    for i in range(gen_file_cnt):
+        n = random.randint(1,len(df))
+        dfs = df.sample(n)
+        dfs.to_excel(str(n)+".xlsx",index=False)
+        print(str(n)+".xlsx")
+```
+
+这是一个很实用的函数，用于随机生成K行N个文件，使用场景：原来的文件行数较多，想从中随机提取组合N个文件时。
+
+<center>[上一个例子](189.md)    [下一个例子](191.md)</center>
\ No newline at end of file
diff --git a/md/191.md b/md/191.md
index c1de5a3b..cd572176 100644
--- a/md/191.md
+++ b/md/191.md
@@ -1,9 +1,42 @@
 
 ```markdown
 @author jackzhenguo
-@desc
+@desc 格式化Pandas的时间列
 @tag
 @version 
 @date 2020/03/29
 ```
-		     
\ No newline at end of file
+
+### 第191个小例子： 格式化Pandas的时间列
+
+
+
+```python
+import pandas as pd 
+from datetime import datetime, time 
+
+def series_dt_fmt(s:pd.Series,fmt:str)-> pd.Series: 
+        """
+        根据fmt格式，格式化s列
+        s列是datetime 或者 datetime的str类型，如'2020-12-30 11:44:00' 
+        """
+        st = pd.to_datetime(s)
+        return st.apply(lambda t: datetime.strftime(t,fmt))
+```
+
+别看只有两行代码，却能实现更加丰富的功能，相比pandas，支持直接返回时分等格式：
+
+```python
+s = pd.Series(['2020-12-30 11:44:00','2020-12-30 11:20:10'])
+
+# 只保留时分
+fmt = '%H:%M'
+series_dt_fmt(s,fmt)
+
+# 输出结果
+0    11:44
+1    11:20
+dtype: object
+```
+
+<center>[上一个例子](190.md)    [下一个例子](192.md)</center>
\ No newline at end of file
diff --git a/md/192.md b/md/192.md
index de3f35db..c11bebd8 100644
--- a/md/192.md
+++ b/md/192.md
@@ -6,4 +6,32 @@
 @version 
 @date 2020/03/30
 ```
-		     
\ No newline at end of file
+
+### 192: 创建SQLite连接
+
+编写一个Python程序，创建一个SQLite数据库，并与数据库连接，打印SQLite数据库的版本
+
+一种解决方法：
+
+```python
+import sqlite3
+try:
+   sqlite_Connection = sqlite3.connect('temp.db')
+   conn = sqlite_Connection.cursor()
+   print("连接到 SQLite.")
+   sqlite_select_Query = "select sqlite_version();"
+   conn.execute(sqlite_select_Query)
+   record = conn.fetchall()
+   print("SQLite 数据库的版本是 ", record)
+   conn.close()
+except sqlite3.Error as error:
+   print("连接到SQLite出错：", error)
+finally:
+   if (sqlite_Connection):
+       sqlite_Connection.close()
+       print("关闭SQLite连接")
+```
+
+以上就是第192例，希望对你有用，欢迎点赞支持。
+
+<center>[上一个例子](191.md)    [下一个例子](193.md)</center>
\ No newline at end of file
diff --git a/md/193.md b/md/193.md
index 03608d08..c84b6c68 100644
--- a/md/193.md
+++ b/md/193.md
@@ -6,4 +6,28 @@
 @version 
 @date 2020/03/31
 ```
-		     
\ No newline at end of file
+
+### 193 json对象转python对象
+python的`json`模块`loads`方法将json对象转为字典，如下所示：
+
+```python
+In [1]: import json                                                             
+
+In [2]: json_obj =  '{ "Name":"David", "Class":"I", "Age":6 }'                  
+
+In [3]: python_obj = json.loads(json_obj)                                       
+
+In [4]: type(python_obj)                                                        
+Out[4]: dict
+```
+
+打印查看相关属性
+```python
+print("\nJSON data:")
+print(python_obj)
+print("\nName: ",python_obj["Name"])
+print("Class: ",python_obj["Class"])
+print("Age: ",python_obj["Age"]) 
+```
+
+<center>[上一个例子](192.md)    [下一个例子](194.md)</center>
\ No newline at end of file
diff --git a/md/194.md b/md/194.md
index 38807c7f..8ac0ff98 100644
--- a/md/194.md
+++ b/md/194.md
@@ -6,4 +6,45 @@
 @version 
 @date 2020/04/01
 ```
-		     
\ No newline at end of file
+#### 194 python对象转json对象
+
+```python
+import json
+# a Python object (dict):
+python_obj = {
+  "name": "David",
+  "class":"I",
+  "age": 6  
+}
+print(type(python_obj))
+```
+
+使用`json.dumps`方法转化为json对象：
+```
+# convert into JSON:
+j_data = json.dumps(python_obj)
+
+# result is a JSON string:
+print(j_data)
+```
+
+##### 带格式转为json
+
+若字典转化为json对象后，保证键有序，且缩进4格，如何做到？
+
+```python
+json.dumps(j_str, sort_keys=True, indent=4)
+```
+
+例子：
+
+```python
+import json
+j_str = {'4': 5, '6': 7, '1': 3, '2': 4}
+print(json.dumps(j_str, sort_keys=True, indent=4))
+```
+
+
+
+<center>[上一个例子](193.md)    [下一个例子](195.md)</center>
+
diff --git a/md/195.md b/md/195.md
index b81246d8..f03c8cfe 100644
--- a/md/195.md
+++ b/md/195.md
@@ -6,4 +6,29 @@
 @version 
 @date 2020/04/02
 ```
-		     
\ No newline at end of file
+#### 195 发现列表前3个最大或最小数
+
+使用堆模块 heapq 里的 nlargest 方法：
+
+```python
+import heapq as hq
+nums_list = [25, 35, 22, 85, 14, 65, 75, 22, 58]
+
+# Find three largest values
+largest_nums = hq.nlargest(3, nums_list)
+print(largest_nums)
+```
+
+相应的求最小3个数，使用堆模块 heapq 里的 nsmallest 方法：
+
+```python
+import heapq as hq
+nums_list = [25, 35, 22, 85, 14, 65, 75, 22, 58]
+smallest_nums = hq.nsmallest(3, nums_list)
+print("\nThree smallest numbers are:", smallest_nums)
+```
+
+
+
+
+<center>[上一个例子](194.md)    [下一个例子](196.md)</center>
\ No newline at end of file
diff --git a/md/196.md b/md/196.md
index f3dfe76f..7a1342bd 100644
--- a/md/196.md
+++ b/md/196.md
@@ -6,4 +6,19 @@
 @version 
 @date 2020/04/03
 ```
-		     
\ No newline at end of file
+
+### 196 使用堆排序列表为升序
+
+使用 heapq 模块，首先对列表建堆，默认建立小根堆，调用len(nums) 次heapop：
+
+```python
+import heapq as hq
+
+nums_list = [18, 14, 10, 9, 8, 7, 9, 3, 2, 4, 1]
+hq.heapify(nums_list)
+s_result = [hq.heappop(nums_list) for _ in range(len(nums_list))]
+print(s_result)
+```
+
+
+<center>[上一个例子](195.md)    [下一个例子](197.md)</center>
\ No newline at end of file
diff --git a/md/197.md b/md/197.md
index 6564e6fc..3d166a53 100644
--- a/md/197.md
+++ b/md/197.md
@@ -6,4 +6,41 @@
 @version 
 @date 2020/04/04
 ```
-		     
\ No newline at end of file
+
+下面正则适用于提取正整数和大于0的浮点数，参看我的，若有疏漏欢迎补充。
+
+```python
+>>> import re
+>>> pat_integ = '[1-9]+\d*'
+>>> pat_float0 = '0\.\d+[1-9]'
+>>> pat_float1 = '[1-9]\d*\.d+'
+>>> pat = 'r%s|%s|%s'%(pat_float_0,pat_float_1,pat_integ)
+>>> re.findall(pat, r)
+['0.78', '3446.73', '0.91', '13642.95', '1.06', '2672.12', '3000']
+```
+
+排除这些串：
+
+000
+000100
+0.00
+000.00
+
+
+解释：`*`表示前一个字符出现0次或多次，`+`表示前一个字符出现1次或多次，`\d`表示数字[0-9]，`[1-9]`表示1,2,3,4,5,6,7,8,9，`\.`表示小数点
+
+主要考虑：正整数最左侧一位大于0，大于1的浮点数必须以[1-9]开始，大于0小于1的浮点数小数点前只有1个0.
+
+
+
+
+Day163：使用Python正则 提取出输入一段文字中的所有浮点数和整数 #Python拆书1# 
+
+例如： 截至收盘，上证指数涨0.78%，报3446.73点，深证成指涨0.91%，报13642.95点，创业板指涨1.06%，报2672.12点。指数午后震荡走高，碳中和概念强者恒强，板块内上演涨停潮，环保、物业、特高压板块午后涨幅扩大，数字货币板块尾盘冲高，钢铁、煤炭、有色板块全天较为低迷，题材股午后整体回暖，两市上涨个股逾3000家，赚钱效益较好。
+
+提取出所有浮点数和整数： 0.78, 3446.73, 0.91,13642.95 等
+
+
+		     
+
+<center>[上一个例子](196.md)    [下一个例子](198.md)</center>
diff --git a/md/198.md b/md/198.md
index 54af1fb6..45709457 100644
--- a/md/198.md
+++ b/md/198.md
@@ -6,4 +6,23 @@
 @version 
 @date 2020/04/05
 ```
-		     
\ No newline at end of file
+## 1 常见算术运算
+
+```python
+x, y = 3, 2
+print(x + y) # = 5 
+print(x - y) # = 1 
+print(x * y) # = 6 
+print(x / y) # = 1.5 
+print(x // y) # = 1 
+print(x % y) # = 1 
+print(-x) # = -3 
+print(abs(-x)) # = 3 
+print(int(3.9)) # = 3 
+print(float(x)) # = 3.0 
+print(x ** y) # = 9
+```
+
+大多数操作符都是不言自明的。注意，`//`运算符执行整数除法。结果是一个向下舍入的整数值（例如，3//2==1）		     
+
+<center>[上一个例子](197.md)    [下一个例子](199.md)</center>
diff --git a/md/199.md b/md/199.md
index b6f23790..7df7ea58 100644
--- a/md/199.md
+++ b/md/199.md
@@ -1,9 +1,29 @@
 
 ```markdown
 @author jackzhenguo
-@desc
+@desc 求两点球面距离
 @tag
 @version 
 @date 2020/04/06
 ```
-		     
\ No newline at end of file
+
+```python
+EARTH_RADIUS = 6378.137
+
+import math
+# 角度弧度计算公式
+def get_radian(degree):
+    return degree * 3.1415926 / 180.0
+# 根据经纬度计算两点之间的距离，得到的单位是 千米
+def get_distance(lat1,lng1,lat2,lng2):
+    radLat1 = get_radian(lat1)
+    radLat2 = get_radian(lat2)
+    a = radLat1 - radLat2 # 两点纬度差
+    b = get_radian(lng1) - get_radian(lng2); # 两点的经度差
+    s = 2 * math.asin(math.sqrt(math.pow(math.sin(a / 2), 2) + 
+                                math.cos(radLat1) * math.cos(radLat2) * math.pow(math.sin(b / 2), 2)));
+    s = s * EARTH_RADIUS
+    return s
+```		     
+
+<center>[上一个例子](198.md)    [下一个例子](200.md)</center>
\ No newline at end of file
diff --git a/md/2.md b/md/2.md
index 3982457a..aa58feaa 100644
--- a/md/2.md
+++ b/md/2.md
@@ -8,18 +8,21 @@
 
 十进制转换为二进制：
 ```python
-In [2]: bin(10)                                                                 
-Out[2]: '0b1010'
+In [1]: bin(10)                                                                 
+Out[1]: '0b1010'
 ```
 
 十进制转换为八进制：
 ```python
-In [3]: oct(9)                                                                  
-Out[3]: '0o11'
+In [2]: oct(9)                                                                  
+Out[2]: '0o11'
 ```
 
 十进制转换为十六进制：
 ```python
-In [4]: hex(15)                                                                 
-Out[4]: '0xf'
+In [3]: hex(15)                                                                 
+Out[3]: '0xf'
 ```
+
+
+<center>[上一个例子](1.md)    [下一个例子](3.md)</center>
\ No newline at end of file
diff --git a/md/20.md b/md/20.md
index e5dfef1f..9517aecd 100644
--- a/md/20.md
+++ b/md/20.md
@@ -16,4 +16,6 @@ Out[182]: 11
 
 In [185]: sum(a,10) #求和的初始值为10
 Out[185]: 21
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](19.md)    [下一个例子](21.md)</center>
\ No newline at end of file
diff --git a/md/200.md b/md/200.md
index bd31655d..ff873a60 100644
--- a/md/200.md
+++ b/md/200.md
@@ -1,9 +1,29 @@
 
 ```markdown
 @author jackzhenguo
-@desc
+@desc 获取文件编码
 @tag
 @version 
 @date 2020/04/07
 ```
-		     
\ No newline at end of file
+
+```python
+import chardet
+from chardet import UniversalDetector
+
+def get_encoding(file):
+    with open(file, "rb") as f:
+        cs = chardet.detect(f.read())
+        return cs['encoding']
+    
+    detector = UniversalDetector()
+    with open(file, "rb") as f:
+        for line in f.readlines():
+            detector.feed(line)
+            if detector.done:
+                break
+        detector.close()
+    return detector.result
+```
+
+<center>[上一个例子](199.md)    [下一个例子](201.md)</center>
\ No newline at end of file
diff --git a/md/201.md b/md/201.md
index 0ecfc0fe..7e323468 100644
--- a/md/201.md
+++ b/md/201.md
@@ -1,9 +1,25 @@
 
 ```markdown
 @author jackzhenguo
-@desc
+@desc 格式化json串
 @tag
 @version 
 @date 2020/04/08
 ```
-		     
\ No newline at end of file
+```python
+import json
+
+
+def format_json(json_str: str):
+    dic = json.loads(json_str)
+
+    js = json.dumps(dic,
+                    sort_keys=True,
+                    ensure_ascii=False,
+                    indent=4,
+                    separators=(', ', ': '))
+    return js
+```
+		     
+
+<center>[上一个例子](200.md)    [下一个例子](202.md)</center>
\ No newline at end of file
diff --git a/md/202.md b/md/202.md
index 79489391..c34f9255 100644
--- a/md/202.md
+++ b/md/202.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/09
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](201.md)    [下一个例子](203.md)</center>
\ No newline at end of file
diff --git a/md/203.md b/md/203.md
index 13ebb419..ddc0eea3 100644
--- a/md/203.md
+++ b/md/203.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/10
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](202.md)    [下一个例子](204.md)</center>
\ No newline at end of file
diff --git a/md/204.md b/md/204.md
index 40d77782..61845d72 100644
--- a/md/204.md
+++ b/md/204.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/11
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](203.md)    [下一个例子](205.md)</center>
\ No newline at end of file
diff --git a/md/205.md b/md/205.md
index 819a1c9f..eee98506 100644
--- a/md/205.md
+++ b/md/205.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/12
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](204.md)    [下一个例子](206.md)</center>
\ No newline at end of file
diff --git a/md/206.md b/md/206.md
index e813bb54..a98cb77d 100644
--- a/md/206.md
+++ b/md/206.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/13
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](205.md)    [下一个例子](207.md)</center>
\ No newline at end of file
diff --git a/md/207.md b/md/207.md
index a7ddeac8..5e385f74 100644
--- a/md/207.md
+++ b/md/207.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/14
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](206.md)    [下一个例子](208.md)</center>
\ No newline at end of file
diff --git a/md/208.md b/md/208.md
index 1b3371ee..502f14b2 100644
--- a/md/208.md
+++ b/md/208.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/15
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](207.md)    [下一个例子](209.md)</center>
\ No newline at end of file
diff --git a/md/209.md b/md/209.md
index 9ec7d645..1d1ffe09 100644
--- a/md/209.md
+++ b/md/209.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/16
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](208.md)    [下一个例子](210.md)</center>
\ No newline at end of file
diff --git a/md/21.md b/md/21.md
index 8f108abe..ee62cf7f 100644
--- a/md/21.md
+++ b/md/21.md
@@ -24,4 +24,6 @@ def excepter(f):
             if i == n:
                 print(f'spending time:{round(t2-t1,2)}')
     return wrapper
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](20.md)    [下一个例子](22.md)</center>
\ No newline at end of file
diff --git a/md/210.md b/md/210.md
index a1ae84b0..1ab19e46 100644
--- a/md/210.md
+++ b/md/210.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/17
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](209.md)    [下一个例子](211.md)</center>
\ No newline at end of file
diff --git a/md/211.md b/md/211.md
index 83029be4..ffa2aea2 100644
--- a/md/211.md
+++ b/md/211.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/18
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](210.md)    [下一个例子](212.md)</center>
\ No newline at end of file
diff --git a/md/212.md b/md/212.md
index cb1bf8b8..3fde4ad6 100644
--- a/md/212.md
+++ b/md/212.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/19
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](211.md)    [下一个例子](213.md)</center>
\ No newline at end of file
diff --git a/md/213.md b/md/213.md
index 63d6816c..46cdecb4 100644
--- a/md/213.md
+++ b/md/213.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/20
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](212.md)    [下一个例子](214.md)</center>
\ No newline at end of file
diff --git a/md/214.md b/md/214.md
index cc93ab36..e57dcf16 100644
--- a/md/214.md
+++ b/md/214.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/21
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](213.md)    [下一个例子](215.md)</center>
\ No newline at end of file
diff --git a/md/215.md b/md/215.md
index 5bab8514..2efc7724 100644
--- a/md/215.md
+++ b/md/215.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/22
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](214.md)    [下一个例子](216.md)</center>
\ No newline at end of file
diff --git a/md/216.md b/md/216.md
index 36b5f3d4..fec07d1e 100644
--- a/md/216.md
+++ b/md/216.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/23
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](215.md)    [下一个例子](217.md)</center>
\ No newline at end of file
diff --git a/md/217.md b/md/217.md
index bd89698c..ae44bcb2 100644
--- a/md/217.md
+++ b/md/217.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/24
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](216.md)    [下一个例子](218.md)</center>
\ No newline at end of file
diff --git a/md/218.md b/md/218.md
index 6d7349b5..86e52cec 100644
--- a/md/218.md
+++ b/md/218.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/25
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](217.md)    [下一个例子](219.md)</center>
\ No newline at end of file
diff --git a/md/219.md b/md/219.md
index e62584a2..0a2d61dc 100644
--- a/md/219.md
+++ b/md/219.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/26
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](218.md)    [下一个例子](220.md)</center>
\ No newline at end of file
diff --git a/md/22.md b/md/22.md
index c63f993b..3dd911ef 100644
--- a/md/22.md
+++ b/md/22.md
@@ -45,4 +45,6 @@ def h():
 
 h()
 print(i)
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](21.md)    [下一个例子](23.md)</center>
\ No newline at end of file
diff --git a/md/220.md b/md/220.md
index f9390a55..8239b0ce 100644
--- a/md/220.md
+++ b/md/220.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/27
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](219.md)    [下一个例子](221.md)</center>
\ No newline at end of file
diff --git a/md/221.md b/md/221.md
index be4076dc..6ec83e98 100644
--- a/md/221.md
+++ b/md/221.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/28
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](220.md)    [下一个例子](222.md)</center>
\ No newline at end of file
diff --git a/md/222.md b/md/222.md
index cc1acb6e..44ba88fd 100644
--- a/md/222.md
+++ b/md/222.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/29
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](221.md)    [下一个例子](223.md)</center>
\ No newline at end of file
diff --git a/md/223.md b/md/223.md
index 8467de00..b5ea2d36 100644
--- a/md/223.md
+++ b/md/223.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/04/30
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](222.md)    [下一个例子](224.md)</center>
\ No newline at end of file
diff --git a/md/224.md b/md/224.md
index 1350f6d8..1c6c6d78 100644
--- a/md/224.md
+++ b/md/224.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/01
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](223.md)    [下一个例子](225.md)</center>
\ No newline at end of file
diff --git a/md/225.md b/md/225.md
index e5265707..4d96d885 100644
--- a/md/225.md
+++ b/md/225.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/02
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](224.md)    [下一个例子](226.md)</center>
\ No newline at end of file
diff --git a/md/226.md b/md/226.md
index 03501fe0..786b30b0 100644
--- a/md/226.md
+++ b/md/226.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/03
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](225.md)    [下一个例子](227.md)</center>
\ No newline at end of file
diff --git a/md/227.md b/md/227.md
index 70486d56..d74d365b 100644
--- a/md/227.md
+++ b/md/227.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/04
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](226.md)    [下一个例子](228.md)</center>
\ No newline at end of file
diff --git a/md/228.md b/md/228.md
index 0963c4dc..f64a8378 100644
--- a/md/228.md
+++ b/md/228.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/05
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](227.md)    [下一个例子](229.md)</center>
\ No newline at end of file
diff --git a/md/229.md b/md/229.md
index 2ac03b6e..a06fca93 100644
--- a/md/229.md
+++ b/md/229.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/06
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](228.md)    [下一个例子](230.md)</center>
\ No newline at end of file
diff --git a/md/23.md b/md/23.md
index 62785f91..ac994a9e 100644
--- a/md/23.md
+++ b/md/23.md
@@ -6,10 +6,36 @@
 
 #### 23 交换两元素
 
+理解交换两个元素，需要首先明白什么是pack，什么是unpack
+
+```python
+In [1]: a=[3,1]                                                                 
+
+# unpack
+In [2]: a0,a1 = a                                                               
+
+In [3]: a0                                                                      
+Out[3]: 3
+
+In [4]: a1                                                                      
+Out[4]: 1
+
+# pack
+In [5]: b = a0, a1                                                              
+
+In [6]: b                                                                       
+Out[6]: (3, 1)
+
+```
+
+所以下面 `b,a = a,b` 交换2个元素的过程，实际是先pack a,b为元组 (a,b)，然后再unpack (a,b) 给 b, a的过程
+
 ```python
 def swap(a, b):
     return b, a
 
 
 print(swap(1, 0))  # (0,1)
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](22.md)    [下一个例子](24.md)</center>
diff --git a/md/230.md b/md/230.md
index 383df973..603c1678 100644
--- a/md/230.md
+++ b/md/230.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/07
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](229.md)    [下一个例子](231.md)</center>
\ No newline at end of file
diff --git a/md/231.md b/md/231.md
index ce09e38f..5018855f 100644
--- a/md/231.md
+++ b/md/231.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/08
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](230.md)    [下一个例子](232.md)</center>
\ No newline at end of file
diff --git a/md/232.md b/md/232.md
index 0fe7f0a3..70a47be0 100644
--- a/md/232.md
+++ b/md/232.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/09
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](231.md)    [下一个例子](233.md)</center>
\ No newline at end of file
diff --git a/md/233.md b/md/233.md
index 8aa98ec3..f9384f14 100644
--- a/md/233.md
+++ b/md/233.md
@@ -6,4 +6,6 @@
 @version 
 @date 2020/05/10
 ```
-		     
\ No newline at end of file
+		     
+
+<center>[上一个例子](232.md)    [下一个例子](234.md)</center>
\ No newline at end of file
diff --git a/md/24.md b/md/24.md
index fa6aafd0..22af8a0c 100644
--- a/md/24.md
+++ b/md/24.md
@@ -9,14 +9,14 @@
 ```python
 In [31]: def f():
     ...:     print('i\'m f')
-    ...:
 
 In [32]: def g():
     ...:     print('i\'m g')
-    ...:
 
 In [33]: [f,g][1]()
 i'm g
 ```
 
-创建函数对象的list，根据想要调用的index，方便统一调用。
\ No newline at end of file
+创建函数对象的list，根据想要调用的index，方便统一调用。
+
+<center>[上一个例子](23.md)    [下一个例子](25.md)</center>
\ No newline at end of file
diff --git a/md/25.md b/md/25.md
index 3899d98f..f1e70f29 100644
--- a/md/25.md
+++ b/md/25.md
@@ -7,7 +7,10 @@
 #### 25 生成逆序序列
 
 ```python
-list(range(10,-1,-1)) # [10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0]
+list(range(10,-1,-1))
+# [10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0]
 ```
 
-第三个参数为负时，表示从第一个参数开始递减，终止到第二个参数(不包括此边界)
\ No newline at end of file
+第三个参数为负时，表示从第一个参数开始递减，终止到第二个参数(不包括此边界)
+
+<center>[上一个例子](24.md)    [下一个例子](26.md)</center>
\ No newline at end of file
diff --git a/md/26.md b/md/26.md
index 548597f6..7cf30fc6 100644
--- a/md/26.md
+++ b/md/26.md
@@ -73,4 +73,6 @@ a POSITIONAL_OR_KEYWORD
 b VAR_POSITIONAL
 ```
 
-可以看到参数`a`既可以是位置参数也可是关键字参数。
\ No newline at end of file
+可以看到参数`a`既可以是位置参数也可是关键字参数。
+
+<center>[上一个例子](25.md)    [下一个例子](27.md)</center>
\ No newline at end of file
diff --git a/md/27.md b/md/27.md
index 622be3de..cc13a91b 100644
--- a/md/27.md
+++ b/md/27.md
@@ -68,4 +68,6 @@ In [15]: a_slice
 Out[15]: [0, 9, 7, 5]
 ```
 
-频繁使用同一切片的操作可使用slice对象抽出来，复用的同时还能提高代码可读性。
\ No newline at end of file
+频繁使用同一切片的操作可使用slice对象抽出来，复用的同时还能提高代码可读性。
+
+<center>[上一个例子](26.md)    [下一个例子](28.md)</center>
\ No newline at end of file
diff --git a/md/28.md b/md/28.md
index d564e80d..a6e167f6 100644
--- a/md/28.md
+++ b/md/28.md
@@ -32,4 +32,6 @@ print(f'更长的列表是{r}')
 
 - 参数v的可能取值为`*lists`，也就是 `tuple` 的一个元素。
 
-- `lambda`函数返回值，等于`lambda v`冒号后表达式的返回值。
\ No newline at end of file
+- `lambda`函数返回值，等于`lambda v`冒号后表达式的返回值。
+
+<center>[上一个例子](27.md)    [下一个例子](29.md)</center>
\ No newline at end of file
diff --git a/md/29.md b/md/29.md
index 93e8f31c..9998a648 100644
--- a/md/29.md
+++ b/md/29.md
@@ -9,15 +9,24 @@
 创建数据字典
 
 ```python
+# 方法1：使用dict
 In [1]: dict()
 Out[1]: {}
-
 In [2]: dict(a='a',b='b')
 Out[2]: {'a': 'a', 'b': 'b'}
 
+# 方法2：zip
 In [3]: dict(zip(['a','b'],[1,2]))
 Out[3]: {'a': 1, 'b': 2}
 
+# 方法3：嵌入元组的列表
 In [4]: dict([('a',1),('b',2)])
 Out[4]: {'a': 1, 'b': 2}
-```
\ No newline at end of file
+
+# 方法4：自典型字符串
+In [1]: s = "{'a':1, 'b':2}"                                                    
+In [2]: eval(s)                                                                 
+Out[2]: {'a': 1, 'b': 2}
+```
+
+<center>[上一个例子](28.md)    [下一个例子](30.md)</center>
diff --git a/md/3.md b/md/3.md
index 36484fd3..58ba87fd 100644
--- a/md/3.md
+++ b/md/3.md
@@ -16,4 +16,6 @@ Out[1]: 'A'
 ```python
 In [1]: ord('A')
 Out[1]: 65
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](2.md)    [下一个例子](4.md)</center>
\ No newline at end of file
diff --git a/md/30.md b/md/30.md
index c4e79784..7e965dbd 100644
--- a/md/30.md
+++ b/md/30.md
@@ -13,4 +13,6 @@ In [1]: frozenset([1,1,3,2,3])
 Out[1]: frozenset({1, 2, 3})
 ```
 
-因为不可修改，所以没有像`set`那样的`add`和`pop`方法
\ No newline at end of file
+因为不可修改，所以没有像`set`那样的`add`和`pop`方法
+
+<center>[上一个例子](29.md)    [下一个例子](31.md)</center>
\ No newline at end of file
diff --git a/md/31.md b/md/31.md
index dd8c1bdc..c9a73545 100644
--- a/md/31.md
+++ b/md/31.md
@@ -9,8 +9,37 @@
 返回一个set对象，集合内不允许有重复元素：
 
 ```python
-In [159]: a = [1,4,2,3,1]
+In [1]: a = [1,4,2,3,1]
 
-In [160]: set(a)
-Out[160]: {1, 2, 3, 4}
-```
\ No newline at end of file
+In [2]: set(a)
+Out[2]: {1, 2, 3, 4}
+
+In [3]: b = set(a)
+
+In [4]: b.add(5)
+
+In [5]: b
+Out[5]: {1, 2, 3, 4, 5}
+
+In [6]: b.pop()
+Out[6]: 1
+
+In [7]: b
+Out[7]: {2, 3, 4, 5}
+
+In [8]: b.pop()
+Out[8]: 2
+
+In [9]: b
+Out[9]: {3, 4, 5}
+
+# 注意pop删除集合内任意一个元素
+In [10]: help(b.pop)
+Help on built-in function pop:
+
+pop(...) method of builtins.set instance
+    Remove and return an arbitrary set element.
+    Raises KeyError if the set is empty.
+```
+
+<center>[上一个例子](30.md)    [下一个例子](32.md)</center>
diff --git a/md/32.md b/md/32.md
index cb6a40ae..0d70cbac 100644
--- a/md/32.md
+++ b/md/32.md
@@ -13,4 +13,6 @@
  In [17]: i_am_tuple = tuple(i_am_list)
  In [18]: i_am_tuple
  Out[18]: (1, 3, 5)
- ```
\ No newline at end of file
+ ```
+
+<center>[上一个例子](31.md)    [下一个例子](33.md)</center>
\ No newline at end of file
diff --git a/md/33.md b/md/33.md
index c6f9d5a3..925a8ffe 100644
--- a/md/33.md
+++ b/md/33.md
@@ -49,4 +49,6 @@ In [2]: t = Student('001','xiaoming')
 In [3]: t()
 I can be called
 my name is xiaoming
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](32.md)    [下一个例子](34.md)</center>
\ No newline at end of file
diff --git a/md/34.md b/md/34.md
index 19faca4b..ea9db1d5 100644
--- a/md/34.md
+++ b/md/34.md
@@ -23,4 +23,6 @@
 id = 1, name = xiaoming
 >>> ascii(xiaoming)
 'id = 1, name = xiaoming'
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](33.md)    [下一个例子](35.md)</center>
\ No newline at end of file
diff --git a/md/35.md b/md/35.md
index 3a00ed45..81a4d483 100644
--- a/md/35.md
+++ b/md/35.md
@@ -20,4 +20,6 @@ In [1]: class Student():
     ...:     @classmethod
     ...:     def f(cls):
     ...:         print(cls)
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](34.md)    [下一个例子](36.md)</center>
\ No newline at end of file
diff --git a/md/36.md b/md/36.md
index ae53ef4c..96a3859e 100644
--- a/md/36.md
+++ b/md/36.md
@@ -30,4 +30,6 @@ In [1]: delattr(xiaoming,'id')
 
 In [2]: hasattr(xiaoming,'id')
 Out[2]: False
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](35.md)    [下一个例子](37.md)</center>
\ No newline at end of file
diff --git a/md/37.md b/md/37.md
index 745b92bb..274aafb4 100644
--- a/md/37.md
+++ b/md/37.md
@@ -39,4 +39,6 @@ Out[96]:
  '__weakref__',
  
  'name']
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](36.md)    [下一个例子](38.md)</center>
\ No newline at end of file
diff --git a/md/38.md b/md/38.md
index e29eb092..c21d43ac 100644
--- a/md/38.md
+++ b/md/38.md
@@ -19,4 +19,6 @@ In [1]: class Student():
 In [2]: xiaoming = Student(id='001',name='xiaoming')
 In [3]: getattr(xiaoming,'name') # 获取xiaoming这个实例的name属性值
 Out[3]: 'xiaoming'
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](37.md)    [下一个例子](39.md)</center>
\ No newline at end of file
diff --git a/md/39.md b/md/39.md
index 4c561876..677151e2 100644
--- a/md/39.md
+++ b/md/39.md
@@ -20,4 +20,6 @@ Out[3]: True
 
 In [4]: hasattr(xiaoming,'address')
 Out[4]: False
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](38.md)    [下一个例子](40.md)</center>
\ No newline at end of file
diff --git a/md/4.md b/md/4.md
index ad1d4326..a18e1c8d 100644
--- a/md/4.md
+++ b/md/4.md
@@ -14,4 +14,6 @@ Out[5]: False
 ```python
 In [6]: all([1,2,3])                                                            
 Out[6]: True
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](3.md)    [下一个例子](5.md)</center>
\ No newline at end of file
diff --git a/md/40.md b/md/40.md
index f88638f6..2c7be683 100644
--- a/md/40.md
+++ b/md/40.md
@@ -22,4 +22,6 @@ In [2]: xiaoming = Student(id='001',name='xiaoming')
 ```python
 In [1]: id(xiaoming)
 Out[1]: 98234208
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](39.md)    [下一个例子](41.md)</center>
\ No newline at end of file
diff --git a/md/41.md b/md/41.md
index 832a7ec5..af6c86a0 100644
--- a/md/41.md
+++ b/md/41.md
@@ -20,4 +20,6 @@ In [2]: xiaoming = Student(id='001',name='xiaoming')
 
 In [3]: isinstance(xiaoming,Student)
 Out[3]: True
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](40.md)    [下一个例子](42.md)</center>
\ No newline at end of file
diff --git a/md/42.md b/md/42.md
index 0aebf0cc..9a052976 100644
--- a/md/42.md
+++ b/md/42.md
@@ -28,4 +28,6 @@ Out[4]: True
 ```python
 In [1]: issubclass(int,(int,float))
 Out[1]: True
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](41.md)    [下一个例子](43.md)</center>
\ No newline at end of file
diff --git a/md/43.md b/md/43.md
index 03c0bf36..1de14923 100644
--- a/md/43.md
+++ b/md/43.md
@@ -13,4 +13,6 @@ In [1]: o = object()
 
 In [2]: type(o)
 Out[2]: object
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](42.md)    [下一个例子](44.md)</center>
\ No newline at end of file
diff --git a/md/44.md b/md/44.md
index 92dde191..d8800eff 100644
--- a/md/44.md
+++ b/md/44.md
@@ -43,4 +43,6 @@ class C:
     @x.deleter
     def x(self):
         del self._x
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](43.md)    [下一个例子](45.md)</center>
\ No newline at end of file
diff --git a/md/45.md b/md/45.md
index 3388c94e..603a2645 100644
--- a/md/45.md
+++ b/md/45.md
@@ -25,4 +25,6 @@ Out[3]: __main__.Student
 
 In [4]: type(tuple())
 Out[4]: tuple
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](44.md)    [下一个例子](46.md)</center>
\ No newline at end of file
diff --git a/md/46.md b/md/46.md
index 195fbcc2..6b7296e5 100644
--- a/md/46.md
+++ b/md/46.md
@@ -78,4 +78,6 @@ Out[46]: True
 
 元类，确实使用不是那么多，也许先了解这些，就能应付一些场合。就连 Python 界的领袖 `Tim Peters` 都说：
 
-“元类就是深度的魔法，99%的用户应该根本不必为此操心。
\ No newline at end of file
+“元类就是深度的魔法，99%的用户应该根本不必为此操心。
+
+<center>[上一个例子](45.md)    [下一个例子](47.md)</center>
\ No newline at end of file
diff --git a/md/47.md b/md/47.md
index 07b3fa9a..8a8c5a5d 100644
--- a/md/47.md
+++ b/md/47.md
@@ -16,4 +16,6 @@ In [1]: s = ["a","b","c"]
 1 a
 2 b
 3 c
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](46.md)    [下一个例子](48.md)</center>
\ No newline at end of file
diff --git a/md/48.md b/md/48.md
index be4eaac1..c62565fc 100644
--- a/md/48.md
+++ b/md/48.md
@@ -5,12 +5,22 @@
 ```
 
 #### 48 查看变量所占字节数
+`getsizeof`查看变量占用字节数
+看到：字典比列表占用更多空间
 
 ```python
 In [1]: import sys
 
-In [2]: a = {'a':1,'b':2.0}
+In [3]: a = [('a',1),('b',2)]                                                                                                             
 
-In [3]: sys.getsizeof(a) # 占用240个字节
-Out[3]: 240
-```
\ No newline at end of file
+In [5]: sys.getsizeof(a)                                                        
+Out[5]: 88
+
+
+In [6]: a = {'a':1,'b':2.0}                                                     
+In [7]: sys.getsizeof(a)                                                        
+Out[7]: 248
+
+```
+
+<center>[上一个例子](47.md)    [下一个例子](49.md)</center>
diff --git a/md/49.md b/md/49.md
index a31c4cc5..9586d968 100644
--- a/md/49.md
+++ b/md/49.md
@@ -13,4 +13,6 @@ In [1]: fil = filter(lambda x: x>10,[1,11,2,45,7,6,13])
 
 In [2]: list(fil)
 Out[2]: [11, 45, 13]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](48.md)    [下一个例子](50.md)</center>
\ No newline at end of file
diff --git a/md/5.md b/md/5.md
index 7e890b73..ccc279b2 100644
--- a/md/5.md
+++ b/md/5.md
@@ -15,4 +15,6 @@ Out[7]: False
 ```python
 In [8]: any([0,0,1])                                                            
 Out[8]: True
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](4.md)    [下一个例子](6.md)</center>
\ No newline at end of file
diff --git a/md/50.md b/md/50.md
index 3624dfb8..b1ea2d86 100644
--- a/md/50.md
+++ b/md/50.md
@@ -14,4 +14,6 @@ Out[1]: 6139638
 
 In [2]: hash([1,2,3])
 # TypeError: unhashable type: 'list'
-  ```
\ No newline at end of file
+  ```
+
+<center>[上一个例子](49.md)    [下一个例子](51.md)</center>
\ No newline at end of file
diff --git a/md/51.md b/md/51.md
index ff6c63c4..3ae2bcd9 100644
--- a/md/51.md
+++ b/md/51.md
@@ -26,4 +26,6 @@ class Student(builtins.object)
  |
  |  __weakref__
  |      list of weak references to the object (if defined)
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](50.md)    [下一个例子](52.md)</center>
\ No newline at end of file
diff --git a/md/52.md b/md/52.md
index e1514579..ee915b1c 100644
--- a/md/52.md
+++ b/md/52.md
@@ -12,4 +12,6 @@
 In [1]: input()
 aa
 Out[1]: 'aa'
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](51.md)    [下一个例子](53.md)</center>
\ No newline at end of file
diff --git a/md/53.md b/md/53.md
index f4ea9956..e9b5b39a 100644
--- a/md/53.md
+++ b/md/53.md
@@ -46,4 +46,6 @@ __iter__ is called!!
 3
 4
 5
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](52.md)    [下一个例子](54.md)</center>
\ No newline at end of file
diff --git a/md/54.md b/md/54.md
index 86770f90..6ad48f62 100644
--- a/md/54.md
+++ b/md/54.md
@@ -67,4 +67,6 @@ f.close
 # with关键字系统会自动关闭文件和处理异常
 with open(r"./data/test.txt", "w") as f:
     f.write("hello world!")
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](53.md)    [下一个例子](55.md)</center>
\ No newline at end of file
diff --git a/md/55.md b/md/55.md
index 330960e5..22a3430d 100644
--- a/md/55.md
+++ b/md/55.md
@@ -17,4 +17,6 @@ Out[1]: range(0, 11)
 
 In [2]: range(0,11,1)
 Out[2]: range(0, 11)
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](54.md)    [下一个例子](56.md)</center>
\ No newline at end of file
diff --git a/md/56.md b/md/56.md
index fa80998e..35c5511f 100644
--- a/md/56.md
+++ b/md/56.md
@@ -17,4 +17,6 @@ In [2]: for i in rev:
 2
 4
 1
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](55.md)    [下一个例子](57.md)</center>
\ No newline at end of file
diff --git a/md/57.md b/md/57.md
index b94e1aa8..d8ab15db 100644
--- a/md/57.md
+++ b/md/57.md
@@ -20,4 +20,6 @@ In [6]: b
 Out[6]: ['a', 'b', 'c', 'd', 'e']
 In [7]: [str(y) + str(x) for x,y in zip(a,b)]
 Out[7]: ['a0', 'b1', 'c2', 'd3', 'e4']
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](56.md)    [下一个例子](58.md)</center>
\ No newline at end of file
diff --git a/md/58.md b/md/58.md
index 9a47cd9c..452b7c3f 100644
--- a/md/58.md
+++ b/md/58.md
@@ -15,4 +15,6 @@ def add_or_sub(a, b, oper):
 
 
 add_or_sub(1, 2, '-')  # -1
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](57.md)    [下一个例子](59.md)</center>
\ No newline at end of file
diff --git a/md/59.md b/md/59.md
index b632fc19..f1834c5e 100644
--- a/md/59.md
+++ b/md/59.md
@@ -46,4 +46,6 @@ with open('json.txt', 'w') as f:
         "name":"xiaohong"
     }
 ]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](58.md)    [下一个例子](60.md)</center>
\ No newline at end of file
diff --git a/md/6.md b/md/6.md
index 5ac32890..265e7c55 100644
--- a/md/6.md
+++ b/md/6.md
@@ -17,3 +17,6 @@ Out[10]: False
 In [11]: bool([1,0,1])                                                          
 Out[11]: True
 ```
+
+
+<center>[上一个例子](5.md)    [下一个例子](7.md)</center>
\ No newline at end of file
diff --git a/md/60.md b/md/60.md
index b8fe99e2..12fe4c24 100644
--- a/md/60.md
+++ b/md/60.md
@@ -16,4 +16,6 @@ def calculator(a, b, k):
 
 calculator(1, 2, '+')  # 3
 calculator(3, 4, '**')  # 81
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](59.md)    [下一个例子](61.md)</center>
\ No newline at end of file
diff --git a/md/61.md b/md/61.md
index 46e10760..e097f035 100644
--- a/md/61.md
+++ b/md/61.md
@@ -14,4 +14,6 @@ def score_mean(lst):
 
 lst=[9.1, 9.0,8.1, 9.7, 19,8.2, 8.6,9.8]
 score_mean(lst) # 9.1
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](60.md)    [下一个例子](62.md)</center>
\ No newline at end of file
diff --git a/md/62.md b/md/62.md
index f584a6e9..65f4de5f 100644
--- a/md/62.md
+++ b/md/62.md
@@ -35,4 +35,6 @@ for i in range(1, 10):
     for j in range(1, i+1):
         print('%d * %d = %d' % (j, i, j * i) , end="\t")
     print()
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](61.md)    [下一个例子](63.md)</center>
\ No newline at end of file
diff --git a/md/63.md b/md/63.md
index 4765f04a..d3a807ca 100644
--- a/md/63.md
+++ b/md/63.md
@@ -47,4 +47,6 @@ import numpy
 b = numpy.array([[1,2,3],[4,5]])
 b.flatten()
 array([list([1, 2, 3]), list([4, 5])], dtype=object)
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](62.md)    [下一个例子](64.md)</center>
\ No newline at end of file
diff --git a/md/64.md b/md/64.md
index 6acde3b3..283dd081 100644
--- a/md/64.md
+++ b/md/64.md
@@ -28,3 +28,6 @@ r = divide([1, 3, 5, 7, 9], -3)
 print(r)  # [[1, 3, 5, 7, 9]]
 ```
 
+
+
+<center>[上一个例子](63.md)    [下一个例子](65.md)</center>
\ No newline at end of file
diff --git a/md/65.md b/md/65.md
index 11190a8f..58d09c89 100644
--- a/md/65.md
+++ b/md/65.md
@@ -14,4 +14,6 @@ def filter_false(lst):
 r = filter_false([None, 0, False, '', [], 'ok', [1, 2]])
 print(r)  # ['ok', [1, 2]]
 
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](64.md)    [下一个例子](66.md)</center>
\ No newline at end of file
diff --git a/md/66.md b/md/66.md
index 3ba4bb28..c35ae049 100644
--- a/md/66.md
+++ b/md/66.md
@@ -16,4 +16,6 @@ print(f'更长的列表是{r}')  # [4, 5, 6, 7]
 
 r = max_length([1, 2, 3], [4, 5, 6, 7], [8, 9])
 print(f'更长的列表是{r}')  # [4, 5, 6, 7]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](65.md)    [下一个例子](67.md)</center>
\ No newline at end of file
diff --git a/md/67.md b/md/67.md
index b7204cb0..50e3a6c5 100644
--- a/md/67.md
+++ b/md/67.md
@@ -20,3 +20,6 @@ print(f'{lst}中出现次数最多的元素为:{r}')
 # [1, 3, 3, 2, 1, 1, 2]中出现次数最多的元素为:1
 ```
 
+
+
+<center>[上一个例子](66.md)    [下一个例子](68.md)</center>
\ No newline at end of file
diff --git a/md/68.md b/md/68.md
index 1fe090dd..00ae05ab 100644
--- a/md/68.md
+++ b/md/68.md
@@ -12,4 +12,6 @@ def max_lists(*lst):
 
 r = max_lists([1, 2, 3], [6, 7, 8], [4, 5])
 print(r)  # 8
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](67.md)    [下一个例子](69.md)</center>
\ No newline at end of file
diff --git a/md/69.md b/md/69.md
index a01254dc..ed1b5207 100644
--- a/md/69.md
+++ b/md/69.md
@@ -15,4 +15,6 @@ x = [1, 1, 2, 2, 3, 2, 3, 4, 5, 6]
 y = [1, 2, 3, 4, 5]
 has_duplicates(x)  # False
 has_duplicates(y)  # True
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](68.md)    [下一个例子](70.md)</center>
\ No newline at end of file
diff --git a/md/7.md b/md/7.md
index 3f16318f..65310311 100644
--- a/md/7.md
+++ b/md/7.md
@@ -12,3 +12,6 @@
 In [1]: complex(1,2)
 Out[1]: (1+2j)
 ```
+
+
+<center>[上一个例子](6.md)    [下一个例子](8.md)</center>
\ No newline at end of file
diff --git a/md/70.md b/md/70.md
index 7f649773..ac349b57 100644
--- a/md/70.md
+++ b/md/70.md
@@ -13,4 +13,6 @@ def reverse(lst):
 
 r = reverse([1, -2, 3, 4, 1, 2])
 print(r)  # [2, 1, 4, 3, -2, 1]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](69.md)    [下一个例子](71.md)</center>
\ No newline at end of file
diff --git a/md/71.md b/md/71.md
index c2afa8f4..532d5dae 100644
--- a/md/71.md
+++ b/md/71.md
@@ -24,3 +24,6 @@ float_range(1, 8, 10)
 # [1.0, 1.7, 2.4, 3.1, 3.8, 4.5, 5.2, 5.9, 6.6, 7.3, 8.0]
 ```
 
+
+
+<center>[上一个例子](70.md)    [下一个例子](72.md)</center>
\ No newline at end of file
diff --git a/md/72.md b/md/72.md
index a6dc36f3..8e7b4c40 100644
--- a/md/72.md
+++ b/md/72.md
@@ -18,3 +18,6 @@ records = [25,89,31,34]
 bif_by(records, lambda x: x<80) # [[25, 31, 34], [89]]
 ```
 
+
+
+<center>[上一个例子](71.md)    [下一个例子](73.md)</center>
\ No newline at end of file
diff --git a/md/73.md b/md/73.md
index 519dde79..db9cfca9 100644
--- a/md/73.md
+++ b/md/73.md
@@ -15,4 +15,6 @@ lst1=[1,2,3,4,5,6]
 lst2=[3,4,5,6,3,2]
 list(map(lambda x,y:x*y+1,lst1,lst2))
 ### [4, 9, 16, 25, 16, 13]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](72.md)    [下一个例子](74.md)</center>
\ No newline at end of file
diff --git a/md/74.md b/md/74.md
index 67611552..58980265 100644
--- a/md/74.md
+++ b/md/74.md
@@ -21,3 +21,6 @@ r = max_pairs({'a': -10, 'b': 5, 'c': 3, 'd': 5})
 print(r)  # [('b', 5), ('d', 5)]
 ```
 
+
+
+<center>[上一个例子](73.md)    [下一个例子](75.md)</center>
\ No newline at end of file
diff --git a/md/75.md b/md/75.md
index f7ae0fda..4a34cb77 100644
--- a/md/75.md
+++ b/md/75.md
@@ -20,3 +20,6 @@ merge_dict({'a': 1, 'b': 2}, {'c': 3})
 # {'a': 1, 'b': 2, 'c': 3}
 ```
 
+
+
+<center>[上一个例子](74.md)    [下一个例子](76.md)</center>
\ No newline at end of file
diff --git a/md/76.md b/md/76.md
index aadd8cf7..1736bef1 100644
--- a/md/76.md
+++ b/md/76.md
@@ -21,3 +21,6 @@ topn_dict({'a': 10, 'b': 8, 'c': 9, 'd': 10}, 3)
 # ['a', 'd', 'c']
 ```
 
+
+
+<center>[上一个例子](75.md)    [下一个例子](77.md)</center>
\ No newline at end of file
diff --git a/md/77.md b/md/77.md
index c1b50cca..e6aa13f4 100644
--- a/md/77.md
+++ b/md/77.md
@@ -18,4 +18,6 @@ def anagram(str1, str2):
 
 anagram('eleven+two', 'twelve+one')  # True 这是一对神器的变位词
 anagram('eleven', 'twelve')  # False
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](76.md)    [下一个例子](78.md)</center>
\ No newline at end of file
diff --git a/md/78.md b/md/78.md
index 33a09628..021ebbdc 100644
--- a/md/78.md
+++ b/md/78.md
@@ -22,4 +22,6 @@ merged1 = {**dic1, **dic2} # {'x': 1, 'y': 3, 'z': 4}
 from collections import ChainMap
 merged2 = ChainMap(dic1,dic2)
 print(merged2) # ChainMap({'x': 1, 'y': 2}, {'y': 3, 'z': 4})
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](77.md)    [下一个例子](79.md)</center>
\ No newline at end of file
diff --git a/md/79.md b/md/79.md
index d0b884a8..720d755c 100644
--- a/md/79.md
+++ b/md/79.md
@@ -15,4 +15,6 @@ lst = [Point(1.5, 2, 3.0), Point(-0.3, -1.0, 2.1), Point(1.3, 2.8, -2.5)]
 print(lst[0].y - lst[1].y)
 ```
 
-使用命名元组写出来的代码可读性更好，尤其处理上百上千个属性时作用更加凸显。
\ No newline at end of file
+使用命名元组写出来的代码可读性更好，尤其处理上百上千个属性时作用更加凸显。
+
+<center>[上一个例子](78.md)    [下一个例子](80.md)</center>
\ No newline at end of file
diff --git a/md/8.md b/md/8.md
index 56cd79d9..56a96da7 100644
--- a/md/8.md
+++ b/md/8.md
@@ -12,3 +12,6 @@
 In [1]: divmod(10,3)
 Out[1]: (3, 1)
 ```
+
+
+<center>[上一个例子](7.md)    [下一个例子](9.md)</center>
\ No newline at end of file
diff --git a/md/80.md b/md/80.md
index 2b038773..67f0e825 100644
--- a/md/80.md
+++ b/md/80.md
@@ -14,4 +14,6 @@ lst = [randint(0,50) for _ in range(100)]
 print(lst[:5])# [38, 19, 11, 3, 6]
 lst_sample = sample(lst,10)
 print(lst_sample) # [33, 40, 35, 49, 24, 15, 48, 29, 37, 24]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](79.md)    [下一个例子](81.md)</center>
\ No newline at end of file
diff --git a/md/81.md b/md/81.md
index 04037010..9b02a67f 100644
--- a/md/81.md
+++ b/md/81.md
@@ -13,4 +13,6 @@ from random import shuffle
 lst = [randint(0,50) for _ in range(100)]
 shuffle(lst)
 print(lst[:5]) # [50, 3, 48, 1, 26]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](80.md)    [下一个例子](82.md)</center>
\ No newline at end of file
diff --git a/md/82.md b/md/82.md
index 2f672128..765449cf 100644
--- a/md/82.md
+++ b/md/82.md
@@ -24,4 +24,6 @@ Out[1]:
  (1.4749644859469302, 8.038753079253127),
  (9.005430657826324, 7.58011186920019),
  (8.700789540392917, 1.2217577293254112)]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](81.md)    [下一个例子](83.md)</center>
\ No newline at end of file
diff --git a/md/83.md b/md/83.md
index 572bd109..6b1f00d5 100644
--- a/md/83.md
+++ b/md/83.md
@@ -24,4 +24,6 @@ points = list(zip(x,y))
  (7, 14.02384035204836),
  (8, 15.33755823101161),
  (9, 17.565074449028497)]
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](82.md)    [下一个例子](84.md)</center>
\ No newline at end of file
diff --git a/md/84.md b/md/84.md
index 0e034b48..849f5bac 100644
--- a/md/84.md
+++ b/md/84.md
@@ -23,4 +23,6 @@ for i in chain(a,b):
 2
 4
 6
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](83.md)    [下一个例子](85.md)</center>
\ No newline at end of file
diff --git a/md/85.md b/md/85.md
index 3ff3de43..2a6878c3 100644
--- a/md/85.md
+++ b/md/85.md
@@ -22,4 +22,6 @@ def product(*args, repeat=1):
 ```python
 rtn = product('xyz', '12', repeat=3)
 print(list(rtn))
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](84.md)    [下一个例子](86.md)</center>
\ No newline at end of file
diff --git a/md/86.md b/md/86.md
index dc9fa83c..990acbe7 100644
--- a/md/86.md
+++ b/md/86.md
@@ -22,3 +22,6 @@ st="python"
 st[::-1]
 ```
 
+
+
+<center>[上一个例子](85.md)    [下一个例子](87.md)</center>
\ No newline at end of file
diff --git a/md/87.md b/md/87.md
index b294de46..e52ae6e1 100644
--- a/md/87.md
+++ b/md/87.md
@@ -13,4 +13,6 @@ In [4]: mystr = ['1','2','java','4','python','java','7','8','java','python','11'
 
 In [5]: ','.join(mystr) #用逗号连接字符串
 Out[5]: '1,2,java,4,python,java,7,8,java,python,11,java,13,14'
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](86.md)    [下一个例子](88.md)</center>
\ No newline at end of file
diff --git a/md/88.md b/md/88.md
index e0116157..30aebbb2 100644
--- a/md/88.md
+++ b/md/88.md
@@ -18,3 +18,6 @@ str_byte_len('i love python')  # 13(个字节)
 str_byte_len('字符')  # 6(个字节)
 ```
 
+
+
+<center>[上一个例子](87.md)    [下一个例子](89.md)</center>
\ No newline at end of file
diff --git a/md/89.md b/md/89.md
index 3c51cb04..28300800 100644
--- a/md/89.md
+++ b/md/89.md
@@ -30,4 +30,6 @@ print(s2)
 
 ```python
 .*
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](88.md)    [下一个例子](90.md)</center>
\ No newline at end of file
diff --git a/md/9.md b/md/9.md
index b11c467b..3cd61f78 100644
--- a/md/9.md
+++ b/md/9.md
@@ -1,6 +1,6 @@
 ```markdown
 @author jackzhenguo
-@desc 
+@desc 转为浮点类型　
 @date 2019/2/15
 ```
 #### 9 转为浮点类型　
@@ -12,9 +12,30 @@ In [1]: float(3)
 Out[1]: 3.0
 ```
 
+```python
+In [1]: float('3')
+Out[1]: 3.0
+```
+
+浮点数最大值
+```python
+import sys
+
+In[4]: sys.float_info.max                                                      
+Out[4]: 1.7976931348623157e+308
+```
+
+正无穷大、负无穷大
+```python
+float('inf') # 正无穷大
+float('-inf') # 负无穷大
+```
+
 如果不能转化为浮点数，则会报`ValueError`:
 
 ```python
 In [2]: float('a')
 # ValueError: could not convert string to float: 'a'
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](8.md)    [下一个例子](10.md)</center>
diff --git a/md/90.md b/md/90.md
index e3cebb06..00f299f8 100644
--- a/md/90.md
+++ b/md/90.md
@@ -16,4 +16,6 @@
 
 因此，普通字符是原子，正则中的通用字符(下面会讲到)也是原子。
 
-大家记住*原子*这个概念。
\ No newline at end of file
+大家记住*原子*这个概念。
+
+<center>[上一个例子](89.md)    [下一个例子](91.md)</center>
\ No newline at end of file
diff --git a/md/91.md b/md/91.md
index ab463b6a..5d77d62a 100644
--- a/md/91.md
+++ b/md/91.md
@@ -30,4 +30,6 @@ pat = '[0123456789
 
 类似的通用正则字符还有几个，下面也会讲到。
 
-做一件事前，把规则弄清，触类旁通，相信大家理解其他几个也没问题。
\ No newline at end of file
+做一件事前，把规则弄清，触类旁通，相信大家理解其他几个也没问题。
+
+<center>[上一个例子](90.md)    [下一个例子](92.md)</center>
\ No newline at end of file
diff --git a/md/92.md b/md/92.md
index 80945a2a..e745030b 100644
--- a/md/92.md
+++ b/md/92.md
@@ -41,4 +41,6 @@ print(result)
 
 以上就是使用正则的最普通例子。如果要找出前缀为grow的单词，比如可能为grows, growing 等，最普通查找实现起来就不方便。
 
-然而，借助于下面介绍的元字符、通用字符和捕获组合起来，便能应对解决复杂的匹配查找问题。
\ No newline at end of file
+然而，借助于下面介绍的元字符、通用字符和捕获组合起来，便能应对解决复杂的匹配查找问题。
+
+<center>[上一个例子](91.md)    [下一个例子](93.md)</center>
\ No newline at end of file
diff --git a/md/93.md b/md/93.md
index bdbca2ac..0094a997 100644
--- a/md/93.md
+++ b/md/93.md
@@ -46,3 +46,6 @@ result = re.findall(pat,s)
 
 而\S, \W, \D 分别对应 \s, \w, \d匹配字符集的补集，例如\S 的意思是匹配 \s 以外的其他任意字符。
 
+
+
+<center>[上一个例子](92.md)    [下一个例子](94.md)</center>
\ No newline at end of file
diff --git a/md/94.md b/md/94.md
index 95b0d9cf..d03bc3bb 100644
--- a/md/94.md
+++ b/md/94.md
@@ -21,4 +21,6 @@
 {n} 前面的原子出现了 n 次
 {n,} 前面的原子至少出现 n 次
 {n,m} 前面的原子出现次数介于 n-m 之间
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](93.md)    [下一个例子](95.md)</center>
\ No newline at end of file
diff --git a/md/95.md b/md/95.md
index e603b0c1..e8cfaba3 100644
--- a/md/95.md
+++ b/md/95.md
@@ -75,4 +75,6 @@ pat = r'https:.*\)'
 pat = r'(https:.*)\)'
 ```
 
-此时返回结果完全正确，无任何多余字符。想要返回的子串外面添加一对括号还有个专业叫法：**捕获**或**分组**。
\ No newline at end of file
+此时返回结果完全正确，无任何多余字符。想要返回的子串外面添加一对括号还有个专业叫法：**捕获**或**分组**。
+
+<center>[上一个例子](94.md)    [下一个例子](96.md)</center>
\ No newline at end of file
diff --git a/md/96.md b/md/96.md
index 8bed0f3a..27f40030 100644
--- a/md/96.md
+++ b/md/96.md
@@ -47,4 +47,6 @@ print(result)
   '<p> 这是一个段落>/p>']
 ```
 
-以上例子仅仅用作演示两者区别，实际的html结构含有换行符等，环境比上面要复杂的多，贪心和非贪心捕获的写法可能不会导致结果不同，但是我们依然需要理解它们的区别。
\ No newline at end of file
+以上例子仅仅用作演示两者区别，实际的html结构含有换行符等，环境比上面要复杂的多，贪心和非贪心捕获的写法可能不会导致结果不同，但是我们依然需要理解它们的区别。
+
+<center>[上一个例子](95.md)    [下一个例子](97.md)</center>
\ No newline at end of file
diff --git a/md/97.md b/md/97.md
index e2d2fcb8..e011a205 100644
--- a/md/97.md
+++ b/md/97.md
@@ -24,4 +24,6 @@ pat.fullmatch('qaz12wsxedcrfvtgb67890942234343434') # None 长度大于22
 pat.fullmatch('qaz_231') # None 含有下划线
 pat.fullmatch('n0passw0Rd')
 Out[4]: <re.Match object; span=(0, 10), match='n0passw0Rd'>
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](96.md)    [下一个例子](98.md)</center>
\ No newline at end of file
diff --git a/md/98.md b/md/98.md
index b7aaab0a..d12dce1b 100644
--- a/md/98.md
+++ b/md/98.md
@@ -20,4 +20,6 @@ result=re.search(pat,data)
 print(result) <re.Match object; span=(1358, 1382), match='<title>百度一下，你就知道</title>'>
 
 result.group() # 百度一下，你就知道
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](97.md)    [下一个例子](99.md)</center>
\ No newline at end of file
diff --git a/md/99.md b/md/99.md
index 05a12564..4ba72add 100644
--- a/md/99.md
+++ b/md/99.md
@@ -63,4 +63,6 @@ s = batch_camel(['student_id', 'student\tname', 'student-add'])
 print(s)
 # 结果
 ['studentId', 'studentName', 'studentAdd']
-```
\ No newline at end of file
+```
+
+<center>[上一个例子](98.md)    [下一个例子](100.md)</center>
\ No newline at end of file
diff --git a/md/__pycache__/batch.cpython-37.pyc b/md/__pycache__/batch.cpython-37.pyc
deleted file mode 100644
index a5bd4868..00000000
Binary files a/md/__pycache__/batch.cpython-37.pyc and /dev/null differ
diff --git a/script/add_nav.py b/script/add_nav.py
new file mode 100644
index 00000000..47b8986f
--- /dev/null
+++ b/script/add_nav.py
@@ -0,0 +1,17 @@
+# function: give each *.md example to a navigation in bottom of file
+# author: zhenguo
+# date: 2021.2.27
+# version: 1.0
+
+import os
+
+for file in os.listdir('../md'):
+    if os.path.splitext(file)[-1] == '.md':
+        with open('../md/' + file, 'a') as f:
+            file_name = os.path.splitext(file)[0]
+            try:
+                c = '\n\n<center>[上一个例子](%s.md)    [下一个例子](%s.md)</center>' % (str(int(file_name) - 1), str(int(file_name) + 1))
+                f.write(c)
+                print('文件%s写入成功' % (file,))
+            except:
+                print(ex)
diff --git a/script/avatar.png b/script/avatar.png
new file mode 100644
index 00000000..4455032c
Binary files /dev/null and b/script/avatar.png differ