2019.8.26
老师邮箱:zhpj@mail.sysu.edu.cn
- 期中考可能以闭卷形式
- 期末闭卷
- 难度:较浅显,但不代表非常简单
- 作用:数据分析,推论出信息
- 参考:盛骤、谢世千、潘承毅,《概率论与数理统计》第四版,高等教育出版社,2008
- 使用工具:MATLAB / Python
相关问题
- 生日问题
- Benford’s Law / First Digit Law:一堆从实际生活得出的数据中,以1为首位数字的数的出现概率约为总数的三成,接近直觉得出之期望值1/9的3倍。
Chapter 1
相关概念
描述性数字:描述一个整体,不专门针对个人(例:“00”后)
Population(总体):A well-defined collection of objects(units).
Variables(变量,描述性的)
Sample(样本):A subset of the population.(因为要考虑实际情况,因此样本范围小于总体)
Univariate(一元变量)
Bivariate(二元变量)
Multivariate(多元变量)
Descriptive statistics(描述性统计)
Visual techniques:Stem-and-leaf display(茎叶图), Dotplot(点图) & histograms(直方图)
Numerical summary measures:mean(均值), standard deviations(标准差) & correlations coefficients
Inferential statistics(推断性统计)
收集数据
- Random sampling(随机抽样)
-
Straitied sampling(分层抽样)
//: # (系统抽样呢?) - Sample size(样本数量):Offen be donated as n
茎叶图
Constructing a Stem-and-Leaf Display
- Select one or more leading digits for the stem values. The trailing digits become the leaves.
- List possible stem values in a vertical column.
- Record the leaf for each observation beside the corresponding stem value.
- Indicate the units for stems and leaves someplace in the display.
可以对其范围进行操作,得到不同的图
点图
Definition:
A dotplot is an attractive summary of numerical data when the data set is reasonably small or there are relatively few distinct data values. Each observation is represented by a dot above the corresponding location on a horizontal measurement scale.
直方图
Definition:
A numerical variable is discrete if its set of possible values either is finite or else can be listed in an infinite sequence (one in which there is a first number, a second number, and so on). A numerical variable is continuous if its possible values consist of an entire interval on the number line.