View on GitHub

我的博客

杂七杂八啥都有

2019.8.26

老师邮箱:zhpj@mail.sysu.edu.cn

  1. 期中考可能以闭卷形式
  2. 期末闭卷
  3. 难度:较浅显,但不代表非常简单
  4. 作用:数据分析,推论出信息
  5. 参考:盛骤、谢世千、潘承毅,《概率论与数理统计》第四版,高等教育出版社,2008
  6. 使用工具:MATLAB / Python

相关问题

  1. 生日问题
  2. Benford’s Law / First Digit Law:一堆从实际生活得出的数据中,以1为首位数字的数的出现概率约为总数的三成,接近直觉得出之期望值1/9的3倍。

Chapter 1

相关概念

描述性数字:描述一个整体,不专门针对个人(例:“00”后)
Population(总体):A well-defined collection of objects(units).
Variables(变量,描述性的)
Sample(样本):A subset of the population.(因为要考虑实际情况,因此样本范围小于总体)
Univariate(一元变量)
Bivariate(二元变量)
Multivariate(多元变量)

Descriptive statistics(描述性统计)
Visual techniques:Stem-and-leaf display(茎叶图), Dotplot(点图) & histograms(直方图)
Numerical summary measures:mean(均值), standard deviations(标准差) & correlations coefficients

Inferential statistics(推断性统计)

收集数据

  1. Random sampling(随机抽样)
  2. Straitied sampling(分层抽样)
    //: # (系统抽样呢?)

  3. Sample size(样本数量):Offen be donated as n

茎叶图

Constructing a Stem-and-Leaf Display

  1. Select one or more leading digits for the stem values. The trailing digits become the leaves.
  2. List possible stem values in a vertical column.
  3. Record the leaf for each observation beside the corresponding stem value.
  4. Indicate the units for stems and leaves someplace in the display.

可以对其范围进行操作,得到不同的图

点图

Definition:

A dotplot is an attractive summary of numerical data when the data set is reasonably small or there are relatively few distinct data values. Each observation is represented by a dot above the corresponding location on a horizontal measurement scale.

直方图

Definition:

A numerical variable is discrete if its set of possible values either is finite or else can be listed in an infinite sequence (one in which there is a first number, a second number, and so on). A numerical variable is continuous if its possible values consist of an entire interval on the number line.

TOP