Recapitulation For Midterm 1

1
I PART OF EXAMINATION

I. INTRODUCTORY DEFINITIONS

1. Mass phenomenon is - a phenomenon present in a large number of objects
2. Statistics is - the science of data
3. Two main parts of statistics are - Desriptive statistics and inferential statistics
4. Interferential statistics is - the science of using a sample to make generalizations
about the important aspects of a population.
5. An element that possessing characteristics on the basis of which mass phenomenon
variations are investigated is - Statistical unit
6. The variable we wish to analyze is the number of TV sets in the household in town
A. The population that corresponds to this variable is set of all household in town A
7. Any characteristic of a population unit is a(n) - Variable
8. A set of all statistical units (all those people, objects, transactions, events or
organizations of interest) that we are interested in studying is - Population
9. We received the data on turnover of 100 trade companies in B&H for 2009.
The trade company in this example represents - Statistical unit
10. The term population is given the precise meaning of - All those people, objects or
items of interest
11. Data that we have collected to meet our research needs are - Primary data
12. Director for children's program has compiled 10 questions that he will ask childes
from fifth grade of elementary schools in order to select announcer for show based on
their answers. What method of data collection using director? - Interview
13. Data collected by others but useful to our research are - Secondary data
14. Primary data is - Data collected specifically for the research
15. If a person collects data on the number of cars that pass through the intersection in a
given period of time, what data collection method did he use? - Recording
16. Census means that every element of population is included in research
17. Reliability in one research means that if our research is repeated then we should
expect the same kind of results
18. When a researcher collects data on subjects un their natural setting he in conducting
an observation study
19. A statistical variable is a characteristic in which population units vary
20. A modality for one statistical variable is a value that a statistical variable may
have
21. A study is done to determine the average amount of television watched per week by
children in B&H. What is the phrase "the amount of television watched per week by
California children in a random survey" describing? - Variable
22. A nominal level (scale) of measurement implies order to the data? - False
2
23. A ordinal level (scale) of measurement doesn't imply order to the data - False
24. When we have quantitative scale of measurement and zero does not mean absence of
phenomenon, that is intervale scale
25. When we have qualitative scale of measurement and each modality has the same
relative importance, that is Nominal scale
26. Quantitative statistical variable that can take any value from the set interval is
continous variable
27. If in the analysis of a phenomenon, we get the data by enumeration (counting), the
appropriate statistical variable is quantitative discrete
28. Quantitative statistical variable that can take only certain values from the set interval is
Discrete variable
29. If we work with discrete statistical variable we will work with - counting
30. In order to get the data related to the continuous variable, we use the process of -
measuring
31. What level of measurement would be involved in recording a person's social security
number? nominal level
32. You measure the width (in inches) of a number of fabric samples. This would be an
example of measurement at the ratio level
33. In a branch office of Raiffeisen Bank, we monitored the number of transactions during
a day, for 100 consecutive days. The statistical variable the number of transactions
during a day is of the following type a discrete variable
34. We measured the height of ship masts of 30 ships produced in a single series. In this
example, the mast height represents a continous variable
35. For what type of a variable we should not use a chart with columns merged
(histogram)? a discrete variable
36. Which of the following types of graphs could be thought of primarily as a "curve"?
Polygon of frequency
37. A(n) Nominal variable is a qualitative variable such that there is no meaningful
ordering or ranking of the categories - Nominal
38. How should you classify variable from the following question: What are the names of
colleges you have attended? - Nominal
39. How should you classify variable from the following question: What is the number of
times during the summer that you go to the beach? - discrete
40. How should you classify variable from the following question: What is your weight? -
continous
41. The average number of people per car for all cars using the "Diamond" Carpool Lane
is called the - Statistic
42. In order to represent a statistical distribution, we drew a histogram. That means that we
have - a continous variable

II. FREQUENCIES

43. The number of occurrences (repetitions) of a modality is absolute frequency
3
44. Absolute frequency shows the number of occurrences (repetitions) of a modality
45. Relative frequency shows proportion of units of a statistical set with the same
modality
46. Formula for calculating of relative frequency is p = fi/N

47. What is missing (in empty box) in formula for calculating relative frequency
f
i
p
i
= ? -
N
48. Increasing absolute cumulative frequency shows how many data have the value
that is equal to or lower than the value of present modality
49. We received information that 27% of first year students Business School are 19 years
old. On the basis of which type of frequency is obtained this information?
percentage frequency
50. We received information that 127 of second year students Business School are 20
years old. On the basis of which type of frequency is obtained this information?
absolute frequency
51. Sum of all absolute frequencies for one statistical distribution must be equal to N
52. Increasing relative cumulative frequency shows what is the percentage of data with
a value equal to or lower than the value of present modality
53. Sum of all relative frequencies for one statistical distribution has to be equal to 1
54. Formula for calculating of increasing relative cumulative frequency is CRF = p
j

55. What is missing (in empty box) in formula for calculating of increasing relative
cumulative frequency CRF
i
=
p
j
? - i
j=1
56. The last absolute cumulative frequency must be equal to N
57. Formula for calculating of increasing absolute cumulative frequency is CAF = f
j

58. What is missing (in empty box) in formula for calculating of increasing absolute
cumulative frequencyCAF
i
=
f
j
? i
j=1
59. The last relative cumulative frequency has to be equal to 1 (or 100%)
60. Statistical series with gross data is a database in which we entered data for each
statistical unit like piled data
61. If we have a form of data grouping in which each variable modality is associated a
corresponding absolute frequency, that is the statistical distribution of frequencies
62. The statistical distribution of frequencies is form of data grouping in which each
variable modality is associated a corresponding absolute frequency
63. Formula for calculating of center for interval is - L1,i + L c
i/2

64. What is missing (in empty box) in formula for calculating of center for interval
4
L1,i + L c
i
= ? - 1,i+1
65. We received information that 76% of first year students Business School has 21 or
fewer years. On the basis of which type of frequency is obtained this information?
increasing relative cumulative frequency
66. We received information that 170 of first year students Business School has 21 or
fewer years. On the basis of which type of frequency is obtained this information?
increasing absolute cumulative frequency
67. Formula for calculating of corrected absolute frequencies is fi=fi/li

III. DESCRIPTIVE STATISTICS

III.1. MEASURES OF CENTRAL TENDENCY

III.1.1. COMPLETE (ARITHMETIC) MEASURES OF CENTRAL TENDENCY

68. Measures of central tendency predict what the next value will be
69. The arithmetic mean of a data series is defined as sum of all data multiplied by
the number of data

70. Arithmetic mean is equal to the coeficient of the sum of all observation and the
size of series

71. To what measures of central tendency does arithmetic mean belong? arithmetic or
complete
72. Formula for calculating of arithmetic mean for interval-grouped data is next
question
73. What is missing (in empty box) in formula for calculating of arithmetic mean for
1 k
interval-grouped data X = c
i
? - fi
N i=1
74. Formula for calculating of arithmetic mean for non-interval-grouped data is X =
1/N * xi * fi

75. The arithmetic mean uses every value in the sample of the variable to calculate the
statistic? - true

5
76. If we multiply each observation by the same constant, the arithmetic mean of the new
variable is equal to the - product of the constant and the arithmetic mean of the
original variable
77. If we add the same constant to each observation, the arithmetic mean of the new
variable is equal to the sum of the constant and the arithmetic mean of the
original variable
78. The harmonic mean is the reciprocal value of the arithmetic mean of the
reciprocal value of the observation
79. Which of the following statistical measures is sensitive to outliers? - Mean
80. The geometric mean is the Nth root of the product of all observations
81. To what measures of central tendency does harmonic mean belong? arithmetic or
complete
82. Formula for calculating of geometric mean for non-interval-grouped data is next
question
83. What is missing (in empty box) in formula for calculating of geometric mean for
N
non-grouped data G
=
x
i
? N
i=1
84. The sum of deviations of observations from the arithmetic mean is equal to 0
85. The average salary in company X was 800 KM. If the employee structure has not
changed, the new average salary, in case that the salary of each employee was
increased by 40%, is 1,120
86. If we have a series of data that are all equal to a constant c, then the arithmetic mean
of the series is equal to the c
87. Measure of central tendency used to express the indirect relation is Harmonic
mean
88. The average salary in company X was 800 KM. If the employee structure has not
changed, the new average salary, in case that the salary of each employee was
increased by 60 KM, is 860
89. Formula for calculating of harmonic mean for non-grouped data from population is
next question
90. What is missing (in empty box) in formula for calculating of harmonic mean for non-
interval-grouped data from population H = ? - N
i=1 xi
91. For the variable motion speed in relation to the population of distance traveled,
we calculated that the measure of mean value is 75 km/h. In the calculation, we used
harmonic mean
92. For the data series: 15; 20; 14; 21; 28; the arithmetic mean is 19.6
93. Formula for calculating of harmonic mean for interval-grouped data from sample is
H = n/*fi/ci
k
i
f

6

III.1.2. INCOMPLETE (POSITIONAL) MEASURES OF CENTRAL
TENDENCY

94. The distribution on household income tends to be skewed to the right, with a
relatively few households making a large salary. Which measure of central tendency
would be the best measure of the center of this distribution? Median
95. To what measures of central tendency does mode belong? positional or
incomplete
96. The theoretical relative increasing cumulative frequency of the median is 50%
97. The median is determined based on The increasing cumulative frequency
98. The theoretical absolute increasing cumulative frequency of the median is N/2
99. The mod is The most frequent value
100. To what measures of central tendency does median belong? positional or
incomplete
101. The median of interval-grouped data is calculated by interpolation on the basis of
the following formula next question
102. What is missing (in empty box) in formula for calculating of median for
population M
e
= L
1Me +l
Me ?
f
Me - CAF UMr - 1
103. What is missing (in empty box) in formula for calculating of median for
CAF Me1
population M
e
= L
1Me +l
Me ?
f
Me N/2
104. The median divides the statistical series in two equal parts
105. What is missing (in empty box) in formula for calculating of mode

Mo = L1Mo +lMo ?
( f
M
o f
M
o
1
)
+
( f
M
o f
M
o+
1
) fMo fMo - 1
106. What is missing (in empty box) in formula for calculating of mode
fMo fMo1
Mo = L1Mo +lMo ? - ( f
M
o f
M
o
1
)
+
( f
M
o f
M
o+
1
)
107. The mode is graphically determined on a histogram
108. Quartiles divide the statistical series in four equal parts
109. How much data of a statistically arranged series is placed between the first and the
third quartile? - 50%
2
N

7
110. How much data of a statistically arranged series is placed between the second and the
third quartile? 25%
111. How much data of a statistically arranged series is placed between the median and
the third quartile? 25%
112. Formula for calculating of mode for interval-grouped data is Lmo + lMo*fM0
fMo-1
113. In statistical frequency distribution, the mode is the data that is repeated the most
times in the series
114. In statistical frequency distribution, the median is the data that divides a statistical
series in two halves
115. The theoretical relative increasing cumulative frequency of the first quartile is 25%
116. The theoretical relative increasing cumulative frequency of the third quartile is
75%
117. When distribution tends to be skewed, most appropriate measure of central tendency
is - Median
118. The theoretical absolute increasing cumulative frequency of the first quartile in
population is N/4
119. The theoretical absolute increasing cumulative frequency of the third quartile in
population is 3*N/4
120. The mod data is read on the basis of the highest frequency
121. The median is graphically determined on the increasing cumulant (ogive)
8
III.2. MEASURES OF VARIABILITY

III. 2. 1. ABSOLUTE MEASURES OF VARIABILITY

122. Measure of dispersion is measure for The variability of the values of our data set
about the measure of central tendency for the data set
123. Dispersion measures The spread of the data
124. The Inter-Quartile range is the Difference between the third and first quartiles
125. The variance is The square of the standard deviation
126. The standard deviation is the maximum deviation about the mean in a variable? -
False
127. What is formula for calculating of middle absolute distance for population? MAD =
x
i
X fi ?
128. What is missing (in empty box) in formula for calculating of middle absolute
k
distance for population MAD = x
i
X ? - fi
i=1
129. Formula for calculating of standard deviation from population is -
130. What is missing (in empty box) in formula for calculating of standard deviation
1 k ( i )2 i
for population = N i
=
1

x f

? - X
for population = N1 i=k1 (xi X
)2 ? -
fi
1 k
S
(x
i
X )
2
f
i
?
for sample S =
i=1 - - n-1
133. In what unit of measure is standard deviation expressed? - Same unit of measure
like observed variable
134. Variance equals to The arithmetic mean of the squares of data deviation from
the arithmetic mean
9
135. Standard deviation equals The square root of the variance
136. The variance is sensitive to outliers and extreme values? - True
137. Formula for calculating of inter-quartile absolute divergence is QD = Q3-Q1/2
138. The inter-quartile range shows the spread of the values in a variable of 50%
139. We can use the standard deviation for comparing variability of series with different
units of measure - False
140. Standard deviation of wages in an enterprise is 124 KM. If all workers got the same
raise of 50 KM, what is the new standard deviation of wages? 124 KM
141. Variance for meal compensation at one company is 36 KM
2
. Due to the rise in prices
of food items, the administration has decided to double meal compensation to all
workers. What is the new variance for meal compensation in the observed company?-
144 KM
2

142. If some measurements include the entire population, we will get the standard
deviation equal to 0? - True
143. If we removed the most extreme value in the variable, the new variance for the new
series of data will be - Lower
144. Which of the following answers can be the standard deviation of grades (grade can
take value 5-10) for one group of students 1.35
145. If we know that standard deviation is equal to 0, we know that All data in data set
are same
10
III. 2. 1. RELATIVE MEASURES OF VARIABILITY

146. The coefficient of variation expresses the standard deviation relative to the
absolute value of the mean by taking the ratio of the two? - True
147. What is missing (in empty box) in formula for calculating of the coefficient of
__
variation for population V = 100 ? X
148. What is missing (in empty box) in formula for calculating of the coefficient of
variation for population V = X100 ?

149. The coefficient of variation enables us to compare the relative variability across
several variables? - False
150. The coefficient of variation doesnt enable us to compare the relative variability
across several variables? - True
151. Which indicator will we use if we want to know the relative measure of data
varying around the median? The coefficient of inter-quartile deviation
152. In order to determine the relative inter-quartile deviation, it is necessary to
know The first and third quartile
153. Which indicator will we use if we want to know the relative measure of data
varying around the arithmetic mean? The coefficient of variation

154. Higher value of the coefficient of variation indicates Greater dispersion
155. Lower value of the coefficient of variation indicates Lower dispersion
156. Formula for calculating of coefficient of variation for population is V =
/X*100
157. The mean for whole set of standardized variable (i=1,...,N) is equal to - 0
158. Formula for calculating of standardized z value is Zi=xi-X/
159. What is missing (in empty box) in formula for calculating of the standardized z
value z
i
= ? -
160. What is missing (in empty box) in formula for calculating of the

standardized z value z
i
=? __
- X
161. Students have worked exam statistics. For the three students A, B and C
standardized values were: z
A
= 0.5, z
B
=1.4 and z
C
= 0.4. The best position
achieved by the results of the examination has Student B
i
x X
i
x
11
162. Students have worked exam statistics. For the three students A, B and C
standardized values were: z
A
= 0.5, z
B
=1.4 and z
C
= 0.4. The worst position
achieved by the results of the examination has Student C
163. What is missing (in empty box) in formula for calculating of the quartile
Q
3
Q
1
deviation
coefficient V
Q
= (100%)? - Q1+Q3
164.

The quartile deviation coefficient is
relative indicator of data varying around the
mean? - False
165. Which indicator will we use if we want to compare positions of individual data
in different series? z value
166. The variance for whole set of standardized variable (i=1,...,N) is equal to: - 1

12
III.3. MEASURES FOR SHAPE OF DISTRIBUTION AND FOR
CONCENTRATION

167. In case that we know that the distribution of frequencies is right asymmetric,
sign of its asymmetry coefficient is: - Positive
168. In case that we know that the distribution of frequencies is left asymmetric, sign
of its asymmetry coefficient is - Negative
169. For distribution of frequencies that is left asymmetric, left end of its polygon of
absolute frequencies is longer? - True
170. For distribution of frequencies that is left asymmetric, right end of its polygon of
absolute frequencies is longer? - False
171. If
3
< 0, that means Left asymmetry
172. If
3
= 0, that means - Symmetry
173. If
3
> 0, that means Right Asymmetry
Formula for calculating the asymmetry coefficient is -
3 =
3
/
174.
3
asymmetry coefficient
3
= ?

176. Formula for calculating the kurtosis coefficient is -
4
=
4
/

177. What is missing (in empty box) in formula for calculating of the kurtosis
4
coefficient
4
= ?
178. If the mean, median, and mode for a given population all equal 25, then we
know that its distribution is
179. What is missing (in empty box) in formula for calculating of the kurtosis

coefficient
4
=
4
?

180. We can have conclusion about kurtosis on base of
4
?

181. If M
o
= M
e
= X , that means.
182. If
4
< 3, that means
183. If
4
= 3, that means
184. If
4
> 3, that means
13
185. Which of the following conditions need to be satisfied that the distribution were
elongated (leptokurtic)?
186. We can have conclusion about asymmetry (skewness) on base of
3
?
187. Measures of concentration are used to
188. Area of concentration is area between
189. Ginny coefficient represents
190. Formula for calculating of Ginny coefficient is:

191. What is missing (in empty box) in formula for calculating of the Ginny
coefficient G = area of concentration
192. We calculated that the Ginny coefficient of consumption by citizens of a region
is 1. In that case, Lorenz curve is
193. If Lorenz curve is equal to line of total inequality, then Ginny coefficient has
value
194. We calculated that the Ginny coefficient is 0. In that case, Lorenz curve is: 195.
The image below represents a Lorenz curve:

Gini coefficient that corresponds to that Lorenz curve is
196. If Lorenz curve is equal to line of total equality, then Ginny coefficient has
value
197. In the group of 10 workers only one received a salary for February. What is
the Ginny coefficient for February salaries for 10 employees observed?
198. A Lorenz curve uses

Q i
CF i
14
IV. REGRESSION AND CORRELATION

199. The scatter plot graphically displays the form, direction and strength of the
relationship between more than two quantitative variables
200. When direction of relationship between two variables is positive, we can say that
High scores on one variable go with high scores on the the other variable
201. Correlation coefficient is a symmetric measure of association? - True
202. In multiple regressions, there is a single independent variable and more than one
dependent variable? - False
203. In multiple regressions, there is a single dependent variable and more than one
independent variable? True
204. Perfect negative correlation means that Correlation coefficient is equal to (-
1)
205. Perfect positive correlation means that Correlation coefficient is equal to (1)
206. The stronger the correlation means the better the prediction and the smaller
the errors of prediction
207. If correlation coefficient is equal to 1 then All points will line sloping
upwards to the right
208. The circle radius and circle area are in a functional relationship. The coefficient
of correlation of the circle radius and circle area is - 1
209. A study is done to see if there is a linear relationship between the life expectancy
of an individual and the year of birth. The year of birth is the Independent
variable
210. Consider the following pairs of data (independent variable given first):
(16, 136); (10, 98); (35, 155); (4, 80); (12, 110)
The regression line is Steeply sloped upward
211. The least-squares method determines the values of a and b that minimizes
unexplained variability
212. In term for decomposition of variance, selected part (y
i
y)
2
=
(y
i
y)
2
+
(y
i
y
i
)
2
is explained variability
213. In term for decomposition of variance, selected
part
(y
i
y)
2
=
(y
i
y)
2
+ is unexplained variability
214. Explained variability can be define as the sum of squares due to prediction
or regression

(y
i
y
i
)
2

15
2 (yi )2
coefficient of determination r =
2
? __
( y
i
y) - Y
216. Coefficient of determination is the fraction of the explained variability in
the total variability
_
217. _Formula for_ calculating of coefficient of determination is r
2
=(yi
y)
2
/(yi-y)
2

218. According to this LSM method, formula for calculating slope (b) for linear
regression model is b=Cov
xy
/
x
2

219. What is missing (in empty box) in formula for calculating of the slope (b)

for linear regression model b =
2
?
x
Cov
x
y
220. What is missing (in empty box) in formula for calculating of the slope (b)
Cov
xy

for linear regression model b =?
-
x
2

221. According to this LSM m_eth_od, formula for calculating intercept (a) for linear
regression model is a = y - bx
222. What is missing (in empty box) in formula for calculating of the intercept
(a) for linear regression model a = y x ? - b

223. Formula for calculating the coefficient of correlation for linear regression model
is r = Cov
xy
/
x
*
y

Cov
xy

coefficient of correlation for linear regression model r =? -
x
*
y

16
225. Slope (b) for linear regression model can be interpret as how much the
value of dependent variable changes as value of independent variable
changes for one unit
226. Intercept (a) for linear regression model can be interpret as which is the value
of the dependent variable when value for independent variable is 0
227. In multiple regressions, there is a single dependent variable and more that
one independent variable
228. If we made a conclusion: "For every 1 KM invested in the cost for promotional
activity about products, gross profit, on average, increased by 0.25 KM. If you
do not invest in the promotion, the gross profit will amount to 2,500 KM",
expression for the linear regression relationship between variables gross profit
(y) and cost of promotion (x) is y = 2.500 + 0.25 * x
i

229. A study was designed to examine the relationship between work experience
(in months) and time taken to produce one product (in minutes). The
regression results are given below:

The conclusion for p value of the t-test for independent variable work experience is:
- we reject null hypothesis that the coefficient b is equal to zero and we accept
that independent variable work experience is significant in model

230. A study was designed to examine the relationship between work experience
(in months) and time taken to produce one product (in minutes). The
regression results are given below:
17

The conclusion for p value of the ANOVA F-test is: - we reject null hypothesis that the
coefficient of determination is equal to zero and we accept that given regression
model is significant

18
V. DINAMIC ANALYSIS

231. Trend like part of time series is the basic direction (flow) of movements that
occur over a longer period of time
232. Irregular component like part of time series - is caused by random factors
233. The sale of a company that sells ice-cream is typically bigger in summer, than in
winter months. We consider the stated component to be - seasonal
234. Trend is a general, underlying movement in data
235. Seasonal variation is a regular fluctation expected during the year
236. A absolute change shows absolute change for some variable occur between
two periods
237. When we work with absolute change comparison is not always possible
238. Relative change has the feature computation? -
239. We followed the movement of GDP per capita (in $) for the period 20052009.
year. Absolute change 2008. in relation to the 2006. year amounted to $B, where
B had negative value. So, in 2008. in relation to the 2006. year, GDP per capita
was
240. We followed the movement of GDP per capita (in $) for the period 20052009.
year. Relative change 2008. in relation to the 2006. year amounted to 0.1. So, in
2008. in relation to the 2006. year, GDP per capita was
241. In determining the trend, the basic advantage of the smallest square method in
relation to the moving average method is
242. We monitored the fluctuation of the number of employees university graduates
in the period of 2000-2005, and calculated that the chain index for 2004 in
relation to 2003 is 104.71. The number of employees university graduates in
2004, in relation to 2003
243. Formula for calculating the basis indices is

244. What is missing (in empty box) in formula for calculating of the basis
V
t
indices i
t/0
= ?
245. Formula for calculating the chain indices is
246. What is missing (in empty box) in formula for calculating of the chain
V
t

indices i
t/t1
= ?
247. A quantity chain index number shows
248. A price base index number shows
249. The transitivity characteristic of indices can be expressed by formula

250. Relation beetwen the basic index and the rate of change can be expressed by
formula V
t/0
/V=i
t/0
-1
251. Formula for calculating the average annual rate of change is - r = V
n
/V
1
-1
1 n
19
252. What is missing (in empty box) in formula for calculating of the average annual
rate of change r = 1? V
n
/V
1

253. Aggregate quantity index that can eliminate subjectivity in the choice of weights
is Fisher quantity index
254. Decomposition of aggregate index of values can be expressed as
I
w
=P
q,i/0
*L
p,i/0
/100

255. The additive model for time series is appropriate when periodity in moving of
phenomenon is constantly in relation to the trend

256. The multiplicative model for time series can be expressed as Y = T * S * C *
N
257. Standard error of trend shows the average deviation of empirical values of
the time series from the estimated trend values
258. What is missing (in empty box) in formula for trend isolation
y
i
100 ? - y
ti

259. For trend isolation we calculated that
y
i
100 <100. That means influenced by
residium, phenomenon was below average

yti
260. For trend isolation we calculated that
y
i
100 >100 . That means influenced by
residium, phrenomenon was above average

1 n

Recapitulation For Midterm 1

Transféré par

Informations du document

Description originale:

Titre original

Copyright

Formats disponibles

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Droits d'auteur :

Formats disponibles

Recapitulation For Midterm 1

Transféré par

Droits d'auteur :

Formats disponibles

1

Vous aimerez peut-être aussi