SHORT COURSE: PROBABILITY MODELS IN SAMPLING
- Erasmus Course -
Spring Semester 2014-15
Course Content
The course consists of a series of topics on the question of the existence of discrepancies between observed and anticipated variability in the data under a hypothesized model. The topics include: Distortion of observations - Distortion models, over-dispersion, under-dispersion, inflated and deflated distributions, biased distributions, weighted distributions, damaged observations, damage models, generating models, mixtures of distributions: finite mixtures, continuous mixtures, contagion. Truncated distributions, censored distributions, generalized (clustered) distributions, stuttering distributions, randomly stopped distributions, capture recapture method. Generalized distributions arising in the context of the generalized sampling schemes. Applications to problems in life sciences, accident theory, linguistics, internet access modeling.
Learning Outcomes
Upon successful completion of this course students should be able to:
demonstrate knowledge, understanding and ability to identify situations where the observed variability in data differs from that expected under a hypothesized model and be able to use various modelling approaches in tackling such situations.
produce a well structured, well written expository essay about a particular situation and the approach chosen to tackle it.
Announcements
Students interested in the course are kindly requested to get in contact with Professor Xekalaki's Office via e-mail oxek@aueb.gr. A meeting will subsequently be arranged to determine the weekly schedule of the course. The date and place of the meeting will be announced here.
Recommended Reading
Books:
Beard, Pentikainen and Pessonen (1993). Practical Risk Theory, 2nd edition, Chapman and Hall
Douglas J.B. (1980). Analysis with Standard Contagious Distributions. Statistical Distributions in Scientific Work Series 4. International Cooperative Publishing House, Fairland, Maryland USA
Everitt B.S, Hand D.J. (1981). Finite Mixtures Distributions. Chapman and Hall, London
Grandell (1997). Mixed Poisson Process, Chapman and Hall
Hogg and Klugman (1984). Loss Distributions. Wiley and Sons
Johnson N.L., Kotz S., Kemp A.W. (1992). Univariate Discrete Distributions. 2nd Edition Willey-New York
Lindsay B. (1995). Mixture Models: Theory, Geometry and Applications. Regional Conference Series in Probability and Statistics, Vol. 5, Institute of Mathematical Statistics and American Statistical Association
McLachlan G., Basford K. (1988). Mixture Models: Inference and Application to Clustering. Marcel and Decker Inc.
Ord K (1972). Families of frequency distributions. Griffins
Patil (ed) (1984). Statistical Distributions in Ecological Work. Vol. 4, Statistical Ecology Series
An Indicative List of Research Papers:
Charalambides, Ch. & Papageorgiou H. (1981). “Bivariate Poisson Binomial Distributions”. Biom. J., 23(5), 437-450.
Chatfield, C. & Theobald, C.M. (1973). “Mixtures and Random Sums”. The Statistician, 22(4), 281-287.
Greenwood, M. & Woods, H.M. (1919). “The incidence of industrial accidents upon individuals with special reference to multiple accidents”. Industrial Fatigue Research Board, Medical Research Committee, Report No. 4. Her Majesty's Stationery Office, London.
Dimaki, C. & Xekalaki, E. (1990). “Identifiability of Income Distributions in the Context of Damage and Generating Models”. Commun. Stat. A-Theor., 19(8), 2757-2766.
Famoye, F. & Singh, K.P (2006). “Zero-Inflated Generalized Poisson Regression Model with an Application to Domestic Violence”. J. Data Sci., 4, 117-130.
Gupta, P.L., Gupta, R.C. & Tripathi, R.C. (2004). “Score Test for Zero Inflated Generalized Poisson Regression Model”. Commun. Stat. A-Theor., 33, 47-64.
Hinde, J. & Demetrio, C.G.B. (1998). “Overdispersion: Models and Estimation”. Comput. Stat. Data An., 27, 151-170.
Irwin, J. (1968). “The Generalised Waring Distribution Applied to Accident Theory”. J. Royal Stat. Soc., A 131, 205-225.
Irwin, J. (1975). “The Generalised Waring Distribution, Part I”. J. Royal Stat. Soc., A 138, 18-31.
Irwin, J. (1975). “The Generalised Waring Distribution, Part II”. J. Royal Stat. Soc., A 138, 204-227.
Irwin, J. (1975). “The Generalised Waring Distribution, Part III”. J. Royal Stat. Soc., A 138, 374-384.
Karlis, D. & Xekalaki, E. (2003). “Mixtures Everywhere”. In: Panaretos, J (ed.) Stochastic Musings: Perspectives from the Pioneers of the Late 20th Century, 78-95. Laurence Erlbaum, USA.
Karlis, D. & Xekalaki, E. (2005). “Mixed Poisson Distributions”. Int. Stat. Rev., 73(1), 35-58.
Panaretos, J. (1982). “An Extension of the Damage Model”. Metrika, 29, 189-194.
Panaretos, J. (1983). “A Generating Model Involving Pascal and Logarithmic Series Distributions”. Commun. Statist. – Theory Meth., A 12 (7), 841-848.
Panaretos, J. (1989). “On the Evolution of Surnames”. Int. Stat. Rev., 57(2), 161-167.
Panaretos, J. & Xekalaki, E. (1986). “On Some Distributions Arising From Certain Generalized Sampling Schemes”. Commun. Statist. – Theor. Meth., 15(3), 873-891.
Panjer, H. (1981). “Recursive Evaluation of a Family of Compound Distributions”. ASTIN Bulletin, 12, 22-26.
Patil, G.P. & Rao, C.R. (1978). “Weighted Distributions and Size-Biased Sampling with Applications to Widlife Populations and Human Families”. Biometrics, 34, 179-189.
Patil, G.P., Rao, C.R. & Ratnaparkhi, M.V. (1986). “On Discrete Weighted Distributions and Their Use in Model Choise for Observed Data”. Commun. Statist. – Theory Meth., 15, 907-918.
Sichel, H.S. (1974). “On A Distribution Representing Sentence-Length in Written Prose”. J. Royal Stat. Soc., A 137, 25-34.
Willmot, G. (1993). “On Recursive Evaluation of Mixed Poisson Probabilities and Related Quantities”. Scand. Act. J., 18, 114-133.
Xekalaki, E. (1981). “Chance Mechanisms for the Univariate Generalized Waring Distribution and Related Characterizations”. Stat. Distr. Scien. W., 4, 157-171.
Xekalaki, E. (1983). “The Univariate Generalized Waring Distribution in Relation to Accident Theory. Proneness, Spells or Contagion?”. Biometrics, 39(3), 887-895.
Xekalaki, E. (1983). “Infinite Divisibility, Completeness and Regression Properties of the Univariate Generalized Waring Distribution”, Ann. I. Stat. Math., A 35, 279-289.
Xekalaki, E. (1983). “A Property of the Yule Distribution and its Applications”. Commun. Stat. A-Theor., 12(10), 1181-1189.
Xekalaki, E. (1984). “The Bivariate Generalized Waring Distribution and its Application to Accident Theory”. J. Royal Stat. Soc., A 147(3), 488-498.
Xekalaki, E. (1984). “Linear Regression and the Yule Distribution”. J. Econometrics, 24(1), 397-403.
Xekalaki, E. (1984). “Models Leading to the Bivariate Generalized Waring Distribution”. Utilitas Mathematica, 25, 263-290.
Xekalaki, E. (1985). “Factorial Moment Estimation for the Bivariate Generalized Waring Distribution”. Statistiche Hefte, 26, 115-129.
Xekalaki, E. (1986). “The Multivariate Generalized Waring Distribution”. Commun. Stat. A-Theor., 15(3), 1047-1064.
Xekalaki, E. (2006). “Under- and Overdispersion”. In: Enc. Act. Sci., 3, 1700-1705, Wiley.
Xekalaki, E. (2014). “On The Distribution Theory of Over-Dispersion”. J. Stat. Distr. Appl. (to appear).
Xekalaki, E. & Zografi, M. (2008). “The Generalized Waring Process and its Application”. Commun. Stat. A-Theor., 37(12), 1835-1854.
Xekalaki, E. & Panaretos, J. (1983). “Identifiability of Compound Poisson Distributions”. Scand. Act. J., 66, 39-45.
Xekalaki, E. & Panaretos, J. (1989). “On Some Distributions Arising in Inverse Cluster Sampling”. Commun. Statist. – Theory Meth., 18(1), 355-366.