-
社交网站的数据挖掘与分析
Facebook、Twitter和LinkedIn产生了大量宝贵的社交数据,但是你怎样才能找出谁通过社交媒介正在进行联系?他们在讨论些什么?或者他们在哪儿?这本简洁而且具有可操作性的书将揭示如何回答这些问题甚至更多的问题。你将学到如何组合社交网络数据、分析技术,如何通过可视化帮助你找到你一直在社交世界中寻找的内容,以及你闻所未闻的有用信息。 每个独立的章节介绍了在社交网络的不同领域挖掘数据的技术,这些领域包括博客和电子邮件。你所需要具备的就是一定的编程经验和学习基本的Python工具的意愿。 •获得对社交网络世界的直观认识 •使用GitHub上灵活的脚本来获取从诸如Twitter、Facebook和LinkedIn之类的社交网络API中的数据 •学习如何应用便捷的Python工具来交叉分析你所收集的数据 •通过XHTML朋友圈探讨基于微格式的社交联系 •应用诸如TF-IDF、余弦相似性、搭配分析、文档摘要、派系检测之类的先进挖掘技术 •通过基于HTML5和JavaScript工具包的网络技术建立交互式可视化 -
深入浅出统计学
样章试读请到下面的链接下载: 目录 http://goo.gl/tlCLf 序言 http://goo.gl/65x6e 第一章 http://goo.gl/WTnC9 第二章 http://goo.gl/5WUhT 若下载遇到问题,请邮件联系:lispython@gmail.com。谢谢! 《深入浅出统计学》具有深入浅出系列的一贯特色,提供最符合直觉的理解方式,让统计理论的学习既有趣又自然。从应对考试到解决实际问题,无论你是学生还是数据分析师,都能从中受益。本书涵盖的知识点包括:信息可视化、概率计算、几何分布、二项分布及泊松分布、正态分布、统计抽样、置信区 间的构建、假设检验、卡方分布、相关与回归等等,完整涵盖AP 考试范围。本书运用充满互动性的真实世界情节,教给你有关这门学科的所有基础,为这个枯燥的领域带来鲜活的乐趣,不仅让你充分掌握统计学的要义,更会告诉你如何将统计理论应用到日常生活中。 -
生物统计学基础
本书是国外优秀教材Fundamentals of Biostatistics (第五版)的中译本,由哈佛大学具有丰富教学经验的一流教授编写。 本书是介绍生物统计学重要知识和基本应用的导论性教材。书中运用丰富的医学和生物学实例及流程图,生动形象地阐明了生物统计学的概念内涵和方法公式。为了便于读者自学,本书尽量贯穿初等数学讨论,而不过多涉及高等数学证明,并且每章末附摘要、练习题和参考文献,书末有习题解答、索引及数据光盘。 本书适用于高等院校生物学和医学相关专业师生。 -
统计学
《统计学:从概念到数据分析》主要介绍了概率基础、统计的基本概念、描述性统计、估计、假设检验、回归与分类等内容,同时介绍了决策树、神经网络和随机森林等组合方法以及如何用R、SPSS、SAS等软件来实现相应的计算目标。 -
An introduction to categorical data analysis
Praise for the First Edition "This is a superb text from which to teach categorical data analysis, at a variety of levels. . . [t]his book can be very highly recommended." — Short Book Reviews "Of great interest to potential readers is the variety of fields that are represented in the examples: health care, financial, government, product marketing, and sports, to name a few." — Journal of Quality Technology "Alan Agresti has written another brilliant account of the analysis of categorical data." —The Statistician The use of statistical methods for categorical data is ever increasing in today's world. An Introduction to Categorical Data Analysis, Second Edition provides an applied introduction to the most important methods for analyzing categorical data. This new edition summarizes methods that have long played a prominent role in data analysis, such as chi-squared tests, and also places special emphasis on logistic regression and other modeling techniques for univariate and correlated multivariate categorical responses. This Second Edition features: Two new chapters on the methods for clustered data, with an emphasis on generalized estimating equations (GEE) and random effects models A unified perspective based on generalized linear models An emphasis on logistic regression modeling An appendix that demonstrates the use of SAS(r) for all methods An entertaining historical perspective on the development of the methods Specialized methods for ordinal data, small samples, multicategory data, and matched pairs More than 100 analyses of real data sets and nearly 300 exercises Written in an applied, nontechnical style, the book illustrates methods using a wide variety of real data, including medical clinical trials, drug use by teenagers, basketball shooting, horseshoe crab mating, environmental opinions, correlates of happiness, and much more. An Introduction to Categorical Data Analysis, Second Edition is an invaluable tool for social, behavioral, and biomedical scientists, as well as researchers in public health, marketing, education, biological and agricultural sciences, and industrial quality control. -
Statistics And Truth
This book deals with the philosophical and methodological aspects of information technology and the collection and analysis of data to provide insight into a problem, whether it is scientific research, policy making by government or decision making in our daily lives. The author seeks to dispels the doubts that chance is an expression of our ignorance which makes accurate prediction impossible and illustrates how our thinking has changed with quantification of uncertainty by showing that chance is no longer the obstructor but a way of expressing our knowledge. Indeed, chance can create and help in the investigation of truth. This theory is eloquently demonstrated with numerous examples of applications that statistics is the science, technology and art of extracting information from data and is based on a study of the laws of chance. It shows how statistical ideas played a vital role in scientific and other investigations even before statistics was recognized as a separate discipline, and how statistics is now evolving as a versatile, powerful and inevitable tool in diverse fields of human endeavour such as literature, legal matters, industry, archaeology and medicine. The use of statistics to the layman in improving the quality of life through wise decision-making is emphasized.