-
Naked Statistics
The field of statistics is rapidly transforming into a discipline that Hal Varian at Google has called "sexy". And with good reason - from batting averages and political polls to game shows and medical research - the real-world application of statistics is growing by leaps and bounds. In Naked Statistics, Charles Wheelan strips away the arcane and technical details to get at the underlying intuition that is key to understanding the power of statistical concepts. Tackling a wide-ranging set of problems, he demonstrates how statistics can be used to look at questions that are important and relevant to us today. With the trademark wit, accessibility and fun that made Naked Economics a bestseller, Wheelan brings another essential discipline to life with a one-in-a-million statistics book that you will read for pleasure. -
The Signal and the Noise
"Nate Silver's The Signal and the Noise is The Soul of a New Machine for the 21st century." —Rachel Maddow, author of Drift Nate Silver built an innovative system for predicting baseball performance, predicted the 2008 election within a hair’s breadth, and became a national sensation as a blogger—all by the time he was thirty. The New York Times now publishes FiveThirtyEight.com, where Silver is one of the nation’s most influential political forecasters. Drawing on his own groundbreaking work, Silver examines the world of prediction, investigating how we can distinguish a true signal from a universe of noisy data. Most predictions fail, often at great cost to society, because most of us have a poor understanding of probability and uncertainty. Both experts and laypeople mistake more confident predictions for more accurate ones. But overconfidence is often the reason for failure. If our appreciation of uncertainty improves, our predictions can get better too. This is the “prediction paradox”: The more humility we have about our ability to make predictions, the more successful we can be in planning for the future. In keeping with his own aim to seek truth from data, Silver visits the most successful forecasters in a range of areas, from hurricanes to baseball, from the poker table to the stock market, from Capitol Hill to the NBA. He explains and evaluates how these forecasters think and what bonds they share. What lies behind their success? Are they good—or just lucky? What patterns have they unraveled? And are their forecasts really right? He explores unanticipated commonalities and exposes unexpected juxtapositions. And sometimes, it is not so much how good a prediction is in an absolute sense that matters but how good it is relative to the competition. In other cases, prediction is still a very rudimentary—and dangerous—science. Silver observes that the most accurate forecasters tend to have a superior command of probability, and they tend to be both humble and hardworking. They distinguish the predictable from the unpredictable, and they notice a thousand little details that lead them closer to the truth. Because of their appreciation of probability, they can distinguish the signal from the noise. With everything from the health of the global economy to our ability to fight terrorism dependent on the quality of our predictions, Nate Silver’s insights are an essential read. -
SPSS宝典
《SPSS宝典》(宝典丛书100万)共分24章,主要介绍SPSS for Windows的基础知识、统计数据的创建和管理、SPSS统计分析功能、SPSS的图形绘制功能以及SPSS编程功能。其中包括SPSS的窗口及其设置、统计数据的创建与编辑、SPSS数据的管理、数据转换与SPSS函数、SPSS基本统计分析、多重反应分析、SPSS的自定义表格、均值的比较与检验、方差分析、非参数检验、相关分析、回归分析、对数线性模型、聚类分析、判别分析、因子分析、对应分析、信度分析、统计图形的创建和编辑、交互图形的创建和编辑、SPSS的命令语句程序设计、利用SPSS语句读取数据文件、宏等内容。 本书内容全面,论述翔实,深入浅出。全书以SPSS统计功能为主线,涵盖数据管理和SPSS高级编程等内容,可供高等院校相关专业本科生、研究生,以及从事统计分析和决策的各领域相关的读者学习参考,亦可作SPSS培训和自学教材。 -
多变量分析
《多变量分析:SPSS的操作与应用》所介绍的多变量分析技术,除了SPSS/Base功能外,也针对Advanced等模块的功能加以说明,如平均数检定、一般线性模式、因素分析、集群分析、区别分析、回归分析等,并探讨一般书上少见的多元尺度法、TREE、Logistic、规则相关分析、联合分析、时间数列分析等进阶的多变量分析。此外,有关SPSS的外挂程序,包括结构方程模型AMOS与数据探勘Clementine等也多有着墨。 -
机器学习
这本书为机器学习技术提供了一些非常棒的案例研究。它并不想成为一本关于机器学习的工具书或者理论书籍,它注重的是一个学习的过程,因而对于任何有一些编程背景和定量思维的人来说,它都是不错的选择。 ——Max Shron OkCupid 机器学习是计算机科学和人工智能中非常重要的一个研究领域,近年来,机器学习不但在计算机科学的众多领域中大显身手,而且成为一些交叉学科的重要支撑技术。本书比较全面系统地介绍了机器学习的方法和技术,不仅详细阐述了许多经典的学习方法,还讨论了一些有生命力的新理论、新方法。 全书案例既有分类问题,也有回归问题;既包含监督学习,也涵盖无监督学习。本书讨论的案例从分类讲到回归,然后讨论了聚类、降维、最优化问题等。这些案例包括分类:垃圾邮件识别,排序:智能收件箱,回归模型:预测网页访问量,正则化:文本回归,最优化:密码破解,无监督学习:构建股票市场指数,空间相似度:用投票记录对美国参议员聚类,推荐系统:给用户推荐R语言包,社交网络分析:在Twitter上感兴趣的人,模型比较:给你的问题找到最佳算法。各章对原理的叙述力求概念清晰、表达准确,突出理论联系实际,富有启发性,易于理解。在探索这些案例的过程中用到的基本工具就是R统计编程语言。R语言非常适合用于机器学习的案例研究,因为它是一种用于数据分析的高水平、功能性脚本语言。 本书主要内容: ·开发一个朴素贝叶斯分类器,仅仅根据邮件的文本信息来判断这封邮件是否是垃圾邮件; ·使用线性回归来预测互联网排名前1000网站的PV; ·利用文本回归理解图书中词与词之间的关系; ·通过尝试破译一个简单的密码来学习优化技术; ·利用无监督学习构建股票市场指数,用于衡量整体市场行情的好坏; ·根据美国参议院的投票情况,从统计学的角度对美国参议员聚类; ·通过K近邻算法构建向用户推荐R语言包; ·利用Twitter数据来构建一个“你可能感兴趣的人”的推荐系统; ·模型比较:给你的问题找到最佳算法。 -
How to Lie With Statistics
"There is terror in numbers," writes Darrell Huff in How to Lie with Statistics. And nowhere does this terror translate to blind acceptance of authority more than in the slippery world of averages, correlations, graphs, and trends. Huff sought to break through "the daze that follows the collision of statistics with the human mind" with this slim volume, first published in 1954. The book remains relevant as a wake-up call for people unaccustomed to examining the endless flow of numbers pouring from Wall Street, Madison Avenue, and everywhere else someone has an axe to grind, a point to prove, or a product to sell. "The secret language of statistics, so appealing in a fact-minded culture, is employed to sensationalize, inflate, confuse, and oversimplify," warns Huff. Although many of the examples used in the book are charmingly dated, the cautions are timeless. Statistics are rife with opportunities for misuse, from "gee-whiz graphs" that add nonexistent drama to trends, to "results" detached from their method and meaning, to statistics' ultimate bugaboo--faulty cause-and-effect reasoning. Huff's tone is tolerant and amused, but no-nonsense. Like a lecturing father, he expects you to learn something useful from the book, and start applying it every day. Never be a sucker again, he cries! Even if you can't find a source of demonstrable bias, allow yourself some degree of skepticism about the results as long as there is a possibility of bias somewhere. There always is. Read How to Lie with Statistics. Whether you encounter statistics at work, at school, or in advertising, you'll remember its simple lessons. Don't be terrorized by numbers, Huff implores. "The fact is that, despite its mathematical base, statistics is as much an art as it is a science." --Therese Littleton