<?xml version="1.0" encoding="utf-8" ?>
<rss version="2.0">
<channel>
<title>Statistics</title>
<copyright>Copyright (c) 2013 California Polytechnic State University All rights reserved.</copyright>
<link>http://digitalcommons.calpoly.edu/statsp</link>
<description>Recent documents in Statistics</description>
<language>en-us</language>
<lastBuildDate>Sun, 21 Apr 2013 01:39:55 PDT</lastBuildDate>
<ttl>3600</ttl>


	
		
	

	
		
	

	
		
	







<item>
<title>Is Obesity Socially Contagious?</title>
<link>http://digitalcommons.calpoly.edu/statsp/32</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/32</guid>
<pubDate>Fri, 19 Apr 2013 16:51:14 PDT</pubDate>
<description>
	<![CDATA[
	<p>The main objective of this paper is to analyze three different articles that discuss whether obesity could be socially contagious. According to the World Health Organization in 2013, obesity is the fifth leading risk for deaths around the world. This disease has dramatically increased in the last decade, which has led scientists to believe there are other factors contributing to the epidemic besides genetics. The first article I analyzed, written by Nicholas Christakis and James Fowler, provided a logistic regression model to estimate the odds of a person becoming obese. The model included the explanatory variables: age, sex, education, smoking behavior, geographical distance, social distance, and the BMI of a close friend. Christakis and Fowler found clustering of obesity in the network, and claimed it was caused by influence from one person to another. The second article, written by Ethan Cohen-Cole and Jason Fletcher, included environmental factors into the model and the coefficients for influence were no longer statistically significant. This led Cohen-Cole and Fletcher to conclude the clustering in the network could be partially explained by environmental factors. The last article, written by Cosma Shalizi and Andrew Thomas, claimed that when using observational data, it impossible to distinguish between mechanisms in networks. They provided a counterexample where they simulated a network with no influence and received results where influence was present in the model. This disproved the results made in the Christakis and Fowler article, claiming that influence causes clustering of obese people. Shalizi and Thomas provided the code they used in their paper. When I reproduced the results and changed the parameters, I found an example when the Christakis and Fowler argument may hold. Both of these networks were simulated, therefore more research needs to be done with a real network in order to see if the Christakis and Fowler claim is true or not.</p>

	]]>
</description>

<author>Ciani Jean Sparks</author>


</item>






<item>
<title>Views on Sexual Assault among IFC Fraternities</title>
<link>http://digitalcommons.calpoly.edu/statsp/31</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/31</guid>
<pubDate>Fri, 19 Apr 2013 16:51:11 PDT</pubDate>
<description>
	<![CDATA[
	<p>The data collection and analysis for this project was performed for a consulting client Cierra, a fourth year Sociology major that works at the Safer Office on campus. She went to the consulting center on campus for help with the analysis of her project. She wanted to survey the IFC fraternities at Cal Poly on their views on sexual assault and rape. Thirteen IFC fraternities were surveyed with a total of 488 respondents. The responses to the 30 question True/False survey were used to evaluate the respondent’s empathy towards women, hostility towards women, and sexual aggression. Another research interest was to analyze the responses to the question: “If you could be assured of not being caught, how likely are you to commit rape?” Demographic variables, such as Year, whether the respondent has sisters, and survey version, were used as the independent variables in the statistical models. From the analysis it was discovered that respondent’s that reported had sisters generally had higher empathy towards women than those who reported that they did not have sisters. Also, it was discovered that some of the fraternities had on average higher sexual aggression than others. Analyzing the likelihood to commit rape question it was discovered that some fraternities answered higher on the likert scale more often than other fraternities.</p>

	]]>
</description>

<author>Steven LeGore</author>


</item>






<item>
<title>Journal Acceptance Policies on ETDs</title>
<link>http://digitalcommons.calpoly.edu/statsp/30</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/30</guid>
<pubDate>Fri, 19 Apr 2013 16:50:50 PDT</pubDate>
<description>
	<![CDATA[
	
	]]>
</description>

<author>Chelsea Kern</author>


</item>






<item>
<title>Analysis of Alcohol Use Among Pregnant Women in San Luis Obispo County</title>
<link>http://digitalcommons.calpoly.edu/statsp/29</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/29</guid>
<pubDate>Tue, 18 Dec 2012 11:21:01 PST</pubDate>
<description>
	<![CDATA[
	<p>Drinking alcohol during pregnancy is harmful to the fetus, and can lead to serious alcohol related developmental birth defects. Utilizing prenatal screening, such as the 4P’s Plus© screening tool, during a woman’s first prenatal doctors visit can help educate women and reduce continued alcohol use during pregnancy. Currently the CDC reports that 1 in 13 women in the US drink alcohol while pregnant compared to local reports that 1 in 3 women in San Luis Obispo County continue to drink alcohol during pregnancy. A primary concern for many local county health care experts and organizations is to raise awareness that pregnant women should completely abstain from drinking alcohol at all times during pregnancy. While any substance abuse during pregnancy is alarming, the biggest concern among health officials in San Luis Obispo County is alcohol use during pregnancy due to the high rate of women in this county who continue to drink alcohol while pregnant. The purpose of this analysis was to gain insight into the risk of perinatal drinking for San Luis Obispo women, by investigating possible risk factors associated with drinking alcohol while pregnant.</p>
<p>What led me to this project was my desire to assist the local community by turning health data into usable knowledge, as well as my aspiration to gain statistical consulting experience while working with a large real-life data set.</p>

	]]>
</description>

<author>Samantha Law</author>


</item>






<item>
<title>Comparative Analysis of Dispersion Parameter Estimates in Loglinear Modeling: Applied to E-Commerce Sales and Customer Data</title>
<link>http://digitalcommons.calpoly.edu/statsp/28</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/28</guid>
<pubDate>Tue, 25 Sep 2012 15:00:52 PDT</pubDate>
<description>
	<![CDATA[
	<p>When loglinear models are applied to count data the issue of over-dispersion often arises. Moment and maximum likelihood estimation methods in accounting for over-dispersion are widely used because they allow for model checking tools such as Chi-square, F, and likelihood ratio tests. Here is a comparison between R functions that each uses one method; glm.nb uses MLE, and glm.poisson.disp uses MME. The Index of Dissimilarity and visual model selection (ECDF plots) are also incorporated. These are applied to sales data using product and customer information compiled over the last five years that was generously provided by an e-commerce company.</p>

	]]>
</description>

<author>Scott Davis</author>


</item>






<item>
<title>Predictors of Hypertension and Prehypertension in Cal Poly students</title>
<link>http://digitalcommons.calpoly.edu/statsp/27</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/27</guid>
<pubDate>Thu, 09 Aug 2012 09:51:54 PDT</pubDate>
<description>
	<![CDATA[
	<p>This study analyzed predictors of hypertension and prehypertension in Cal Poly students. Hypertension and prehypertension are known to increase the risk of blood clots, plaque buildup, and tissue/organ damage from blocked arteries. Researching predictors of hypertension and prehypertension can help to determine methods of minimizing the probability of hypertension and prehypertension in a patient. Data from the FLASH study was used to analyze associations between possible predictor variables, such as stress and physical activity, and hypertension/prehypertension. BMI, bodyfat, and the interaction between videogames per weekend day and gender were found to be significantly associated with hypertension and prehypertension in Cal Poly students.</p>

	]]>
</description>

<author>Toria Mock</author>


</item>






<item>
<title>Analysis of Dietary Patterns over Freshman Year of College</title>
<link>http://digitalcommons.calpoly.edu/statsp/26</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/26</guid>
<pubDate>Wed, 08 Aug 2012 10:19:52 PDT</pubDate>
<description>
	<![CDATA[
	<p>This analysis is an investigation of changes in Cal Poly students’ eating habits over freshman year. The motivation behind this was an interest in college students’ lifestyles; college is the first time most students live on their own and it can be an important maturation period. College is stressful, exciting, liberating, and terrifying all at the same time. This distinctive life experience, along with my desire to handle big and messy data, led me to this research question.</p>
<p>The response variable analyzed was food consumption and the explanatory variables were: sex, race, quarter, food group, stress, exercise, BMI, sleep quality and quantity. These variables were chosen based on interest in how they could relate to the change in dietary patterns over the first year of college.</p>
<p>After investigating multiple methods, a split-split plot design was used to determine a significant difference in food consumption between fall 2009 and spring 2010. This was done using PROC MIXED in SAS 9.2 ®. The results provided evidence that dairy consumption is significantly different for males compared to females in fall 2009. There was also evidence that dairy and fruit consumption significantly differ for males compared to females in spring 2010. However, this is still a work in progress and many issues were encountered during the analysis.</p>

	]]>
</description>

<author>Chelsea Lofland</author>


</item>






<item>
<title>The River Crossing Game</title>
<link>http://digitalcommons.calpoly.edu/statsp/25</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/25</guid>
<pubDate>Thu, 21 Jun 2012 16:24:54 PDT</pubDate>
<description>
	<![CDATA[
	
	]]>
</description>

<author>Tyler Bramhall</author>


</item>






<item>
<title>An Analysis of Change of Major Behavior of Cal Poly Students</title>
<link>http://digitalcommons.calpoly.edu/statsp/24</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/24</guid>
<pubDate>Thu, 21 Jun 2012 16:24:50 PDT</pubDate>
<description>
	<![CDATA[
	
	]]>
</description>

<author>Logan Lossing</author>


</item>






<item>
<title>Lifestyle Choices in Relation to BMI and Blood Pressure</title>
<link>http://digitalcommons.calpoly.edu/statsp/23</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/23</guid>
<pubDate>Thu, 21 Jun 2012 16:24:45 PDT</pubDate>
<description>
	<![CDATA[
	<p>Cal Poly currently has one of the largest ongoing university health studies in the United States. Launched in Fall 2009, the Cal Poly FLASH study, led by the Kinesiology department and STRIDE, is a longitudinal study that tracks the classes of 2013 and 2014 through online surveys and physical assessments. The data collected covers various areas such as perceived health, lifestyle choices, and actual physical health.</p>
<p>My project analyzed the FLASH data to investigate the relationship between various perceived variables and actual health measures for Cal Poly freshmen. The motivation for this analysis was an interest in both diet and exercise and its impact on an individual’s overall health. In particular, my interest lies in what a person perceives as their diet and exercise regimen and how that relates to overall health. To assess overall health, I examined both the Body Mass Index (BMI) and blood pressure of students. BMI was computed using the standard formula involving height and weight. Blood pressure was classified by using both systolic and diastolic blood pressure.</p>
<p>Conventional wisdom states that proper diet and exercise leads to better overall health. I was interested in the following research question: “Can we simultaneously model college students’ BMI and blood pressure using various lifestyle variables?” The response variables chosen were BMI and blood pressure and the explanatory variables examined consisted of various lifestyle variables such as diet preference, activity level, marijuana use, cigarette use, and alcohol use. Before I simultaneously modeled BMI and blood pressure, I created several models that had univariate responses.</p>
<p>My goal was to simultaneously model college students’ BMI and blood pressure using different lifestyle variables. Using the FLASH data containing the first time physical assessment with its survey from that corresponding quarter, I was able to investigate this question.</p>
<p>In my investigation, I used Discrete Multivariate Analysis to compute two separate generalized logit functions for each response, BMI and blood pressure, and Cluster Analysis to group lifestyle variables by their similarities to each other. By using Discrete Multivariate Analysis, I was able to take into account the relationship that existed between BMI and blood pressure. In each model, sex was a significant explanatory variable. Cluster Analysis illustrated that while certain variables can be grouped together, many lifestyle variables are different from each other.</p>
<p>While an incredibly useful method, the sample size limitations that exist make it difficult to create models with multiple explanatory variables. For future analysis, it would be interesting to see the association of the overall health measures, BMI and blood pressure, with other lifestyle variables. Additionally, with further data cleaning, it might be interesting to add more lifestyle variables into the cluster analysis to see if more clusters form.</p>

	]]>
</description>

<author>Shawna Perry</author>


</item>






<item>
<title>Using the R Library RPanel for GUI-Based Simulations in Introductory Statistics Courses</title>
<link>http://digitalcommons.calpoly.edu/statsp/22</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/22</guid>
<pubDate>Thu, 21 Jun 2012 16:24:18 PDT</pubDate>
<description>
	<![CDATA[
	<p>As a student, I noticed that the statistical package R (<a href="http://www.r-project.org/">http://www.r-project.org</a>) would have several benefits of its usage in the classroom. One benefit to the package is its free and open-source nature. This would be a great benefit for instructors and students alike since it would be of no cost to use, unlike other statistical packages. Due to this, students could continue using the program after their statistical courses and into their professional careers. It would be good to expose students while they are in school to a tool that professionals use in industry. R also has powerful graphical abilities that would allow students to visualize their data and the effects of simulating no association on their data. Utilizing R would also allow students to read in their own data in order to further explore analyses and see more examples rather than just built in datasets as with other applets. Finally, simulation-based instruction that R is capable of does not necessarily need a formal test-statistic unlike other methods of instruction. This would allow students to use more intuition in learning statistical concepts.</p>
<p>Unfortunately, there are several challenges to using R in the classroom. Primarily, the user interface is designed around coding. This abstracts the important statistical concepts that students are trying to learn. Many times, students try to learn the code rather than learn the relevant statistics and end up not learning from the technology. This was the key motivation for my project. The idea was that I would build a graphical user interface (GUI) on top of R to allow students to explore statistical concepts and be exposed to R.</p>
<p>As such, I created five applets to allow students to explore simulation and distribution based tests found in introductory statistics courses:  <ol> <li>2x2 Table</li> <li>2x3 Table</li> <li>rxk Table</li> <li>Regression</li> <li>One-Way Analysis of Variance</li> </ol></p>
<p>These applets have the benefit over other similar approaches in that they allow students to see the animation of the effect of each repetition on simulation of no association, students do not need to have any knowledge of code or programming to utilize the applets, and most importantly, students and instructors have the ability to use their own data sets.</p>
<p>After utilizing several of these applets in introductory statistics courses, students responded very positively. Specifically, a survey was distributed to students following use of the ANOVA applet on a lab and on a homework assignment. The survey indicated that the project was a resounding success. Further, all students recommended use of the applets in future statistics courses. Specifically, students felt that it was helpful in furthering their knowledge of ANOVA. Students felt that it was particularly helpful because of the step-by-step animation showed students the simulation as it was happening. It allowed students to understand what was occurring in a much more visual fashion. Some students even preferred the applets over other packages like Minitab that students were more familiar with due to the straightforward and visual nature. Finally, students also recommended that statistics professors use the applet in future courses, as well.</p>

	]]>
</description>

<author>Ryan M. Allison</author>


</item>






<item>
<title>Adaptive Randomization Designs</title>
<link>http://digitalcommons.calpoly.edu/statsp/21</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/21</guid>
<pubDate>Thu, 21 Jun 2012 16:24:12 PDT</pubDate>
<description>
	<![CDATA[
	<p>Adaptive design methodologies use prior information to develop a clinical trial design. The goal of an adaptive design is to maintain the integrity and validity of the study while giving the researcher flexibility in identifying the optimal treatment. An example of an adaptive design can be seen in a basic pharmaceutical trial. There are three phases of the overall trial to compare treatments and experimenters use the information from the previous phase to make changes to the subsequent phase before it begins.</p>
<p>Adaptive design methods have been in practice since the 1970s, but have become increasingly complex ever since. One type of adaptive design is adaptive randomization. This is where the researcher makes changes in the way patients are randomized to treatment groups based on information gathered so far in the trial. Adaptations can be made to either trial procedures (eligibility criteria, study dose, treatment duration, study endpoints, laboratory testing procedures, diagnostic procedures, criteria for evaluation, and assessment of clinical responses) or statistical procedures (randomization, study design, study objectives/hypotheses, sample size, data monitoring and interim analysis, statistical analysis plan, and methods for data analysis).</p>
<p>The biased coin design, adaptive biased coin design and covariate adaptive randomization are specific types of adaptive randomization designs which are aimed at balancing certain aspects of a clinical trial. The biased coin design and the adaptive biased coin design aim to balance treatment group sample sizes by assigning the next patient to the group with the smaller sample size with higher probability. The biased coin design uses a fixed probability to achieve this whereas the adaptive biased coin design determines the severity of the imbalance between treatment groups using the total sample size to scale the difference in treatment group sample sizes. The probability of assignment to a specific group in the adaptive biased coin design thus depends on the ratio of difference in sample sizes between treatment groups to total sample size. Covariate randomization designs aim to balance the covariates across the treatment groups by assigning the next patient to the group that causes the smallest maximum imbalance across the covariate groups.</p>
<p>SAS® Macros to perform the adaptive biased coin design and covariate adaptive randomization are available upon request.</p>

	]]>
</description>

<author>Jenna Colavincenzo</author>


</item>






<item>
<title>Analyzing Multiple Independent Spatial Point Processes</title>
<link>http://digitalcommons.calpoly.edu/statsp/20</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/20</guid>
<pubDate>Mon, 04 Jun 2012 17:08:07 PDT</pubDate>
<description>
	<![CDATA[
	
	]]>
</description>

<author>Neal Grantham</author>


</item>






<item>
<title>MANOVA: Type I Error Rate Analysis</title>
<link>http://digitalcommons.calpoly.edu/statsp/19</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/19</guid>
<pubDate>Tue, 10 Jan 2012 17:22:41 PST</pubDate>
<description>
	<![CDATA[
	
	]]>
</description>

<author>Christopher Dau Wei Ling</author>


</item>






<item>
<title>An Analysis of Breast Cancer Metastasis</title>
<link>http://digitalcommons.calpoly.edu/statsp/18</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/18</guid>
<pubDate>Tue, 06 Dec 2011 15:13:10 PST</pubDate>
<description>
	<![CDATA[
	<p>The main objective of this paper is to evaluate possible socio-economic status, clinical, and treatment associations with the occurrence of distant metastasis in Stage I – III breast cancer patients.  After analysis in a logistic regression model, four variables were found to be significant with occurrence of distant metastases.  These variables were: education, disease group (Triple-negative, Her2Neu-positive and Luminal A), stage at diagnosis, and concordance to chemotherapy based on the NCCN guidelines.  Patients without a college degree were found to be more likely to develop distant metastasis than those with a college degree (OR = 2.46 95% CI 1.44 – 4.23).  Triple-negative and Her2Neu-positive patients had higher odds of having distant metastasis than those in with luminal A disease (OR = 3.88 and 3.22 95% CI 2.25 – 6.69 and 1.88 – 5.52, respectively).  Stage III patients also had higher odds of having distant metastasis than those with Stage I disease (OR = 5.41 95% CI 2.74 – 10.65).  Finally, an unusual result was discovered where patients who were not classified to a chemotherapy guideline were significantly less likely to have distant metastasis than their counterparts who received the recommended chemotherapy (OR = .32 95% CI 0.17 - 0.58).</p>

	]]>
</description>

<author>Jennifer Lee Gildner</author>


</item>






<item>
<title>An Exploration of Non-Detects in Environmental Data</title>
<link>http://digitalcommons.calpoly.edu/statsp/17</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/17</guid>
<pubDate>Wed, 28 Sep 2011 09:43:01 PDT</pubDate>
<description>
	<![CDATA[
	
	]]>
</description>

<author>Juliana Fajardo</author>


</item>






<item>
<title>Dietary Patterns in Relation to Sleep and Stress in Cal Poly Freshman</title>
<link>http://digitalcommons.calpoly.edu/statsp/16</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/16</guid>
<pubDate>Mon, 20 Jun 2011 15:09:23 PDT</pubDate>
<description>
	<![CDATA[
	<p>The Cal Poly FLASH study is a research project that was developed to assess overall health of college students. Beginning in Fall 2009, data have been collected longitudinally via online surveys and physical assessments on Cal Poly freshmen.  Responses from 1520 students from Fall 2009 were used to investigate whether stress and sleeping habits are related to dietary patterns among Cal Poly students.</p>
<p>Factor analysis was used to categorize 33 food frequency variables into two categories – junk food and healthy food. Then, stepwise selection in a general linear model was conducted to identify lifestyle and demographic variables associated with these two food categories.</p>
<p>Statistically significant predictors of junk food consumption were sex, stress, average hours of sleep on the weekend, the interaction between sex and average hours of sleep on the weekend, and eating preference (vegetarian vs. non-vegetarian). When predicting the consumption of healthy food, the variables sex, days of moderate physical activity, days of vigorous physical activity, eating preference, and whether one perceives oneself as an early bird or a night owl were statistically significant. The study results show that how much junk food and healthy food a Cal Poly freshman student consumes per month is related to stress, sleep, and exercise, as well as the student’s sex and eating preference.</p>

	]]>
</description>

<author>Emily J. Conklin et al.</author>


</item>






<item>
<title>Teaching Introductory Statistics</title>
<link>http://digitalcommons.calpoly.edu/statsp/15</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/15</guid>
<pubDate>Mon, 20 Jun 2011 15:09:21 PDT</pubDate>
<description>
	<![CDATA[
	<p>Exploration and research on how to be an effective teacher of a introductory statistics course while in graduate school. Collected ideas and suggestions through reading articles, taking courses in statistics education, and administered student surveys to introductory statistics students. With all of the collected ideas and suggestions, sumarized what others believe is the proper way to teach and how I think a class should be taught.</p>

	]]>
</description>

<author>Alex Herrington</author>


</item>






<item>
<title>Using Survival Analysis Methods to Study Santa Barbara County Divorces</title>
<link>http://digitalcommons.calpoly.edu/statsp/14</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/14</guid>
<pubDate>Tue, 14 Jun 2011 16:16:03 PDT</pubDate>
<description>
	<![CDATA[
	
	]]>
</description>

<author>Joel Vazquez</author>


</item>






<item>
<title>Music and Radio Preferences on the Cal Poly Campus</title>
<link>http://digitalcommons.calpoly.edu/statsp/13</link>
<guid isPermaLink="true">http://digitalcommons.calpoly.edu/statsp/13</guid>
<pubDate>Tue, 14 Jun 2011 09:32:34 PDT</pubDate>
<description>
	<![CDATA[
	
	]]>
</description>

<author>Rory Bloch</author>


</item>





</channel>
</rss>
