Explore BrainMass

Data Collection

Data collection is fundamental to the study of statistics because it is an integral step in the process of carrying out a research study. There are different types of methods which can be implemented to go about collecting data. In statistical studies some of the most common techniques for data collection include population censuses, sample surveys and conducting experiments.

Before a researcher can begin collecting data, a planned research design with a focused study objective needs to be drawn out. Data collection has an influence on many factors in a statistical study such as on the study design, the survey methodology and later on, the conclusions which can be formulated. In order for a study to formulate valuable conclusions, the method of data collection needs to be organized in a way which meets the needs of the study, most importantly the question(s) being asked and units being collected.

For example, pretend you are a bird researcher and you are studying red-winged blackbird populations in all of the marshes located near busy, urban roads in your local city. Although you plan on collecting current data, you are also fortunate enough to have past collected data on red-wing blackbird populations residing in the marshes you are studying, from 20 years ago. The purpose of this current study is to analyze whether an increase in urban activity over 20 years has caused a change in population size.

From the above scenario, it is clear that the units being measured are red-wing black birds, which are being counted, and the question of interest is whether urban activity influences population size. Now when deciding how to design the experimental study, considering how exactly to go about collecting the data is critical. Some things to consider may include: how large of a sample size should be collected at each marsh or what time of day to collect the data. Clearly this example illustrates how the process of data collection is an important step in designing a successful statistical study.


© BrainMass Inc. brainmass.com November 18, 2018, 11:54 am ad1c9bdddf

Categories within Data Collection

Dave's Bottling Company

Dave's Bottling Company has a filling machine that can be set to 8 ounces, twelve ounces or 24 ounces. The standard deviation of the fill volumes is supposed to be no more than .05 ounces. The production manager, Charlene Tibbs, is afraid the machine needs servicing and wants to test the production. The machine was set for 12 ou

Linear Regression Analysis of SAT & PSAT scores

Listed below (and in the available Excel Data Set file) are the PSAT and SAT scores from prospective college applicants. The scores were reported by subjects who responded to a request posted by the web site talk.collegconfidential.com. PSAT 183 207 167 206 197 142 193 176 SAT 2200 2040 1890 2380 2290 2070 2370 1980 a) W

adding a survey questionnaire to this project

For more than a century, the newspaper industry has been a dominant source of news in the U.S. (Stephens, 2010). Newspapers have experienced substantial declines in circulation, advertising sales, profitability, and readership (Tucker, 2009). Some print newspaper organizations are transitioning to online news distribution models

Anova - hand calculations in Excel

Chpt 12: Inferences Pairs of Treatment Means Assignment The following are six observations collected from treatment 1, ten observations collected from treatment 2, and eight observations collected from treatment 3. Test the hypothesis that the treatment means are equal at the .05 significance level. Treatment 1 Treatment 2

Simple regression and hypothesis testing for correlation

1.The production department of Celltronics International wants to explore the relationship between the number of employees who assemble a subassembly and the number produced. As an experiment, 3 employees were assigned to assemble the subassemblies. They produced 8 during a one-hour period. Then 5 employees assembled them. They

Solve a questions to two independent samples t-test using excel

Two Population Means A tomato farmer with a very large farm of approximately 2200 acres had heard about a new type of rather expensive fertilizer which would supposedly significantly increase his production. The frugal farmer wanted to test the new fertilizer before committing the large investment required to fertilize a farm

Solve question on scatter plot, relationship, outlier, causality

Hello, Please I want to answer these problems. Please help me thinks. Problem 1) Look at the scatter plot below. Does it demonstrate a positive or negative correlation? Why? Are there any outliers? What are they? Problem 2) Look at the scatter plot below. Does it demonstrate a pos

Hypergeometric, binomial and uniform distribution

1.The Internal Revenue Service is studying the category of charitable contributions. A sample of 20 returns is selected from young couples between the ages of 20 and 35 who had an adjusted gross income of more than $100,000. Of these 20 returns, 6 had charitable contributions of more than $1,000. Suppose 4 of these returns are s

Data Collection Variables

1) Identify the following variables as either qualitative or quantitative and EXPLAIN your answers. a. The number of people on a jury. b. The color of your house 2) Identify the number as either continuous or discrete and EXPLAIN your answers. a. The average height of all freshmen entering college in a certain year is

Proposal paper

Write a proposal paper (1,250-1,500 words) for a major change project that you would like to lead. Identify a health care issue that interests you and explain why. Develop a rationale using evidence-based research, including: 1) The background. 2) Statistical findings. 3) Probable stakeholders. 4) Logical conclusions. Des

Statistical Analysis - Roddy Analysis Worksheet

Roddy Analysis Worksheet Using the worksheet below, complete an assessment of the following article. Roddy, E., Zhang, W., Doherty, M., Arden, N. K., Barlow, J., Birrell, F., et al. (2006). Evidence-based clinical guidelines: A new system to better determine true strength of recommendation. Journal of Evaluation in Clinica

Descriptive and inferential

Conduct descriptive and inferential analyses using your selected data set, your prepared database, and SPSS. Summarize numerical results in tables. • Conduct selected descriptive statistics analyses using SPSS. • Summarize the numerical results with descriptive analysis tables or graphs, including your interpretation. •

New health care program in the United States

Details: For the following assignment, use the Rank Correlation Utilizing Excel Use a rank correlation coefficient to test for a correlation between two variables. Use a significance level of α=0.05. x (cigarettes per day) 60 10 4 15 10 1 20 8 7 10 10 20 y(cotinine) 179 283 75.6 174 209 9.51 350 1.85 43.4 25.1 408 344

IQ report

Home work 1. Effects of Heredity and Environment on IQ. In studying the effects of heredity and environment on intelligence, it has been helpful to analyze the IQs of identical twins who were separated soon after birth. Identical twins share identical genes. By studying identical twins raised apart, we can eliminate the vari

Grouped frequency distribution using Excel

1. Open a blank Excel file and create a grouped frequency distribution of the maximum daily temperatures for the 50 states for a 30 day period. Use 8 classes. 2. Add midpoint, relative frequency, and cumulative frequency columns to your frequency distribution. 3. Create a frequency histogram using Excel. You will probabl


Explain how fonts or formats might incorrectly be used to represent information, and give an alternative way of representing that same information that would be suitable for analysis by IBM SPSS.

Journal Article Review (Statistics)

Answer the question according to the attached article (Branding smartphone industry.pdf) Overview: 1) What is the main hypothesis / question posed by this study ? 2) Why is it an important question to ask/investigate ? 3) Describe the study design (e.g. cross-sectional, longitudinal, etc.) Statistical Anal

Statistics - Data Classification (Numerical Vs. Categorical)

1) The following information is collected from students upon exiting the campus bookstore during the first week of classes. a. Amount of time spent shopping in the bookstore b. Number of textbooks purchased c. Academic major d. Gender Classify each of these variables as categorical or numerical. If the variable is numerical

Basic Concepts in Probability Distributions

QUESTION 1 A continuous uniform distribution U(100,200) will have the same standard deviation as a continuous uniform distribution U(200,300). True False QUESTION 2 There is a simple formula for normal areas, but we prefer a table for greater accuracy. True False QUESTION 3 The area under a normal curve is 1 onl

Descriptive, Graphical and T-Test Using SPSS

You will create this Word document by cutting and pasting SPSS output into Word. 1. Why do we care whether the assumptions required for statistical tests are met? (You might also want to write your answer on a note card you paste to your computer.) 2. Open the data set that you corrected in a previous assignment for Downlo

SPSS and Exploratory Data Analysis

Write a short paragraph that highlights your understanding of why exploratory data analysis is a critical part of any analytical strategy (300-500 words).

Selecting the Appropriate T-test

For this assignment, use data from W1 Assignment 2. Compute a t-test comparing males' and females' heights. You must determine which type of t-test to compute. Move your output into a Microsoft Word document and write a one-paragraph, APA-formatted interpretation of the results.

Business Statistics: Chi-square Test

Use the Student_Data.xls file which consists of 200 MBA students at Whatsamattu U. The file includes variables regarding students' age, gender, major, GPA, Bachelors GPA, course load, English speaking status, family, weekly hours spent studying. Each of the three assigned problems should be formatted as a one page memo. Answers

Normal Probabilities and Confidence Interval

Use the Student_Data.xls file which consists of 200 MBA students at Whatsamattu U. The file includes variables regarding students' age, gender, major, GPA, Bachelors GPA, course load, English speaking status, family, weekly hours spent studying. Each of the three assigned problems should be formatted as a one page memo. Answers

Calculating Mean, Absolute Value, and Sum of 5 Deviation Scores

Given the following set of numbers: 85, 100, 90, 95, 80; 1). Show each step on how to calculate the arithmetic mean for the five numbers 2). Show each step on how to obtain the absolute value of each of the five deviation scores and show me how to sum them. 3) Divide the sum of the deviation scores by the number of me

Normal Probabilities - Solving Problems Using Z Score

If a drive to work each morning is µ = 38 mins with a σ = 5 mins, on a randomly selected morning what is the probability that the drive will take between 36 and 40 mins? IQ scores form a normal distribution with µ = 100 and σ = 15. Individuals with IQs above 140 are classified in the genius category. What proportion of th

Normal Probabilities - Using Z Score

A normal distribution has a mean of u= 40 and o=10. if a vertical line is drawn through the distribution at x= 55, what proportion of the scores on the right side of the line? A normal distribution has a u= 80 and o= 10. what is the probability of randomly selecting a score greater than 90 from this distribution? A normal