Log In Start studying!

Select your language

Suggested languages for you:
StudySmarter - The all-in-one study app.
4.8 • +11k Ratings
More than 3 Million Downloads
Free
|
|
Data Handling and Analysis

Every time you bake a cake, you probably break down the process into different steps. You may pick a recipe, go to the shops, arrange the ingredients, and follow the recipe. Data handling is somewhat similar. Researchers follow a logical order to get the best quality end product possible.Data handling is vital in any psychological study and directly impacts the…

Content verified by subject matter experts
Free StudySmarter App with over 20 million students
Mockup Schule

Explore our app and discover over 50 million learning materials for free.

Data Handling and Analysis

Want to get better grades?

Nope, I’m not ready yet

Get free, full access to:

  • Flashcards
  • Notes
  • Explanations
  • Study Planner
  • Textbook solutions
Data Handling and Analysis

Save the explanation now and read when you’ve got time to spare.

Save
Illustration

Lerne mit deinen Freunden und bleibe auf dem richtigen Kurs mit deinen persönlichen Lernstatistiken

Jetzt kostenlos anmelden

Nie wieder prokastinieren mit unseren Lernerinnerungen.

Jetzt kostenlos anmelden
Illustration

Every time you bake a cake, you probably break down the process into different steps. You may pick a recipe, go to the shops, arrange the ingredients, and follow the recipe. Data handling is somewhat similar. Researchers follow a logical order to get the best quality end product possible.

Data handling is vital in any psychological study and directly impacts the quality of the results obtained. Let's take a closer look at data handling and analysis.

  • The explanation will start by exploring data analysis in psychology, covering its uses and importance in research.
  • Next, we will delve into the steps of data handling.
  • Then we will review data handling and quantitative analysis together.
  • Then, we will explore how data handling and analysis in research is carried out.
  • And lastly, we will explore a data handling and analysis example in a research setting.

Data Analysis Psychology

Once data has been collected, there are several things that the researchers need to do, and one of these is data handling.

Data handling is the process of organising and analysing raw data using a logically valid and reliable process to establish if the findings from the study support or reject the hypothesis stated at the start of the experiment.

The steps that researchers take during data analysis are important because they can affect the validity and reliability of the findings. For instance, if the researcher found something unexpected and chose to ignore the variables they were initially interested in, the study will no longer be investigating what it intends to. Thus, lowering the validity of the study.

Moreover, if researchers don't use standardised procedures to analyse findings, this can lower the reliability of the study.

Steps of Data Handling

Now that we have learned the importance of data handling, let's uncover what steps data handling includes. There are six steps in data handling, which are:

Steps
Description
Collection
The first step of data handling is to collect raw data from participants or whichever source the information is being collected from.
Preparation
The second step involves the preparation of the raw data for further analysis; this may involve coding data. E.g. male participants may be ascribed 0, and female participants may be attributed 1.
Input
The third involves inputting and storing the data. The findings should be stored securely to maintain participant confidentiality.
Processing
The fourth step involves conducting computational or statistical tests relevant to the research. E.g. if trying to understand the relationship between two variables, a correlational analysis should be run.
Output
The next step is verifying if the findings support or disprove the hypothesis.
Storage
The last step is to save the data securely.

Before a researcher analyses their research or even handles it, the researcher should have a clear and carefully plan on the direction of their research. For instance, before data is collected, the researcher should know the source of where they will collect the information.

Researchers can use primary or secondary data in their research.

Primary research refers to data the researcher collects themself, and secondary data uses data collected from others, e.g. previously published findings or statistics from government sites and databases.

Data analysis is a step that follows after a researcher has handled data.

Data analysis is the process in which graphical and quantitative or statistical techniques are applied to raw data to identify general patterns.

In research methods, two types of data are collected: qualitative and quantitative. Qualitative data is processed slightly differently from quantitative data. Let's take a look at data handling in quantitative analysis.

Quantitative Analysis and Data Handling

Quantitative data is data that is expressed in numerical form. Quantitative analysis usually involves using a mathematical approach and statistics to identify whether the findings support or disprove hypotheses.

Statistical analysis includes various mathematical calculations using probability models to make inferences from a given data set and draw conclusions about broader populations.

When carrying out statistical analyses, there are essential data handling steps that the researcher needs to consider. The reason is that statistical tests have certain conditions.

One of these conditions is identifying the level of measurement. There are four levels of measurement which essentially distinguish the different characteristics of variables. It is essential to understand the characteristics of variables because these will hint at which statistical analyses could be done and which could not.

Level of Measurement
Definition
Variable Example
Nominal Distinguishes differences, but there is no order to them, and we can't measure how much each quantitatively differs.Participants eye colour
Ordinal Distinguishes differences and identifies that the values have a rank order, but the difference can't be quantitively measured.Response on a Likert scale
Interval There is an order, and the differences between figures are measurable. Interval and ratio data can be of infinite value, but unlike ratio, interval data can go below 0. Temperature
RatioThe ratio is the same as the interval with the difference that there is an absolute 0, meaning the values of the variable cannot go below 0. Height

Data analysis in psychology

As you learned, data analysis is the process in which statistical techniques are applied to find patterns within a sample. The data analysis in a study usually follows two steps. First, descriptive statistics are inspected, and then statistical analyses are performed.

As the name suggests, descriptive statistics describe the data's characteristics, and the two main types of descriptive statistical tests used are the measures of central tendency and measures of dispersion.

The measures of central tendency tests are used to calculate averages, and the three main types of tests are:

  • Mean: the average score of the data set. It is calculated by adding all scores up and dividing them by the number of scores.
  • Median: the central score in a given data set. It is calculated by putting all scores in order and picking the one in the middle.
  • Mode: the most common score. It is calculated by looking at the scores and identifying which number has the highest frequency.

The measures of dispersion tests are used to measure the spread/ variance of the data. The common computations calculated are:

  • Range: this is the difference between the lowest and the highest score.
  • Standard deviation: the average distance of each score from the mean.

Data handling and analysis in research

Generally, descriptive statistics involve presenting the data. This presentation of data is usually done using graphs. The following are types of graphs:

Tables are used to show contrasts between a few sets of data. For instance, the table above shows the difference between control and drug conditions according to mean and standard deviation measurements.

Control
Drug condition
Mean
119
86
Standard deviation
23
98

Bar charts show the results of different conditions (or variables) using bars of different heights. Bar charts illustrate the differences between groups and make identifying trends and patterns easier.

Data handling and analysis. Bar chart. VaiaFig 1. Bar chart example

Scattergrams are also incorporated in correlational research. From scatterplots, researchers can identify the direction and strength of the relationship between two variables.

Data handling and analysis. Scattergram. VaiaFig 2. Scatterplot example.

Before inferential tests are conducted, researchers usually run descriptive analyses. Following this, the distribution of the data is analysed.

Distribution measures the spread of data from the average; it is a form of probability statistic that makes estimations concerning a sample.

The type of distribution found will affect what statistical analyses can do later.

When data is normally distributed, parametric tests are employed, and when data is not normally distributed, non-parametric tests are employed.

There are two types of distribution:

  • Normal distribution forms a bell-shaped curve, as most data points are clustered towards the central values, while fewer data points are at the extremes.

Data-handling and analysis. Normal distribution depicted in a graph. Vaia.

Fig. 3. Depiction of normal distribution.
  • Skewed distributions, the data set can be positive (right) or negative (left).

Data Handling and Analysis. Skewed Distribution Depiction. Vaia.

Fig 4. Depiction of skewed distributions.

Statistical Analysis

The statistical analyses employed in psychology research use inferential statistics to identify if the data supports or negates their hypothesis.

Sometimes we may find differences, but these may not be significant. The accepted level of probability in psychology is 0.05 (5%). Now, what exactly does this mean in research? Significance levels below this threshold suggest the observed change in the dependent variable is likely due to the manipulation of the independent variable. However, if it is above the threshold, the observed changes are likely due to chance.

The 5% considers extraneous variables that may have influenced the dependent variable.

The findings should not be accepted when research establishes differences, but the results are non-significant.

Data Handling and Analysis Example

Let's look into a real example of how data is handled and analysed in research.

In a study looking into the relationship between revision time and exam performance, researchers would first consider they will gather their data.

Revision time can be collected by asking participants to record how long they spent revising, and exam performance can be measured by looking at pupils' test scores on their report cards.

Initially, the researchers would carry out analyses concerning descriptions, e.g. the mean scores for revision time and exam performance. The findings may be reported as students in the current sample reported a mean revision time of around 6h (M = 5.78) and an average score of 78 points out of 100 in the exam (M = 78).

The two variables' distribution will be explored through a histogram. Depending on the visual inspection of the distribution, parametric or non-parametric tests would be performed. Let's imagine that the distribution of the two variables was normally distributed.

Researchers would then move on to data analysis, i.e. inferential statistics. To test the hypothesis, the researchers may run a correlational analysis. The Pearson correlation may be interpreted as the analysis shows a positive correlation between revision time and exam performance, r (20) = .78, p = .05.

Further to the written interpretation, researchers would include a scatterplot visually resembling the same interpretation.

Data Handling and Analysis - Key takeaways

  • Psychologists use data handling and analysis to interpret the data they collect from their studies.

  • There are different data types: qualitative, quantitative, primary, and secondary.

  • Descriptive statistics are graphs, tables, and summaries used to identify trends and analyse research data.

  • Levels of measurement include ordinal, ratio, interval, and nominal.

  • Thematic analysis is used to analyse qualitative data, and inferential testing is used to analyse quantitative data.

Frequently Asked Questions about Data Handling and Analysis

Psychologists use data handling and analysis to interpret data collected from research. 

Data analysis is concluding data that has been gathered and presented using graphs and charts. Before data can be analysed, it needs to be handled, e.g., it needs to be stored and organised in a way that makes analysis easier. 

To identify if the findings from the study support or reject the hypothesis proposed. 

There are many types of data analysis, including measures of central tendency, graphs, inferential testing, (non-) parametric tests, probability and significance, thematic analysis, and more.

An example of data analysis is thematic analysis; this involves analysing qualitative data by identifying common themes throughout the text. 

Final Data Handling and Analysis Quiz

Data Handling and Analysis Quiz - Teste dein Wissen

Question

Define data handling and analysis. 


Show answer

Answer

Data handling and analysis are used by psychologists to interpret the data they collect from their studies.

Show question

Question

What are the steps in data handling?

Show answer

Answer

  1. Collection.
  2. Preparation.
  3. Input.
  4. Processing.
  5. Output.
  6. Storage.

Show question

Question

What is quantitative data?


Show answer

Answer

Numerical data.

Show question

Question

What is qualitative data? 


Show answer

Answer

Data that uses words rather than numbers.

Show question

Question

Give an example of qualitative data.


Show answer

Answer

 A diary entry.

Show question

Question

Which of these is not a measure of central tendency?


Show answer

Answer

Medium.

Show question

Question

What do measures of central tendency aim to find?

Show answer

Answer

Averages.

Show question

Question

Name two types of graphs.

Show answer

Answer

Bar chart and scattergram.

Show question

Question

Name three distribution types.

Show answer

Answer

Normal, positive skew, negative skew.

Show question

Question

In which direction does a positive skew go?

Show answer

Answer

To the right.

Show question

Question

In which direction does a negative skew go?


Show answer

Answer

To the left.

Show question

Question

Analysis that aims to find common themes is known as _____ analysis.


Show answer

Answer

Thematic.

Show question

Question

Give an example of a case study used in psychology.


Show answer

Answer

Phineas Gage.

Show question

Question

What is inferential statistics?


Show answer

Answer

Inferential statistics is data that allows us to make predictions or inferences.

Show question

Question

What is the difference between parametric and non-parametric tests?


Show answer

Answer

Parametric tests assume knowledge of the population, while non-parametric tests do not.

Show question

Question

What are two examples of statistical tests?


Show answer

Answer

Sign test and critical value.

Show question

Question

What is the accepted level of probability in psychology? 


Show answer

Answer

0.05 or 5%.

Show question

Question

What are inferential tests?

Show answer

Answer

Inferential tests are tests such as hypothesis testing that help understand if data collected can be used to make predictions/inferences concerning generalisability to the population.

Show question

Question

Give examples of experimental and sampling errors that may influence inferential tests.

Show answer

Answer

Small sample size, confounding variables that affect the dependent variable, inaccurate or lack of precision when conducting research.

Show question

Question

How are alpha scores used as an inferential measure of analysis?

Show answer

Answer

If the alpha level is analysed to be lower than 0.5, then the alternative hypothesis can be accepted. This indicates that the results are unlikely due to chance or a Type 1 error and can be generalised to the population.

Show question

Question

How are p scores used as an inferential measure of analysis?

Show answer

Answer

If an appropriate p-value is indicated, then the null hypothesis can be rejected, and the data indicates suitability to be generalised to the general population.

Show question

Question

How are confidence intervals used as an inferential measure of analysis?

Show answer

Answer

Confidence intervals can guide how much the sample deviates from the population. If the data vastly differs, it is unlikely that it can be generalised to the population.

Show question

Question

What does an 83% confidence interval indicate?

Show answer

Answer

An 83% confidence interval indicates that researchers can be 83% confident that the sample consists of the mean population. If the sampling method were repeated multiple times, 83% of the intervals analysed would represent the population mean.

Show question

Question

Give an example of an alternative hypothesis.

Show answer

Answer

There will be a significant difference between patients who received drug therapy treatment and those randomly assigned to the placebo group.

Show question

Question

Give an example of a null hypothesis.

Show answer

Answer

There will be no observed difference between the day of an exam and time spent studying.

Show question

Question

Why do researchers need to form a null hypothesis when carrying out the hypothesis test inferential analysis?

Show answer

Answer

To identify if there is a relationship between the variables. If the null hypothesis is accepted, then results are likely due to chance.

Show question

Question

After carrying out hypothesis testing, a significance level of .07 was indicated. Should the researchers accept or reject the null hypothesis?

Show answer

Answer

The researchers should accept the null hypothesis and reject the alternative hypothesis. 

Show question

Question

Define sample errors.

Show answer

Answer

Sampling errors are the expected difference between the sample and the general population, as obtaining a truly representative sample is challenging.

Show question

Question

Why does hypothesis testing take into account sampling errors?

Show answer

Answer

To inhibit errors of accepting or rejecting the hypothesis. i.e., decrease the likelihood of type 1 and type 2 errors occurring.

Show question

Question

What are descriptive statistics?

Show answer

Answer

Descriptive statistics are a form of statistical analysis that is utilised to provide a summary of a dataset. These can be summaries of samples, variables or results.


Show question

Question

What are the benefits of measuring descriptive statistics?

Show answer

Answer

These can be beneficial as they provide researchers with information about potential relationships between variables and statistical tests that could be appropriate to test the hypotheses proposed.


Show question

Question

Where can you find data concerning the N of males and females in a sample?


Show answer

Answer

Frequency table

Show question

Question

What statistical information do tests measuring central tendency tell us?


Show answer

Answer

They give a single value that summarises an average representing the entire dataset.

Show question

Question

Here is an example dataset, calculate the mean, median and mode: 2, 7, 5, 3, 9, 12, 3


Show answer

Answer

Mean - 5.86 (2 d.p), Median - 5, Mode - 3


Show question

Question

Which is the most commonly reported central tendency measurement and how is it reported?


Show answer

Answer

Mean (M = x).


Show question

Question

What are the statistics used to measure variability/dispersion?


Show answer

Answer

Range, interquartile range, standard deviation and variance. 


Show question

Question

How is the interquartile range calculated?


Show answer

Answer

The interquartile range is calculated by subtracting the difference between the median value in the first half and second half of a dataset.  

Show question

Question

A study recruited 10 participants, and the descriptive analysis indicated the mean as 22.8 and the standard deviation as 8.12. How would this correctly be reported in psychology research? 

Show answer

Answer

'There were a total of 10 participants recruited for this study (M = 22.8 & SD = 8.12)'.

Show question

Question

What are percentiles?


Show answer

Answer

Percentiles are when data is split into 100ths and data points are observed within the different sections of the percentiles. For instance, if you are trying to identify the data point at 36%, then the values would be placed in ascending order and the value that is representative of 36% of the data would be identified.


Show question

Question

What tests can researchers carry out to identify if parametric tests can be used?


Show answer

Answer

Researchers can identify if parametric tests can be used for statistical analysis if a normally distributed chart is plotted. For instance, if the bell curve is not skewed and if q-q plots show data to be normally distributed. 

Show question

Question

What is the purpose of inferential statistics?


Show answer

Answer

The purpose of inferential statistics is to identify if a sample or procedure used is appropriate to generalise to the general population.

Show question

Question

What are the principles of hypothesis testing?


Show answer

Answer

Hypothesis testing requires researchers to formulate a null and alternative hypothesis. The null hypothesis is then tested using an appropriate statistical test and if found to be significant then the null hypothesis can be accepted. This means that the results are likely due to chance or confounding variables rather than the intended independent variable. From these findings, it can be inferred that results observed from research are inappropriate to be generalised to the population.

Show question

Question

When is it appropriate to use non-parametric tests? 

Show answer

Answer

  • When data is nominal, data is nominal when assigned to groups; these groups are distinct and have limited similarities (e.g. responses to ‘What is your ethnicity?’)
  • When data is ordinal, that is when data has a set order or scale (e.g. ‘Rate your anger from 1-10’.)
  • When there are outliers identified in the data set
  • When the data is collected from a small sample

Show question

Question

What is the criterion of non-parametric tests?


Show answer

Answer

The following criterion is required for non-parametric tests: 

  • At least one violation of parametric tests assumptions,
  • Non-normally distributed data
  • Data is random (taken from random sample)
  • Data values ​​are independent from one another (no correlation between data collected from each participant)

Show question

Question

Why does data need to be ranked prior to carrying out non-parametric data analysis?

Show answer

Answer

Data needs to be ranked prior to statistical analysis as these ranked values are used as data points for the analysis rather than the raw values obtained from the experiment/observation. 


Show question

Question

What is the 'reference value'?


Show answer

Answer

The reference value is where the researchers predict/hypothesise where the median value is expected to fall.


Show question

Question

Rank the following data values and assign them with the correct sign. 

Researchers hypothesised that the reference value would be 13. The dataset is 3, 5, 3, 19, 16, 21, and 14. 


Show answer

Answer

-3, -3, -5, +14, +16, +19, +21.

Show question

Question

What do '+' and '-' signs mean in terms of ranking data for non-parametric analysis?

Show answer

Answer

Data is assigned as '+' if it is greater than the reference value and ‘-’ if it is lower than the reference value. 


Show question

Question

What are some of the most common non-parametric tests?

Show answer

Answer

Standard non-parametric tests include the Wilcoxon Rank sum Test, Mann-Whitney U test, Spearman correlation, Kruskal Wallis test and Friedman's ANOVA test.

Show question

Question

Researchers are trying to identify what would be an appropriate statistical analysis to run to identify the difference in average fitness test scores of participants during the morning, afternoon, and evening. The researchers identified that their data was skewed, and there were a few extreme outliers. 


Which test should they run? 


Show answer

Answer

The appropriate analysis test to use would be the Friedman's ANOVA test, as the data can be assumed to be non-normally distributed. The study used a within-subjects design and the analysis can help identify the difference in average scores between the morning, afternoon, and evening by comparing the ranked median values.

Show question

More about Data Handling and Analysis
60%

of the users don't pass the Data Handling and Analysis quiz! Will you pass the quiz?

Start Quiz

How would you like to learn this content?

Creating flashcards
Studying with content from your peer
Taking a short quiz

How would you like to learn this content?

Creating flashcards
Studying with content from your peer
Taking a short quiz

Free psychology cheat sheet!

Everything you need to know on . A perfect summary so you can easily remember everything.

Access cheat sheet

Discover the right content for your subjects

No need to cheat if you have everything you need to succeed! Packed into one app!

Study Plan

Be perfectly prepared on time with an individual plan.

Quizzes

Test your knowledge with gamified quizzes.

Flashcards

Create and find flashcards in record time.

Notes

Create beautiful notes faster than ever before.

Study Sets

Have all your study materials in one place.

Documents

Upload unlimited documents and save them online.

Study Analytics

Identify your study strength and weaknesses.

Weekly Goals

Set individual study goals and earn points reaching them.

Smart Reminders

Stop procrastinating with our study reminders.

Rewards

Earn points, unlock badges and level up while studying.

Magic Marker

Create flashcards in notes completely automatically.

Smart Formatting

Create the most beautiful study materials using our templates.

Sign up to highlight and take notes. It’s 100% free.

Start learning with StudySmarter, the only learning app you need.

Sign up now for free
Illustration