0% found this document useful (0 votes)

66 views

Time Series Analysis With Arima Model: Business Analytics Project (Group 5)

The document discusses using an ARIMA model for time series analysis and forecasting of rainfall data from Kerala, India. It describes the methodology used, which includes checking for stationarity, identifying the p and q values using ACF and PACF plots, estimating the best ARIMA model, and performing residual diagnostics. The main challenges were the team's lack of analytical experience and needing to learn time series concepts and tools. SPSS was used to process the data and develop an ARIMA model that adequately fit the data based on the autocorrelation of residuals being less than 1.25.

Uploaded by

PANDYA DHRUMILKUMAR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

66 views

Time Series Analysis With Arima Model: Business Analytics Project (Group 5)

Uploaded by

PANDYA DHRUMILKUMAR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

TIME SERIES ANALYSIS WITH

ARIMA MODEL
Business Analytics Project (Group 5)
Table of Contents

Sr no Topic Page no
1 Introduction 2
2 Methodology 2
3 Theory 3
4 Challenges 3
5 Process 4
6 Conclusion 11
7 Reference 11

1|Page
Introduction:
Today, Analytics has taken the entire world over. It has been extensively used in
every Industry to identify its weak points and extract the maximum out of the company’s
available resources. The margin for error has reduced drastically and thus data analytics has
become an integral part of any Industry.
Time series analysis is one of the concept under the big umbrella of data analytics
used for analysing time series data in order to extract meaningful statistics and other
characteristics of the data. Time series forecasting is the use of a model to predict future
values based on previously observed values. Whether we wish to predict the trend in
financial markets or electricity consumption, time is an important factor that must now be
considered in our models. For example, it would be interesting to not only know that a stock
will move up in price, but also when it will move up.

Data Set:
The dataset used in the modelling is rainfall data in Kerala from Kaggle.
Link: https://www.kaggle.com/rajanand/rainfall-in-india
This dataset is released by Indian Meteorological Department (IMD) Govt. of India
under Govt. Open source license- India. It contains monthly rainfall data from 36
meteorological subdivisions in India. The time period is from 1901 to 2015 and the unit of
measurement of rainfall is mm.

Tool Used: SPSS

SPSS (Statistical Package for the Social Sciences) was first launched in 1968. It was
acquired by IBM in 2009 and thereafter it is officially known as IBM SPSS Statistics. SPSS is
a software used for analysing all types of data. It can open all file formats commonly used for
structured data like MS Excel, text file (.txt .csv) , SQL, Stata and SAS etc.

Methodology:
The methodology used for estimating and forecasting the univariate series of rainfall
data is B-J method (Box-Jenkins). The model adopted for the case is ARIMA (Auto
Regressive Integrated Moving Average). The flow of the entire process is as follows:
1. Check for White noise
2. Check if the data series is stationary, if not, use differential method to transform it into
stationary data series
3. Identification of p and q values for Autoregressive and Moving average model using
the Autocorrelation function (ACF) and Partial Autocorrelation function (PACF).
4. Estimation of appropriate model
5. Performing residual Diagnostics

2|Page
Theory:
Trend and prediction of time series can be computed by using ARIMA model. ARIMA
(p,d,q) model is a complex linear model. This acronym is descriptive, capturing the key
aspects of the model itself. Briefly, they are:
AR: Auto-regression- A model that uses the dependent relationship between an observation
and some number of lagged observations.
I: Integrated- The use of differencing of raw observations (i.e. subtracting an observation
from an observation at the previous time step) in order to make the time series stationary.
MA: Moving Average- A model that uses the dependency between an observation
and residual errors from a moving average model applied to lagged observations.
Each of these components are explicitly specified in the model as a parameter.
A standard notation is used of ARIMA (p,d,q) where the parameters are substituted with
integer values to quickly indicate the specific ARIMA model being used.
The parameters of the ARIMA model are defined as follows:
p: The number of lag observations included in the model, also called the lag order.
d: The number of times that the raw observations are differenced, also called the degree of
differencing.
q: The size of the moving average window, also called the order of moving average.

Challenges:
The biggest challenge that the team faced was that none of the members had
any analytical background nor an experience with any of the analytical tools. Thus
the team had to start right from scratch. We divided the entire project into 3 phases:
The first phase was to understand the concept of Time-series analysis. We referred
to various research papers, online tutorials to understand time series analysis.
The second phase included understanding different tools and finalising one. After
exploring R, SPSS, Tableau and Python, the team decided to use SPSS based on
ease of use, its familiarity with excel and convenience.
The third phase was execution where the dataset was processed using ARIMA in
SPSS and model was developed.

3|Page
A. Diagnosis:

The idea of diagnostic checking is to look for evidence that the model is not a
good fit for the data. The tool used to check in this case is Residual error. A
review of the distribution of errors can help tease out bias in the model. The
errors from an ideal model would resemble white noise that is a Gaussian
distribution with a mean of zero and a symmetrical variance.

The diagonal residue is found out by Autocorrelation function of the ‘Error’

column in the data set that was obtained during processing of the data set.
The resultant is some tables and graphs. The autocorrelation table as shown
below is of importance.

Autocorrelations

Series: Error for JAN from ARIMA, MOD_1, CON

Box-Ljung Statistic

Lag Autocorrelation Std. Errora Value df Sig.b

1 -.076 .094 .670 1 .413

2 -.160 .094 3.684 2 .159

3 -.276 .097 12.764 3 .005

4 -.020 .103 12.814 4 .012

5 .113 .103 14.368 5 .013

6 -.072 .104 15.006 6 .020

7 -.099 .105 16.226 7 .023

8 -.030 .106 16.336 8 .038

9 .249 .106 24.170 9 .004

10 -.080 .111 24.991 10 .005

11 -.036 .111 25.160 11 .009

12 -.153 .111 28.211 12 .005

13 .081 .113 29.067 13 .006

14 .123 .114 31.073 14 .005

15 .059 .115 31.538 15 .007

16 -.077 .115 32.332 16 .009

4|Page
To check whether the model is ok and does not contain any noise, we perform
Autocorrelation/Std error value for lag 1. If the value is less than 1.25, then the
selected model is ok.

In our case the value is 0.076/0.094 = 0.8085. Thus, the value is less than 1.25
and thus the model estimated is a correct one. The ACN and PACF graphs
obtained are

5|Page
Conclusion:
By working on this project, the team studied the basics of Time series
analysis with ARIMA model. Certain pre-requisites, rules and checks that needs
to be performed before developing model for the data series were understood.
The project was performed on SPSS platform. The Data, Output and Syntax
windows were explored for the activity.

Reference:
 https://people.duke.edu/~rnau/411arim3.htm
 https://www.youtube.com/watch?v=_7jivAiwZGw
 https://www.youtube.com/watch?v=erlRfau80PM
 https://www.youtube.com/watch?v=NoAUprEguoY

6|Page
 https://ncss-wpengine.netdna-ssl.com/wp-
content/themes/ncss/pdf/Procedures/NCSS/The_Box-Jenkins_Method.pdf
 https://www.analyticsvidhya.com/blog/2015/12/complete-tutorial-time-series-modeling/
 https://arxiv.org/ftp/arxiv/papers/1302/1302.6613.pdf

7|Page

Machine Learning for Time Series Forecasting with Python
From Everand
Machine Learning for Time Series Forecasting with Python
Francesca Lazzeri
4/5 (2)
Arima 1b
No ratings yet
Arima 1b
6 pages
Time Series Analysis and Forecasting Using ARIMA Modeling, Neural Network and Hybrid Model Using ELM
No ratings yet
Time Series Analysis and Forecasting Using ARIMA Modeling, Neural Network and Hybrid Model Using ELM
14 pages
Statistical Methods Unit 5 Presentation
No ratings yet
Statistical Methods Unit 5 Presentation
19 pages
Arima Modeling With R Listendata
No ratings yet
Arima Modeling With R Listendata
12 pages
ARIMA Modelling and Forecasting: by Shipra Mishra Intern
No ratings yet
ARIMA Modelling and Forecasting: by Shipra Mishra Intern
17 pages
ARIMAKASYOKI
No ratings yet
ARIMAKASYOKI
5 pages
Module 5 PDF
No ratings yet
Module 5 PDF
23 pages
Module 3.1 Time Series Forecasting ARIMA Model
No ratings yet
Module 3.1 Time Series Forecasting ARIMA Model
19 pages
08 ASAP TimeSeriesForcasting - Day 8-11
No ratings yet
08 ASAP TimeSeriesForcasting - Day 8-11
62 pages
Box-Jenkins Method: Time Series Analysis: Forecasting and Control
100% (1)
Box-Jenkins Method: Time Series Analysis: Forecasting and Control
4 pages
Time Series Methods_ Arima
No ratings yet
Time Series Methods_ Arima
11 pages
Class Notes
No ratings yet
Class Notes
6 pages
07 Time_Series_Analysis_with_R_Ranjeet Paul-
No ratings yet
07 Time_Series_Analysis_with_R_Ranjeet Paul-
10 pages
ARIMA
No ratings yet
ARIMA
3 pages
Wipro
No ratings yet
Wipro
21 pages
Time-Series Modelling
No ratings yet
Time-Series Modelling
55 pages
Sarima Group 11
No ratings yet
Sarima Group 11
21 pages
Time Analysis in Statistics Presentation
No ratings yet
Time Analysis in Statistics Presentation
16 pages
40_10566_434149
No ratings yet
40_10566_434149
30 pages
From News to Forecast
No ratings yet
From News to Forecast
18 pages
End Term Project (BA)
No ratings yet
End Term Project (BA)
19 pages
Time Series Analysis in R A Beginner's Guide
No ratings yet
Time Series Analysis in R A Beginner's Guide
13 pages
321
No ratings yet
321
10 pages
Arima Word
No ratings yet
Arima Word
13 pages
Notes
No ratings yet
Notes
37 pages
UNIT 5 Time Series Analysis
No ratings yet
UNIT 5 Time Series Analysis
17 pages
Business analytis C4
No ratings yet
Business analytis C4
10 pages
Time Series Analysis Thesis
100% (2)
Time Series Analysis Thesis
4 pages
Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
No ratings yet
Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
14 pages
Intro To Time Series
No ratings yet
Intro To Time Series
85 pages
Time Series Lecture Notes-Ch-5
No ratings yet
Time Series Lecture Notes-Ch-5
27 pages
Part Ii - Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
No ratings yet
Part Ii - Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
14 pages
Time Series Forecasting
No ratings yet
Time Series Forecasting
7 pages
Introduction to Time Series
No ratings yet
Introduction to Time Series
6 pages
Be A 65 Ads Exp 8
No ratings yet
Be A 65 Ads Exp 8
10 pages
Arima Modelling by Ankit Bhandari
No ratings yet
Arima Modelling by Ankit Bhandari
6 pages
Arima
No ratings yet
Arima
12 pages
Autoregressive Integrated Moving Average
No ratings yet
Autoregressive Integrated Moving Average
10 pages
Time - Series - in - Brief
No ratings yet
Time - Series - in - Brief
11 pages
Finance Workshop Honda
No ratings yet
Finance Workshop Honda
30 pages
ARIMA (With Seasonal Index) +SARIMA
No ratings yet
ARIMA (With Seasonal Index) +SARIMA
5 pages
Effective Stock Price Prediction Using Time Series Forecasting
No ratings yet
Effective Stock Price Prediction Using Time Series Forecasting
5 pages
Conf1 Ieee Icaesm
No ratings yet
Conf1 Ieee Icaesm
5 pages
Vinay Presentation
No ratings yet
Vinay Presentation
18 pages
Time Series Methods-ARIMA
No ratings yet
Time Series Methods-ARIMA
2 pages
Resumos Forecasting
No ratings yet
Resumos Forecasting
17 pages
Arima Notes
No ratings yet
Arima Notes
4 pages
Assignment
No ratings yet
Assignment
3 pages
Arima Time Series Stock Prediction
No ratings yet
Arima Time Series Stock Prediction
23 pages
Autoregressive Integrated Moving Average Models (Arima)
No ratings yet
Autoregressive Integrated Moving Average Models (Arima)
2 pages
Effective Stock Price Prediction Using Time Series Forecasting
No ratings yet
Effective Stock Price Prediction Using Time Series Forecasting
5 pages
Unit III Time Series Analysis Lesson 6
No ratings yet
Unit III Time Series Analysis Lesson 6
22 pages
ARIMA Model Python Example - Time Series Forecasting
No ratings yet
ARIMA Model Python Example - Time Series Forecasting
11 pages
Time Series
No ratings yet
Time Series
45 pages
Time Series Models. AR, MA, ARMA, ARIMA _ by Charanraj Shetty _ Towards Data Science
No ratings yet
Time Series Models. AR, MA, ARMA, ARIMA _ by Charanraj Shetty _ Towards Data Science
3 pages
Time Series Analysis With The KNIME Analytics Platform
No ratings yet
Time Series Analysis With The KNIME Analytics Platform
35 pages
Introduction to Time Series Analysis
From Everand
Introduction to Time Series Analysis
Vikas Rathi
No ratings yet
Notes in Operations Research
From Everand
Notes in Operations Research
Rahul Basu
5/5 (1)
Applied Predictive Analytics: Principles and Techniques for the Professional Data Analyst
From Everand
Applied Predictive Analytics: Principles and Techniques for the Professional Data Analyst
Dean Abbott
No ratings yet
HRM Group Project Guidelines 2019-2020
No ratings yet
HRM Group Project Guidelines 2019-2020
2 pages
Financial Statement Analysis
No ratings yet
Financial Statement Analysis
3 pages
Weekly Report Format
No ratings yet
Weekly Report Format
1 page
Strategic Aspiration Using Blue Ocean Strategy Outlook
No ratings yet
Strategic Aspiration Using Blue Ocean Strategy Outlook
2 pages
Bottlenecks
No ratings yet
Bottlenecks
2 pages
C++ Notes Unit 5
No ratings yet
C++ Notes Unit 5
14 pages
Sdtechandeducation - In-Emerging Trends in Computer and Information Technology Practice MCQ Question Amp Answer
No ratings yet
Sdtechandeducation - In-Emerging Trends in Computer and Information Technology Practice MCQ Question Amp Answer
16 pages
Colors CMYK
No ratings yet
Colors CMYK
1 page
Windowsserver
No ratings yet
Windowsserver
7 pages
Oc View 7 Manual
No ratings yet
Oc View 7 Manual
53 pages
Geospatial Analysis With SQL: A Hands-On Guide To Performing Geospatial Analysis
No ratings yet
Geospatial Analysis With SQL: A Hands-On Guide To Performing Geospatial Analysis
184 pages
Security and Privacy of Electronic Health Records: Concerns and Challenges
No ratings yet
Security and Privacy of Electronic Health Records: Concerns and Challenges
7 pages
Studio One 7 - Release Notes
No ratings yet
Studio One 7 - Release Notes
6 pages
SubbaChary SQL DBA 4 Yrs
No ratings yet
SubbaChary SQL DBA 4 Yrs
4 pages
Advanced Tester Guide
No ratings yet
Advanced Tester Guide
8 pages
The Binary File Descriptor Library
No ratings yet
The Binary File Descriptor Library
252 pages
Assignment 5
No ratings yet
Assignment 5
9 pages
S - Hadoop Ecosystem
No ratings yet
S - Hadoop Ecosystem
14 pages
Top 200 Iot Projects For Engineering Student
No ratings yet
Top 200 Iot Projects For Engineering Student
38 pages
Evasive - Threats For Malware
No ratings yet
Evasive - Threats For Malware
8 pages
Cse
No ratings yet
Cse
120 pages
Os Notes
No ratings yet
Os Notes
67 pages
Pentest Report Template - Format ISO 27001
No ratings yet
Pentest Report Template - Format ISO 27001
8 pages
MEP Planning Manual
No ratings yet
MEP Planning Manual
118 pages
Installation Media For An SAP HANA SPS
No ratings yet
Installation Media For An SAP HANA SPS
18 pages
Design Process in Refrigerator
No ratings yet
Design Process in Refrigerator
8 pages
Installation Guide of Pos Module
No ratings yet
Installation Guide of Pos Module
9 pages
Powered by Google
No ratings yet
Powered by Google
2 pages
AP2600(N)_AP2610(N)_Sm
No ratings yet
AP2600(N)_AP2610(N)_Sm
335 pages
mad ex1
No ratings yet
mad ex1
6 pages
Sales Order Processing Exercises: 1.1.1. Exercise
No ratings yet
Sales Order Processing Exercises: 1.1.1. Exercise
10 pages
Manual Do Teclado Sensomatic American Dynamics
No ratings yet
Manual Do Teclado Sensomatic American Dynamics
36 pages
Electronic Logbook
No ratings yet
Electronic Logbook
65 pages
Database Management System Using Libreoffice Base: Ntroduction
No ratings yet
Database Management System Using Libreoffice Base: Ntroduction
49 pages
Unix Programming Module 2
No ratings yet
Unix Programming Module 2
82 pages

Uploaded by

Uploaded by

TIME SERIES ANALYSIS WITH

Tool Used: SPSS

The diagonal residue is found out by Autocorrelation function of the ‘Error’

Series: Error for JAN from ARIMA, MOD_1, CON

Lag Autocorrelation Std. Errora Value df Sig.b

1 -.076 .094 .670 1 .413

2 -.160 .094 3.684 2 .159

3 -.276 .097 12.764 3 .005

4 -.020 .103 12.814 4 .012

5 .113 .103 14.368 5 .013

6 -.072 .104 15.006 6 .020

7 -.099 .105 16.226 7 .023

8 -.030 .106 16.336 8 .038

9 .249 .106 24.170 9 .004

10 -.080 .111 24.991 10 .005

11 -.036 .111 25.160 11 .009

12 -.153 .111 28.211 12 .005

13 .081 .113 29.067 13 .006

14 .123 .114 31.073 14 .005

15 .059 .115 31.538 15 .007

16 -.077 .115 32.332 16 .009

You might also like