Big Data To Improve Strategic-3
Big Data To Improve Strategic-3
Figure 4.5 – Current usage of data types across case study group129
There is no common data type that is used across all airlines. Flight sched-
ules, operating cost, and traffic rights are the most widely used data types
(7 out of 9 airlines). Rather innovative data sources, such as clickstream
data from booking websites and real-time data feeds from global distribu-
tion systems, are the least used data types, corresponding to the initial
hypothesis that NP departments rely mostly on traditional sources for NP.
Data on meteorological events is only used by two NP departments, since
129
Source: Own illustration.
178 Status quo of strategic network planning in airlines
In the second part of the structured survey, participants were asked to rate
the reliability and usefulness of the data types for their specific NP process.
While reliability was only ranked by the airlines that are currently using the
data type or have used it in the past five years, the usefulness was evalu-
ated by all participants. The participants could rate the data types on a four-
point scale130, from very useful/reliable to not useful/reliable. Figure 4.6
displays the average survey results.
The average reliability score across all 22 data types is 3.23, with most
evaluations within the 75% quantile. This observation indicates that airline
network planners are generally satisfied with the reliability level of the cur-
rently used data sources. The most reliable data type is flight event data
from actually flown flights with an average score of 4.0 that is also the max-
imum score. Three data types – flight schedules, airport characteristics,
and aircraft specifications – are also perceived as highly reliable with an
average score of 3.7. Clickstream data from booking websites is perceived
the least reliable with a score of 2.4, followed by ready-made market fore-
casts (2.6) and political event forecasting (2.7).
Average usefulness across all data types was evaluated at 3.30, which is
slightly higher than the average reliability. This indicates that the currently
used data sources provide satisfactory information. Operating cost are
deemed most useful with an average score of 3.9, followed by real-time
booking curves (3.8), airport slot allocation (3.7), and flight schedules (3.7).
130
Compare section 2.3.3: scale following the very positive – slightly positive – slightly nega-
tive – very negative logic, as suggested by Berg (2004).
Data types used for network planning 179
Figure 4.6 – Perceived usefulness and reliability of the 22 currently used data types131
The difference in scores for usefulness and reliability allow for a clustering
of data types. When plotted on a matrix graph (Figure 4.7) with reliability
along the X-axis and usefulness along the Y-axis, four generic clusters
could be defined. If a data type is both useful and reliable (value over 3.0),
131
Source: Own illustration.
180 Status quo of strategic network planning in airlines
it should be maintained for fully meeting its purpose. These are the data
essentials for airline NP, comprising exactly half of all data types132.
Figure 4.7 – Usefulness and reliability analysis of the currently used data types133
On the contrary, if a data type is neither useful nor reliable, it does not add
any value to the NP process. It can even destroy value through bad deci-
sions. Two data types fall in this cluster: political event forecasting and me-
teorological event forecasting. If airlines are not able to improve either the
reliability or usefulness of these data sources, they should eliminate such
hands-off data.
132
Demographic data, real-time booking curves, real-time booking data from global distribu-
tion systems (GDS), oil price forecasts, MIDT data, fare data, airport traffic statistics, airport
slot allocation, real-time slot availability, traffic rights, including overflight rights, aircraft spec-
ifications.
133
Source: Own illustration.
IT tools used for network planning 181
Data types that are reliable but not useful can be retained if they do not
incur additional cost; these are nice-to-have data. Operating cost and air-
port characteristics are the only ones in this category. Operating cost infor-
mation is usually available from internal sources, and airport characteris-
tics are procured for flight operations in any case.
The last cluster are data types that are very useful but not “yet” reliable.
These “unfulfilled data needs” include seven data types, namely ready-
made market forecasts, economic forecasts, clickstream data from book-
ing websites, airport and traffic fees, flight schedules, flight event data on
actually flown flights, and competitor-related event forecasting.
This clustering is the first step towards identifying BDOs for airline NP. In
principle, big data can generate value in three different ways. First, it can
improve the reliability of currently used data types by shedding light on
“blind spots” or improving the validity of a data type. Second, it can improve
the usefulness of a current data type by allowing for additional analyses
that extract the right information for airline network planners. Finally, big
data can create entirely new data types that complement the currently used
set of data types. Clustering provides a starting point for the first two value
generation mechanisms, as it highlights the improvement potential of the
currently used data types. Sub-chapter 5.2 discusses the findings of this
section to develop potential BDOs.
The data used for NP has to be processed and analyzed to generate useful
information for NP staff. This sub-chapter analyzes the current market of
IT tools for airline NP (section 4.3.1) and presents the practical use of these
systems by the airlines of the case study group (section 4.3.2). The last
section 4.3.3 draws on additional interviews with an IT system provider as
well as on publicly available information in order to map out the data capa-
bilities of existing NP IT solutions.
182 Status quo of strategic network planning in airlines
Figure 4.8 – Market overview over commercially available IT systems for airline network
planning134
134
List derived from Airline Software (2017); specific information derived from solution web-
sites: AIMS (2017); APM Technologies (2017); AVES – Flight Operations Management Sys-
tems (2017); Aviation Research Technologies (2017); Aviolinx (2017); HITIT (2017); ICF
IT tools used for network planning 183
System developer firms can also be distinguished in three groups. The first
one includes large aviation system developers, covering a wide range of
business processes. Sabre, Lufthansa Systems, and APM Technologies
belong to this group. Boutique system developers that focus only on NP
and flight operations are the second group, which includes Optym, Lift,
(2017); L.E.K. Consulting (2017); Lift – airline planning solutions (2017); Lufthansa Systems
(2017); Optym (2017); PDC Aviation (2017); Sabre (2017); Seabury Consulting (2017)
184 Status quo of strategic network planning in airlines
PDC Aviation, Aviolinx, AVES, and AIMS. The third group are consultan-
cies that have started to market their proprietary decision-support tools.
L.E.K, Seabury (now part of Accenture), ICF, and Aviation Research Tech-
nologies are part of this last group.
It is remarkable that all the available solutions are provided by system de-
veloping companies with a specific focus on aviation. Large commercial
software development companies such as Infosys, SAP, Oracle, or IBM do
not offer any NP solutions for airlines. This further supports the hypothesis
that airline NP is highly specialized and interlinked with other airline busi-
ness processes.
Despite the diverse offer of IT solutions for NP, the participating case study
airlines utilize solutions from only three providers. The NetLine suite devel-
oped by Lufthansa Systems is the most widely used NP system in the case
study group. The NetLine/Sched application that covers the tactical plan-
ning process is used by five airlines, the strategic NetLine/Plan by one air-
line. AIMS and Sabre solutions are each used by two airlines. LCC 1 does
not use any airline NP solution, relying solely on Microsoft Excel. Eight out
of the nine case study airlines use Microsoft Excel as an important tool,
making it the most important unspecific IT tool.
Furthermore, two airlines built their own market forecasting models, as they
deemed existing ready-made market forecasts to be unreliable (compare
previous section). One cargo airline employs a cargo-specific revenue
management system (CHAMP) to inform NP decisions. Figure 4.9 sum-
marizes the currently used IT systems.
IT tools used for network planning 185
Figure 4.9 – IT systems currently used by the case study participants for network plan-
ning135
The reliance on Microsoft Excel and IT solutions for tactical NP has also
been reflected in the current degree of process automation. While demand
forecasting is partly automated in the NP process, strategic NP mostly re-
mains a manual process with little IT support. Standardized business case
calculations are136 the only noteworthy automation, even in large airlines,
before the flight scheduling process step. While FSC 1 and FSC 2 have
started to automate the first steps beyond business plan creation, all other
airlines rely on a fully manual strategic NP process.
With the exception of LCC 1 and LCC 2, the tactical NP process is more
automated than the strategic planning process. In most airlines, the stand-
ard AIMS and NetLine functionality is used. FSC 1 and CAR 1 are currently
working towards a complete automation of the tactical planning process.
However, the extreme complexity of airline NP requires non-exact optimi-
135
Source: Own illustration.
136
With a fleet size of more than 100 aircraft.
186 Status quo of strategic network planning in airlines
137
Source: Own illustration.
IT tools used for network planning 187
information (right column of Figure 4.8), the commonalities among the dif-
ferent IT solutions are evident at the onset. Most IT systems rely on existing
schedules, including revenue and QSI information and an O&D level fore-
cast of expected demand. Airline internal data, such as operating cost and
past booking data, as well as strategic assumptions on market and demand
development can be handled by most systems. Not a single IT solution for
NP claims to support real-time data, such as live bookings or search-en-
gine data.
Besides the mandatory data types, there are additional optional data types
that can be processed by the NP IT system. Real-time internal booking
data and external GDS data can be incorporated in addition to airport traffic
statistics and traffic rights. Data from booking platforms (such as Skyscan-
ner) is currently in a pilot testing phase.
Figure 4.11 – Exemplary integration of the currently used data sources in airline net-
work planning IT solution139
138
Among these, 66% use demographic data, 56% use real-time slot information, and 78%
use at least one external shock data type.
139
Source: Own illustration.
Performance measurement of airline network planning 189
Airline NP departments can be charged with revenue, cost, and profit re-
sponsibilities (see section 3.2.4). In our case study group, five airlines as-
signed profit responsibility to their NP department, while three airlines
opted for cost responsibility and one chose revenue responsibility for this
(see Figure 4.12).
The targets of an organizational unit should align with its role and respon-
sibilities. If a NP department is considered a profit center, the primary target
should ideally be profit maximization. Similarly, a cost center should strive
to minimize cost and a revenue center should maximize revenue. Across
our case study group, only seven airlines have aligned the stated primary
target of NP with the assigned role for the department.
All profit centers are measured against aggregate route profitability (which
excludes network effects) or network profitability. Here, the targets are
aligned with NP departments’ role as profit centers. In CAR 3, the sole
190 Status quo of strategic network planning in airlines
LCC 1 assigns a cost center role to its NP department but ranks aggre-
gated route profitability as most important target. Although secondary NP
targets, such as aircraft utilization and operating cost minimization, are
aligned with the departments’ cost center role, the mismatch of role and
primary target may result in difficulties in prioritizing big data investments
for NP. SCA 2 exhibits an even greater mismatch in this regard. Despite
assigning a joint revenue responsibility to its NP department, none of the
140
Source: Own illustration.
Performance measurement of airline network planning 191
In summary, the targets in airlines, barring a few, are well aligned with its
role and responsibilities. However, SCA 2 and LCC 1 present misalign-
ments that can potentially complicate the benefit analysis pertaining to big
data investments.
Direct KPIs squarely contribute to the profit and loss statement of an airline
and are thus global performance measures within these organization. Air-
lines with a significant share of connecting traffic usually rely on network
141
Source: Own illustration following Sterzenbach et al. (2013).
192 Status quo of strategic network planning in airlines
profit calculations, whereas other airlines use aggregate route profit fig-
ures142. Revenues depend on the available seat kilometers, which is con-
sidered the unit measure of the total offered capacity controlled for route
length143, the average load factor, and the yield per seat kilometer. In sim-
ple words, revenues can be raised if more capacity is offered, a larger
share of capacity is sold, and the charged ticket price per kilometer in-
creases. Airline cost are typically segmented as direct and indirect operat-
ing cost and non-operating overhead cost (see section 3.4.4).
142
See section 3.4.4 for a detailed discussion on the differences between network profit and
route profit.
143
Take a flight with an Airbus A320 from Berlin to London as example: 180 seats are flown
950 kilometers, which results in 180 * 950 = 171,000 ASK for this specific flight.
Performance measurement of airline network planning 193
The most frequently used NP KPI among the case study group is net-
work/route profitability. It serves as the primary KPI for six NP departments
and as secondary KPI for one other (see Figure 4.14).
Figure 4.14 – Primary and secondary NP performance targets in case study airlines145
Asset utilization ranks second, being the primary target for two NP depart-
ments and secondary target in four airlines. The minimization of direct and
indirect operating cost is the primary KPI for CAR 3 and secondary target
for two more airlines. Forecast accuracy, FTE efficiency, schedule robust-
ness, and load factor are other subsidiary KPIs for NP in the airline case
study sample. Three potential KPIs are not employed at all, two of those
144
An FTE of 1.0 is equivalent to one full-time employee (40h per week).
145
Source: Own illustration.
194 Status quo of strategic network planning in airlines
being the revenue KPIs “ASK” and “yield” and another being the direct
overhead cost KPI. The fact that only one airline chooses a direct revenue
KPI for its NP department also validates the finding that NP departments
are considered profit or cost centers rather than revenue centers.
5 Big data opportunities for airline network planning
Air transport demand is a broad concept involving all relevant actors in the
market. Hence, airline network planners need information on various
groups of actors as well as on the entire market system. Especially early
in the NP process, long-term demand forecasts depend on a holistic per-
spective of the entire transportation market, including the air transport mar-
ket as a specific sub-market. Furthermore, relevant trends in travel and
transportation must be identified and evaluated. This results in two con-
crete information needs, the holistic transportation demand forecast and a
Demand Fleet Network Route Flight Flight Fleet Aircraft Crew Disruption
Information forecast planning selection frequency scheduling assignment routing assignment management
structure
objects
Trend identification
External
Early warnings for relevant incidents
shocks
146
Source: Own illustration.
Information needs for airline network planning 197
Besides the information on the entire market, airline network planners re-
quire information on its main actors, mainly competitors, customers, and
the airline itself. Information on competitors can be pooled as competitor
monitoring. Competitor monitoring focuses either on regular or irregular
operations. Information on regular competitor operations includes the
backward-looking analysis of actually flown flights, delays, traffic volume,
and reported yield. Forward-looking information consists mainly of pub-
lished schedules, real-time fare data, and news analysis of planned routes
or fleet decisions. The monitoring of regular operations is most relevant for
strategic NP, as it informs an airline’s route and schedule decisions. In
contrast, irregular operations monitoring aims to exploit the short-term
changes of a competitor’s operation. This includes strikes, flight cancella-
tions, and other unforeseen events that may justify the offering of additional
flights or changes in fleet assignment. Consequently, information on irreg-
ular competitor operations are most relevant for tactical NP.
and goods provide complete information on the real origin and the real
destination of a passenger or a cargo item. Ideally, airline network planners
should wish to know the exact address of both origin and destination for
every passenger and every cargo item. Furthermore, even the travel pat-
terns of non-airline customers are highly relevant for the identification of
potential routes that can compete with railway and individual traffic. The
real movements of passengers and goods are most relevant to the strate-
gic planning steps, from route selection to flight scheduling. Cargo airlines
especially benefit from the real-time location information of cargo items
when a delayed delivery requires short-term re-planning.
The information object “cost” is typically associated with the need for only
one type of information: the cost forecast. Cost forecasts are typically struc-
tured in line with the airline cost classification scheme (Niehaus et al.,
2009, p. 177). Because overhead costs are not in the remit of NP, the need
for information is focused on direct and indirect operating cost. Most air-
lines utilize internal cost calculations to satisfy this need for information.
Cost forecasts are relevant in all the strategic and tactical NP process
steps, as they are required for any profitability analysis.
Finally, external shocks create the need for early warnings for relevant in-
cidents. Airline network planners are primarily concerned with the political
and meteorological events that may have a permanent impact on the travel
demand or passengers’ capability to fly to a specific destination. Political
events include long strikes, travel restrictions, or conflicts that affect the
stability of a destination. Meteorological events are usually less relevant
for NP, since they usually cause only short-time interruptions. However,
major catastrophes such as floods, tsunamis, or tropical storms may influ-
ence travel demand in the long term. Thus, early warnings for relevant in-
cidents are relevant to both strategic and tactical NP, depending on the
impact and severity of the shock.
In this section, the currently used data sources are assigned to the infor-
mation needs derived in the previous section. Figure 5.2 visualizes the al-
location of currently used data sources to information needs. The useful-
ness and reliability classifications are included as the shaded areas of the
data sources.
147
Exemplary technical planning mistakes: aircraft too heavy for airport, night curfew not re-
spected, aircraft range not sufficient for stage length, slot or traffic rights unavailable.
200 Big data opportunities for airline network planning
External
Information Demand Cost Planning constraints shocks
object
Information Transpor- Trend Competitor Real move- Internal Cost Transpa- Real-time Early
need tation identifica- monitoring ments of booking forecasts rency on monitoring warnings
demand tion passengers situation planning of for
forecast and goods constraints planning relevant
constraint incidents
s
Currently MIDT data Cost data Political
used data
Information on unconstrained demand Airport data events
types
Market estimates Service Oil price Aircraft Aircraft
bench- forecasts data data
Economic data
marking
Demographic data data Traffic Real-time Meteoro-
rights slot logical
Real time GDS data
availability events
Published flight schedules Airline slot
portfolio
Real traffic data Competito
r-related
Fare data Internal booking data
events
Figure 5.2 - Allocation of currently used data sources to airline NP information needs148
148
Source: Own illustration.
Information needs for airline network planning 201
Thus, there are multiple big data opportunities to improve the need for in-
formation on transportation demand forecast. First, BDOs may improve the
reliability of economic forecasts, general market estimates, and uncon-
strained demand. Second, the data pool can be extended using data on
booking intentions to predict actual bookings better. Third, a suitable proxy
for real-time data on cargo demand is urgently required, as MIDT and GDS
data cover passenger bookings only.
Trend identification
Competitor monitoring
Most data sources provide only the air travel–specific origin and destina-
tion, which includes the first and the last airport on the itinerary. MIDT data
also contains the booking agent, but neither any personal address nor the
internet protocol (IP) address used while making the booking. If a booking
is made through the airline’s website, most airline can locate the IP ad-
dress, which indicated either the real origin or the real destination. How-
ever, no current data type can provide comprehensive real O&D infor-
mation. Cargo airlines usually have the real shipping O&D for individual
shipments, since it is a part of the air waybill. However, many freight for-
warders consolidate individual shipments and issue new air waybills for
such shipments. The shipping addresses are then the forwarder’s airport
facilities, which do not provide any further information on the real O&D of
the consolidated cargo. Both passenger and cargo airlines can use big
data opportunities to fill this gap and produce reliable information on the
real O&D of their customers.
Cost forecasts
Two of the three main data sources are highly reliable but of limited utility.
Both operating cost and airport fees are considered must-have factors by
airline network planners, since these need to be included in the route and
network profitability calculations. However, they are not in the remit of the
Information needs for airline network planning 203
All of the currently used data sources are highly reliable for airline NP.
However, many interview participants confessed that the data sources are
not connected, and the actual useful information has to be compiled man-
ually. Big data not only consists of new data sources but also a combination
of existing data sources. Therefore, there is great big data harmonization
and automation potential in planning constraint data. The harmonization
aspect is considered part of the real-time planning constraint solutions,
which are introduced subsequently.
149
ACL International covers UK, Canada, Ireland, Luxemburg, New Zealand, Poland, and the
United Arab Emirates. e-Airportslots covers Australia, France, Belgium, Germany, Nether-
lands, Philippines, Spain, Tunisia, and Mexico.
204 Big data opportunities for airline network planning
150
The current systems warn for specific meteorological events, such as a thunderstorm or a
hurricane. However, a generalized analysis, such as one on the occurrence of fog at a specific
airport, can indicate specific times of day during certain months with a high fog risk, thus
informing scheduling.
Development and evaluation of BDOs for airline NP 205
complement the big data types. The airline-specific big data types identi-
fied in section 3.6.5 can be classified under these categories (see Table
5.1).
Table 5.1 – Classification of airline big data types151
151
Source: Own illustration.
206 Big data opportunities for airline network planning
(Tuckett et al., 2014). Social media data can then be used in combination
with search engine data to determine the specific or aggregated consumer
price elasticity (Ghose, Ipeirotis, & Li, 2018). Travel-specific social media,
such as travel rating platforms or travel blogs, present valuable data on
transportation streams. Reviews and visitor volumes can indicate shifts in
travel trends (Bowen, 2015). Clickstream data152 from travel rating plat-
forms can also reveal travel patterns of consumer groups. The analysis of
specific events in social networks may yield a better prediction of one-time
travel flows, e.g., for large concerts or industry fairs.
The IoT provides multiple BDOs, especially for air cargo network planners.
The location transmitting sensors attached to individual shipments can re-
veal the real O&D and further indicate short-term delivery delays. Alterna-
tively, a cheaper RFID-chip can be attached to cargo carrying an improved
electronic air waybill. Flight tracking data can complement competitor mon-
itoring and fill gaps in MIDT data and published flight schedules. The in-
creasing number of weather-tracking devices can improve the usefulness
of long-term meteorological forecasts. Similarly, machine learning-based
image recognition tools can use geo-spatial images to predict the impact
of extreme meteorological events and even improve economic forecasting.
Aggregated and anonymized mobile location data can draw a seamless
picture of all travel movements, independent from the chosen transporta-
tion mode. Airline network planners can discern a real-time view on travel
behavior, market shares, and real O&Ds.
Furthermore, there are multiple BDOs using other web-based content. The
most prominent opportunity is conducting an analysis of search engine
data to predict future bookings. This can either be based on general search
engine data (e.g., Google, Baidu) or on specific clickstream data from
meta-search websites for flights (e.g., Kayak, Skyscanner). Another im-
portant data source is e-commerce platforms (e.g., Amazon, Alibaba).
152
A clickstream is the sequence of hyperlinks followed by a unique website visitor. The click-
stream is presented chronologically and allows retracing the search pattern of customers.
Development and evaluation of BDOs for airline NP 207
Data on cross-border orders can reveal trends in air cargo demand from
e-commerce; this is predicted to become a major air cargo growth driver in
the coming years (IATA, 2017b). Moreover, e-commerce pricing data can
be used to track inflation in real-time in order to better inform economic
forecasts (Cavallo & Rigobon, 2016).
Improved scraping153 technology enables the collection of information from
different web-based services. Airlines can analyze daily hotel prices to
identify micro-seasonality in specific destinations or identify a supply mis-
match between transportation and accommodation. Similarly, cargo air-
lines can collect data on industry fairs and related exhibitors in order to
predict cargo flows. Online forwarders quote public rates on the major ship-
ping lines, which can indicate the expected booking situations in a manner
similar to passenger airline fare intelligence. The scraping of unstructured
data in combination with natural language processing (NLP) analytics can
vastly improve news monitoring and political risk assessments.
Two of the BDOs identified do not fit the classification scheme presented
in Table 5.1. First, an integrated real-time slot trading platform can facilitate
short-term schedule adjustments and decrease the need to freeze the
schedule before the IATA slot conferences. A real-time traffic right infor-
mation platform can bundle all the relevant information on global traffic
rights and indicate potential violations, e.g., overusing an overfly contin-
gent. In total, 23 BDOs have been identified. Figure 5.3 schematizes the
BDOs by their primarily addressed information need154.
153
Scraping describes a technique in which a computer program extracts data from human-
readable output, coming from another program or website.
154
Please refer to Appendix 17 for a different view structured by the NP process steps.
208 Big data opportunities for airline network planning
Information needs
A B C D E F G H I
Real-time monitoring of
Transportation demand
planning constraints
planning constraints
Real movements of
Trend identification
Cost forecasts
situation
forecast
Big data
opportunities
Since many of the BDOs address more than one specific need for infor-
mation, the secondarily addressed information needs are also displayed in
155
Source: Own illustration.
Development and evaluation of BDOs for airline NP 209
light grey. In line with the argumentation presented in the previous sub-
chapter, no BDO is included for internal booking situation and cost fore-
casts. The transparency in planning constraints is addressed as a second-
ary need for information in real-time platforms.
The following sections, which are organized according to the addressed
information needs, describe each BDO in detail.Appendix 18 summarizes
the big data properties of each opportunity and indicates the potential data
providers and applicable BDA methods to generate useful information from
the data.
The analysis of geo-spatial images, which usually are captured using sat-
ellites but more recently also with drones, can help overcome all three
shortcomings of traditional growth forecasting. In the scientific community,
Henderson, Storeygard, and Weil (2012) pioneered the approach and
showed that the nighttime light intensity analysis is a better predictor for
economic growth in developing countries than traditional forecasts. Simi-
larly, Jean et al. (2016) have predicted poverty levels with this approach.
Keola, Andersson, and Hall (2015) have contributed land cover data as an
additional image-based indicator for economic forecasts. Gennaioli et al.
(2014) proved that geo-spatial image analysis generates robust economic
forecasts on sub-regional level.
156
The International Monetary Fund updated the World Economic Outlook quarterly, Word
Bank, and OECD every six months.
Development and evaluation of BDOs for airline NP 211
The primary use case for airline NP is the precise analysis of local eco-
nomic activity, which can improve both the reliability and usefulness of eco-
nomic forecasts. Furthermore, a geo-spatial image analysis can reveal the
travel and trade trends by counting vehicles in tourist destinations or ships
and trucks on trade lanes. If the frequency of imaging is further in-
creased157, image analysis can be used to monitor competitors by reveal-
ing the position of spare aircraft, maintenance activities, or cargo shipment
routing. Similarly, real O&D monitoring can become feasible, perhaps more
so for cargo than for passengers. Outsized cargo can be tracked from the
real origin to destination and reveal all the involved transportation provid-
ers. Several companies already use this technology for their early warning
systems for political risks and natural disasters, which can also become
relevant for airline NP (Komissarov, 2017).
High inflation rates directly influence the air transportation demand and in-
directly endanger revenues if tickets are sold in local currencies (Wadud,
2014). Inflation measurement and forecasts are often unreliable, especially
in countries with high inflation rates. In a sample of Latin American coun-
tries, Jalles (2017) found that inflation forecasts are highly volatile and not
dependable. Extreme cases, such as that of Venezuela, show that pro-
longed high levels of inflation significantly affect the demand for air travel,
as it lowers the purchasing power of the population (Echenique, 2018).
157
The current maximum data frequency is daily (Orbital Insight, 2018).
212 Big data opportunities for airline network planning
Furthermore, the official inflation statistics are not reliable in some coun-
tries.
Real-time inflation measuring and forecasting can improve both the use-
fulness and reliability of economic forecasts. The data can also reveal
short-term trends and thus serve as an early warning system for political
risks, e.g., for protests due to high inflation or poor economic development.
Airline network planners can anticipate routes with high demand risk better
due to inflation and thus adjust flight frequencies or the route mix. It impacts
the three NP process steps of demand forecasting, route selection, and
flight frequency determination.
Consumer confidence indices are the central input factors for GDP growth
forecasts (Angelini et al., 2011, C30). Currently, consumer confidence is
measured through representative polls conducted on a monthly basis by
statistical bureaus and research facilities (Dominitz & Manski, 2004,
pp. 52–53). Since regular large-scale polls are expensive and fail to meas-
ure real-time results, scholars have proposed to employ the sentiment
analysis of social media postings to measure consumer confidence in real
time. Daas and Puts (2014) have shown that the sentiment analysis of
Dutch Twitter and Facebook posts resembles the traditional consumer
confidence index while delivering faster results. Similarly, Tuckett et al.
(2014) utilize the sentiment analysis of broker and analyst reports to meas-
ure business confidence.
Development and evaluation of BDOs for airline NP 213
Three categories of demographic data are most relevant for airline NP,
namely ethnic background, business contacts, and economic status. Large
social networks, such as Facebook, contain vast information on ethnicity
and economic status. Professional social networks, such as LinkedIn, con-
tain data on direct global business contacts. Facebook offers a free ana-
lytics tool called “Audience Insights” that allows the segmentation of the
targeted audience through both demographic and socio-economic factors
(Facebook, 2018a). The user can either segment a specific audience (e.g.,
the followers of his/her page) or the entire public audience. It offers seg-
mentation criteria, such as language, gender, and educational background.
Until August 2018, the data could be extracted through an open application
programming interface (API)158, but Facebook discontinued this API after
a massive data scandal (Johnson, 2017). Now, companies have to use the
integrated “Audience Insights” tool to process raw data.
158
An open API, also known as a public API, is an application programming interface that
allows the owner of a network-accessible service to give universal access to other develop-
ers.
159
The main competitors include SproutSocial, Spredfast, agora Pulse, Salesforce Social
Studio, and Socialbakers.
Development and evaluation of BDOs for airline NP 215
Airline network planners can use demographic data from social networks
in various ways to improve the usefulness and reliability of current demo-
graphic data. Data from global platforms and LinkedIn allow for seamless
analyses across locations, thereby overcoming the issue of varying data
quality from different data sources. Moreover, demographic data can be
narrowed down to the city level (Facebook, 2018a), allowing for a greater
data granularity. Filtering by language or interests may yield a much more
accurate picture of people with a migration background than official statis-
tics. If professional networks, such as LinkedIn, begin marketing direct
business contact data, the demand for business travel may be forecasted
more accurately. All of this affects the need for information pertaining to
transportation demand forecast. Demographic data is mostly utilized in the
strategic NP process steps from demand forecast up to route selection.
All the booking data that airlines currently use for demand forecasting is
based on actual bookings, even if it is updated in real time. Thus, it reflects
booking decisions that have already been made. Many potential passen-
gers search for flights in search engines before they actually process the
booking. Analyzing search engine data could thus indicate future booking
intentions, which makes it a truly future-oriented data source. In addition,
search engine data can indicate real O&D demand, since people are likely
to search for their real destination. IP analytics allow for the detection of
the geographic origin of the search, so that airlines have indications for
both the real origin and the real destination.
Search engine data has attracted significant attention in the scientific re-
search community. Disease prediction was the first proven application
216 Big data opportunities for airline network planning
(Lazer et al., 2014), although the range of opportunities is infinite (Choi &
Varian, 2012). Carrière-Swallow and Labbé (2013) employed Google
Trends data to measure economic indicators in emerging markets in real
time. In tourism research, Google Trends has been widely used to predict
tourist flows (Önder & Gunter, 2016; Park, Lee, & Song, 2016; Yang et al.,
2015) and hotel room demand (Pan, Chenguang Wu, & Song, 2012; Ri-
vera, 2016). Indeed, there is no research on predicting air travel demand
with search engine data, but a similar methodology as for hotel demand
forecasts may be applicable.
All major research is conducted using Google data, which can be explained
by its high market share of about 90% in the global search engine market
(StatCounter, 2018). Google offers its proprietary “Google Trends” webtool
to analyze search volumes. It allows the comparison of different search
terms, which are displayed in a normalized scale. One can obtain absolute
search volumes using the Google Adwords plug-in “Keywords.” The search
statistics can be filtered geographically down to the city level in North
America or to the regional level in other continents (Google, 2018).
Searches can then be displayed as a time series with daily search data for
up to ten years. The data is also available through a Google Trends API
(npm Inc., 2018).
Airlines can combine the search term “flight” with a list of potential destina-
tions and filter these results by the intended origin geographies. The results
should indicate changing preferences and seasonal patterns. The data
needs to be controlled for different booking lead times and conversion rates
for different O&Ds. On some routes, people may search and book several
months ahead, whereas on other routes, the lead time is much shorter. In
any case, search engine data indicates booking intentions and may thus
be a highly valuable complement for actual external and internal booking
data. The broadness of information comes at the cost of specificity and
reliability. Google Trends data have to be adjusted to booking lead times,
Development and evaluation of BDOs for airline NP 217
conversion rates, and search terms. This requires significant data pro-
cessing before it becomes reliable.
Network planners can use general search engine data to fine-tune demand
forecasts and identify unserved markets. Furthermore, this data can reveal
trends in booking and travel behavior. Due to its general character, this
data contributes to various information needs. Primarily, it can improve
transportation demand forecasts and also inform trend identification and
the internal booking situation. In combination with competitor search terms
(e.g., Aeroflot + Moscow), it can enrich competitor monitoring and serve as
an early warning indicator. The broad range of applications makes the data
valuable for almost all NP process steps. Long-term demand forecasts can
benefit from 10-year trend patterns, while mid- and short-term forecasts
can be directly informed by concrete booking intentions. This not only af-
fects the strategic process steps like route selection and flight frequency
determination but also the tactical scheduling and fleet assignment deci-
sions.
Meta-search websites for flights can provide much more focused data than
general search engines. However, the market is fragmented and features
multiple players, such as Skyscanner, Kayak, and Expedia (Reals & Had-
wick, 2015). Since every player can commercialize only their own click-
stream data, these datasets have limited market coverage. In turn, the data
contains complete search information, including set filters, clickstreams,
and IP addresses (Skyscanner, 2018). The IP address can indicate the
real origin of the passenger.
Clickstream data from meta search websites can improve the usefulness
and reliability of transport demand forecasts by increasing the granularity
and specificity of O&D demand estimates. If the data is compiled to a time
series, airline network planners can also identify demand trends. The click-
stream data supports competitor monitoring and market share modelling,
since competitor bookings are visible in this data. At the same time, internal
bookings can be traced. With the IP address included, the real origin of
passengers may be approximated. This information primarily serves the
demand forecast as well as route selection and scheduling. Scheduling,
especially, can be improved through this, since clickstream can contain
preferred departure and arrival time windows. Fleet assignment can also
160
All the results can also be refined by the filters that users have set, e.g., to identify price-
sensitive or time-sensitive customers.
Development and evaluation of BDOs for airline NP 219
competitor monitoring and explain the internal booking situation. This has
an impact on almost all NP process steps from demand forecasting until
the final aircraft routing, which can be changed if short-term demand in-
creases in the spot market are detected.
Fairs for industrial goods are the main driver of the micro-seasonality in
cargo airlines. Especially those fairs that involve high value goods such as
cars or specialized machinery, usually rely on air cargo services. Identify-
ing cargo flows associated with industry fairs requires two components: an
exhaustive list of industry fair events must be complemented with the anal-
ysis of the manufacturing facilities of the exhibitors to predict the micro-
seasonality at the O&D-level. While the cargo airlines in our case study
group maintain an exhaustive list of the most important industry fairs, they
all lack transparency in terms of the entire industry fair landscape. In addi-
tion, trade flows are only recorded by experience, not predictively. Cargo
airlines have currently no transparency in manufacturing sites from exhib-
itors that can help them predict the exact demand origins.
Air cargo network planners can use such data to plan for industry fair-spe-
cific extra stops that can be actively marketed to potential shippers. By
offering a more direct flight connection, the trucking cost can be reduced
and a value proposition for exhibitors can be created. Let us consider the
222 Big data opportunities for airline network planning
Airline network planners can use trends in travel reviews to inform the mar-
ket growth expectations of their demand forecasts. Destinations with an
increasingly positive review trajectory can be identified as potential new
routes. Similarly, destinations with decreasing volume and quality of re-
views may be candidates for frequent reviews. Thus, travel review data is
most important for the strategic NP steps, as it indicates long-term trends
rather than the transient short-term demand.
161
The reviewer has already been at the reviewed point of interest.
Development and evaluation of BDOs for airline NP 225
NP managers can use clickstream data for three purposes. First, they can
forecast future travel flows as per travel intentions. Second, they can rank
potential destinations by their popularity to identify route candidates. Third,
they can monitor the trends in popularity using time-series clickstream
data. While the first use case supports a mid-term demand forecast, the
rest aim to primarily identify long-term trends. Therefore, demand forecast-
ing is the most impacted NP process step. Network planners can further
use clickstream data for route selection. Mid-term travel flow forecasts may
enable schedule adjustments and fleet assignment decisions.
Transportation represents only one side of the coin in a travel system. The
other is accommodation, which largely includes hotels, where leisure and
business travelers are lodged (Bieger & Wittmer, 2006, p. 40). Hotels are
bound to their location and usually know the local demand specificities
much better than any transportation carrier. They may reflect events in their
pricing, of which airline P&RM departments are completely unaware
(Herrmann & Herrmann, 2014). It may be a good strategy for airline net-
work planners and P&RM departments to leverage the local pricing
knowledge for their own decision making.
Hotel prices present two important pieces of information for network plan-
ners. They indicate micro-seasons driven by specific events, which are ex-
pressed through temporary price spikes. It is likely that the demand for
transportation also increases during these price spikes, which may justify
an additional frequency or the deployment of a larger aircraft. If hotel prices
are structurally lower than comparable hotels in other destinations, airlines
should check their own booking ramp up. Low hotel prices may be a symp-
tom of a transportation supply shortage, leading to a concrete business
opportunity – adding transportation capacity to a destination. If a route is
226 Big data opportunities for airline network planning
Airline network planners can use hotel prices to fine-tune their demand
forecasts at the O&D level and make the schedule more dynamic for local
micro-seasons. This implies short-term route addition, flight frequency ad-
justment, and fleet assignment changes.
Airline network planners can use social network event data for similar pur-
poses as hotel price patterns. The major applications entail the fine-tuning
of demand forecasts and the dynamization of the schedule, which affects
NP process steps from frequency selection to fleet assignment.
speed, altitude, hex-code for aircraft identification, and the aircraft registra-
tion or flight number (Flightradar24, 2018). Companies such as
Flightradar24 and FlightAware maintain their own receiver network to cap-
ture the ADS-B data in relevant regions. The data is already commercial-
ized and can be purchased through a real-time data feed subscription or
as batch data.
Research on ADS-B data has focused almost exclusively on air traffic con-
trol applications (Strohmeier et al., 2014) and air traffic flow research
(Schäfer et al., 2014). Older research has focused on the security aspect
of the transmission protocol and general technology adoption issues
(McCallie, Butts, & Mills, 2011).
Batch data can be used to fill the gaps of MIDT and flight scheduling data.
Since almost all the commercial airplanes are equipped with an ADS-B
transponder, the dataset includes a seamless recording of flight move-
ments in the covered geographies. By being aware of the outline data of
charter tenders, airlines can retrace the winning competitors. Real-time
data also enables the detection of the operational issues of competitors,
which can be used to selectively increase frequencies or change aircraft
types to the airline’s benefit. The recorded flight paths can support cost
calculation for new routes, since the likelihood of specific air routings can
be estimated. In summary, transponder data primarily serves competitor
monitoring and has secondary applications in cost forecasting and as an
early warning system.
Airline network planners can use these specialized media monitoring tech-
niques to filter those competitor events that may have an impact on NP.
Larger events such as bankruptcies mostly have a lasting impact and in-
fluence strategic NP. Temporary events such as strikes or groundings may
call for tactical actions, such as the scheduling of additional flights or a
change in fleet assignment. In any case, advanced media monitoring ben-
efits the airlines in terms of competitor monitoring and serves as an early
warning system for critical events.
230 Big data opportunities for airline network planning
Airline network planners have only extremely limited data on the real origin
and the real destination of their passengers. Recorded IP numbers of direct
bookings are often misleading, and location tracking with an airline’s mo-
bile app covers only a small portion of the passengers. The widespread
prevalence of mobile phones has created a completely new opportunity for
location tracking. Every time a cell phone receives a call or message or
data package, mobile network operators can link the phone to the trans-
mitter catchment area. In the early years of mobile phones, operators could
locate users only when they placed or received a call or text message.
However, the increasing popularity of data flat rates have caused constant
flows of data package exchanges and allows mobile operators to track
smartphones seamlessly in their networks (Bonnel et al., 2015, pp. 381–
385).
Researchers started using mobile location data for urban travel flow anal-
ysis around ten years ago (Calabrese et al., 2011). Subsequently, more
studies on urban travel flows based on mobile location data have been
conducted (Çolak et al., 2015; Iqbal et al., 2014). Tolouei, Psarras, and
Prince (2017) demonstrated that properly processed mobile location data
is as reliable as traditional travel surveys, but it has the advantage of a
much larger sample size. Mobile network operators cover a significant part
of the population, resulting in highly representative samples.
The American start-up Streetlight Data, which also joined the data provider
group, follows a different approach. As against using mobile phone signals,
Streetlight Data purchases the GPS-tracked location data from over 400
application providers and now covers over 30% of the mobile phone users
in the US (StreetLight Data, 2018). GPS-based location data has the ad-
vantage of being recorded independent from the mobile network and thus
can be tracked across country borders.
Mobile location data provide a broad application range for airline network
planners. They draw an accurate picture of the relevant travel streams
across all transportation modes, which can be filtered by origin, location,
time of travel, and mode of travel. No other data type currently provides a
comparable information depth across transportation modes. Airline net-
work planners can employ mobile location data to precisely define the
catchment areas of airports. Moreover, in case of low flight frequency, they
can track the real origins and destinations of their passengers162. If GPS
data is used, passengers can be matched to flights, even at busier airports.
The data allows also for competition monitoring, since location data is not
limited to their own passengers. The time series of mobile location data
reveal changing travel patterns, including the used transportation mix. The
data can be combined with an event list (see B4 – Analysis of events on
social networks) to identify the origin of event visitors. Micro-seasonality
162
Data is currently available as 15-minute time stamps; if a limited number of flights arrive
or depart during a 15 minutes time window, passengers may be allocated to the individual
flights.
232 Big data opportunities for airline network planning
Air cargo network planners lack transparency in terms of the real O&D of
shipments when the shipments are consolidated by a freight forwarder.
The current paper-based air waybills (AWBs) require the logistics facilities
only before and after the air transport, so there is little information on the
real origin and destination addresses. The digitization of air waybills is now
a major initiative of the IATA, who defined the report and information stand-
ards for electronic AWBs (IATA, 2018a). However, the e-AWB resembles
the data entry fields of the paper-based AWB, thus not solving the infor-
mation transparency issue for air cargo network planners.
to predict cargo volumes and products for land transport. However, there
exists no air cargo-specific scientific literature on electronic waybills. The
IATA started the e-AWB initiative in 2010, aiming to digitize all AWBs by
2025 (IATA, 2018a).
In order to improve the usefulness of e-AWB for air cargo NP, the standard
template defined in the IATA Resolution 672 needs to be changed. A de-
tailed composition list of the consolidated cargo must be added as a man-
datory data field. This change has to be backed by an air cargo industry
initiative, since all cargo airlines would benefit from the increased real O&D
transparency. However, it is unlikely that freight forwarders will agree to
this change, since they fear that direct sales from cargo airlines will endan-
ger their business model.
If such changes are applied and all e-AWBs contain real O&Ds, the need
for information related to real O&Ds would be fully satisfied. Real O&D data
for cargo shipments is absolutely crucial for strategic NP decisions. It would
enable a substantially more granular demand forecast based on the real
O&D areas, thereby allowing the optimization of the route portfolio and
flight frequencies. Since a large portion of air cargo is trucked over large
distances, a reconfiguration of the network structure may be useful to cre-
ate shared value for airlines and shippers.
Another solution for the lack of transparency of real O&Ds for cargo ship-
ments is the attachment of location tracking sensors to cargo shipments.
An increasing number of shippers request real-time tracking of location and
environmental parameters (e.g., temperature, air pressure, gravity peaks),
especially for expensive and highly sensible cargo (Boylan, 2016). Cargo
airlines, such as Cathay Pacific and Lufthansa Cargo, cooperate with sen-
sor producers to offer real-time cargo tracking as a service to their custom-
ers (Air Cargo News, 2018; Lufthansa Cargo, 2018b). However, the cur-
rently available tracking solutions are costly, since the devices need to be
234 Big data opportunities for airline network planning
leased for about USD 1.50 per day (Lufthansa Cargo, 2018b). Multiple pro-
viders of cargo tracking devices offer sensors directly to shippers, without
leasing them (Cargo Signal, 2018; Sendum, 2018; Sensitech, 2018).
Cargo location data can have a substantial impact on the entire NP pro-
cess. Air cargo network planners can employ this data to conduct hyper-
granular O&D analyses to retrace the end-to-end shipment streams. This
data can be used to optimize network structure, route portfolio, and flight
frequencies. If the data is available in real-time, prediction models for short-
Development and evaluation of BDOs for airline NP 235
term spot bookings can inform scheduling and fleet assignment as a reac-
tion to foreseeable demand shifts. Real-time location data can also inform
aircraft routing and short-term dispatch, since it increases visibility in terms
of whether booked shipments will arrive on time or not.
Airports with high traffic volumes assign landing time windows (i.e., slots)
to airlines to avoid congestion in the available infrastructure. Airport slots
are allocated using the grandfathering principle, implying that the slot
owner keeps the slot if he complies with a specified use quota (e.g., 80%
of days used). If an airline fails to comply with the use quota or stops a
service, the slot ownership is retracted by the airport or the regulator (de-
pending on the country). The slot are then re-allocated based on the pub-
lished allocation criteria (Madas & Zografos, 2006, p. 54).
Airline network planners are faced with two transparency issues with re-
gard to airport slots. First, many countries do not support transparency in
the slot register, i.e., the database containing all the allocated and available
slots. Second, slot changes require in almost all countries (except for the
236 Big data opportunities for airline network planning
163
UK enjoys a special exception from the current EU regulations; slot trading in the US is
restricted to a limited number of congested airports (Fukui, 2010).
Development and evaluation of BDOs for airline NP 237
rapidly than today. Flight frequency changes and (re-)scheduling will also
become more dynamic.
Traffic rights, including overflight rights and transport freedom rights164, are
usually negotiated between two countries and codified in a bilateral agree-
ment. The exceptions are multilateral agreements, such as the European
Single Sky Agreement, where one agreement includes multiple countries.
Those agreements are legal documents and usually comprise a public and
a classified part.
Both the International Civil Aviation Organization (ICAO, 2018a) and the
World Trade Organization (WTO, 2013) maintain global traffic right data-
bases, which suffer from similar shortcomings. The databases contain only
the public information regarding the agreements, which sometimes lacks
penalty and contingent information. Furthermore, the databases are up-
dated infrequently (the last WTO update was in 2013), so they may not
reflect the latest legal situation. In addition, the coverage of agreements
between countries with poor reporting standards is low. Consequently,
most airlines from the case study group maintain their own databases on
traffic rights, which are evaluated as reliable and useful in section 4.2.3.
Airlines usually contact the respective government authority to gather in-
formation on relevant existing traffic rights in case they plan to serve a new
route. Moreover, the airlines sometimes experience denied overflights due
to a non-public overflight contingent, which is not recorded in official data-
bases and is not tracked real time.
This process is both time and labor intensive and can potentially be simpli-
fied by a global traffic right database, which is linked to a real-time tracking
164
Transport freedom rights regulate how airlines can transport passengers and goods from
and to a specific country. The ICAO (2018b) defines nine freedom rights, from simple over-
flight rights (First Freedom) to the right for a foreign airlines to serve domestic routes within
the home country (Ninth Freedom).
238 Big data opportunities for airline network planning
tool, such as flight transponder data. Modern text mining algorithms can
simplify the labor requirement to scan and analyze bilateral agreements
and deliver instead just the relevant information. Such information, e.g.,
number of permitted overflights per day, can then be cross-checked
against the published schedules and real-time flight tracking data.
Scholars and practitioners are applying big data analytics to both meteor-
ological forecasts and disaster impact assessments. Larsen (2013) pro-
posed to use historical meteorological data to inform flight planning sys-
tems, which is a central IT tool of the flight operations process. Artificial
intelligence is being used to forecast turbulence development (Williams,
2014) and provide decision-making tools in case of significant weather
phenomena (Choi et al., 2016; McGovern et al., 2017). A research project
Development and evaluation of BDOs for airline NP 239
late a real-time political risk index (Shieber, 2017). Other companies de-
velop media monitoring algorithms that scan relevant sources every 30
seconds and flag potentially critical events (Muñoz, 2016). The Chinese
company Grisk combines NLP and machine learning to produce sectorial
risk indices, considering the different vulnerability levels of specific indus-
tries towards political events (Grisk, 2018).
Airline network planners can use political risk assessments to adjust their
demand forecasts and subsequently route portfolio and flight frequencies.
The applicability in tactical NP is rather limited. However, unexpected po-
litical events may be relevant to flight operations if they require short-term
re-scheduling or re-routing of flights.
All nine case study airlines participated in the qualitative potential assess-
ment. The evaluation scale ranged from 0 for no potential to 5 for very high
potential. The arithmetic mean across all the BDOs is 2.2, which is slightly
less than the median of 2.5. This indicates that the overall potential of the
BDOs is considered only low to medium. Only four BDOs were scored with
an overall potential of more than 3 (A5, A6, H1 and H2). A9 was assessed
Evaluation of BDO potential 241
by the airline network planners with the lowest potential across all BDOs,
with a score of 0.5. Figure 5.4 visualizes the overall potential scores (black
line) and the score range across all individual assessments (grey line).
No Very high
potential potential
Answer
0 1 2 3 4 5 variance
A1 Geo-spatial image analysis 0.79
A2 Real-time inflation forecasts 1.98
A3 Consumer confidence measurement with social media 1.27
A4 Detailed demographic profiling with social networks 2.84
Transportation
A5 General search engine data for detailed O&D demand 0.55
demand
A6 Click-stream data of meta-search websites for flights 2.84
forecast
A7 Social media mining to determine price elasticity 1.64
A8 Analysis of air cargo rates from online freight portals 3.43
A9 Analysis of exhibitor data of industry fairs 0.86
A10 Shipping data from e-commerce platforms 4.41
B1 Analysis of reviews in travel rating communities 2.41
Trend B2 Click-stream data from travel platforms 2.00
identification B3 Analysis of local hotel prices 2.57
B4 Analysis of events in social networks 2.55
Competitor C1 Real-time flight tracking with transponder data 0.98
monitoring C2 Network-relevant competitor news monitoring 1.13
D1 Mobile location data from smartphones 4.50
Real move-
D2 Extended electronic AWB with shipper address 4.57
ments
D3 Location-transmitting sensors for cargo shipments 4.41
Planning H1 Real-time slot trading platform 1.27
constraints H2 Real-time traffic right information platform 1.07
Incident I1 Global meteorological warning platform 1.13
warnings I2 Global political risk assessments 2.13
The variance in the assigned scores for potential across the case study
group is 2.27 on an average, indicating significant differences in BDO po-
tential for airlines. Location transmitting cargo sensors (D3) exhibited the
largest differences. Two CARs estimated the potential to be very high (5.0),
while five out of the six passenger airlines see absolutely no potential in
this BDO. The differences seem to be business model specific and are
examined further in the section 5.3.2.
165
Source: Own illustration.
242 Big data opportunities for airline network planning
Furthermore, there are also BDOs with comparably uniform potential esti-
mates. Some BDOs for economic forecasts (A1, A3) presented similar po-
tential for all business models. Moreover, the potential for a real-time traffic
right information platform and the incident warnings did not differ dramati-
cally between airlines.
The variance within the need for information clusters seems to be inversely
related with the average potential score. The highest rated cluster “plan-
ning constraints” had a comparably low answer variance of only 1.17,
which is much lower than the average answer variance of 2.27. On the
contrary, real movements had an extraordinarily high variance of 4.41, and
it was rated below average with a potential score of 2.0. However, a simple
correlation analysis performed on BDO level does not support this assump-
tion. The correlation166 between the variance and average score is -0.23,
which is a negligible value, at most indicating a mild association between
variance and potential score.
166
The Pearson correlation coefficient has been calculated using the score variance as the
independent variable and the average score as the dependent variable.
Evaluation of BDO potential 243
No Very high
potential potential
0 1 2 3 4 5
FSC SCA
LCC CAR
167
Source: Own illustration.
244 Big data opportunities for airline network planning
In most cases, low-cost carriers evaluate the potential of BDOs lower than
FSCs, which is confirmed by the below-average potential score of 2.0
across BDOs. Both planning constraint BDOs bear high potential for LCCs,
since they would benefit from real-time data due to their short planning
cycles. The identification of micro-seasonality with local hotel prices and
events is also considered to have high potential by LCC network planners.
Demographic profiling can further benefit the identification of underserved
ethnic markets, which is a considerable traffic driver for LCCs. Looking at
need for information clusters, LCCs rated BDOs above average in the plan-
ning constraint cluster and in the trend identification cluster. Whereas, the
potential of BDOs for transportation demand forecast and real movements
was scored lower than average. This aligns with the findings on the LCC
NP process, which is highly dynamic due to short lead times. LCCs want
to identify unserved revenue pockets, which is reflected in the serving of
ethnic routes and schedules reflecting micro-seasonality.
Cargo airlines show a completely different evaluation pattern than the other
business models. They see high potential in a real-time slot trading plat-
form, which is in line with LCCs and caused by the short planning cycles
of the cargo NP process. However, besides the real-time slot trading plat-
form, cargo airlines highly rated those BDOs that have an explicit role in
cargo demand forecasting and tracking. Overall, CARs rated the 23 BDOs
with an average potential of 2.3, which is similar to the FSCs and higher
than the LCCs and SCAs. They rated all the need for information clusters
except for trend identification higher than the average of the case study
group. Table 5.2 shows the BDOs ranked by business model.
Table 5.2 – BDOs ranked by business model168
168
Source: Own illustration.
246 Big data opportunities for airline network planning
Real-time slot trading platforms were the highest ranked BDO for both
LCCs and CARs, and the fourth highest for FSCs. It was also the only
shared top 5 BDO of CARs with any other business model. Within the pas-
senger airlines, demographic profiling appears in the top 5 of all three busi-
ness models. Four BDOs appeared twice in the business model-specific
top 5, namely clickstream data of meta-search websites, mobile location
data from smartphones, general search engine data (all FSCs and SCAs),
and analysis of local hotel prices (LCCs and SCAs). In total, 12 different
BDOs appeared at least once in the top 5 of one business model, which
are highlighted in grey in Table 5.2.
Evaluation of BDO potential 247
169
The assessments of passenger airlines for cargo-specific BDOs and the assessments of
cargo airlines for passenger-specific BDOs have been excluded.
248 Big data opportunities for airline network planning
170
Source: Own illustration.
Evaluation of BDO potential 249
the BDO (which is discussed in the next section) but the possibility of the
current NP software solution to integrate the analytics result. The scores
were given on a 6-point scale from 0 (not possible to implement) to 5 (very
easy to implement). Since the current system is focused on passenger air-
line NP, all cargo-specific BDOs were evaluated with the lowest score of
0.
The answers displayed in Figure 5.7 show that the analytical results from
BDOs are in most cases either very easy or very hard to implement, with
a few exceptions that show a medium degree of ease of implementation.
All BDOs that influence assumption values in the NP software solutions
171
Source: Own illustration.
250 Big data opportunities for airline network planning
are easy to integrate. This group includes economic growth and inflation
forecasts as well as assumptions concerning O&D demand and price elas-
ticity. A second group of BDOs that are comparably simple to integrate are
those providing structured demand data, which can be calibrated to inform
O&D growth forecasts. This includes search data (both Google and flight
websites), clickstream data from travel platforms, and event data from so-
cial media.
The third group of easy-to-implement BDOs includes slots and traffic right
information. The current IT systems already feature a slot and traffic right
database, currently updated manually by airline network planners. The
data pull can be automated if the data is available in a suitable format, i.e.,
a table of current slots and codified traffic rights.
BDOs that provide data on information categories which are not included
in the current IT systems are difficult to implement, because the entire sys-
tem would need to be recalibrated for the new data. Mobile location data,
for example, includes information on non-airline movements, which would
add a completely new dimension to the current IT systems. Similarly, real-
time flight tracking and micro-seasonality forecasting are not included in
present IT systems.
the financial feasibility evaluation (Milis & Mercken, 2004) instead of tech-
nological and legal feasibility aspects. Other literature follows a strategic
perspective and classifies the feasibility assessment as an important pro-
cess step in IT project planning (Willcocks, 2009, pp. 239–259). However,
it does so without further specifying the feasibility criteria apart from finan-
cial feasibility. Similarly, Ehie and Madsen (2005) found that feasibility is
the third most important step in an implementation process for enterprise
resource planning through an empirical analysis.
Scoring
2 1 0 -∞
A How technologically feasible is the data Feasibility proven Feasible, but not Potentially Not feasible
collection? yet proven feasible
B What are legal or political restrictions to the No restrictions Minor restrictions Severe Use of data not
use of data? restrictions permitted
C How is data ownership organized? Internal data Data Data potentially Data not available
commercially available
available
D How aggregated (holistic) is the available Single provider of Various providers Various providers Data not available
data? aggregated data of partly of un-aggregated
exists aggregated data data
E How accessible are analytics tools to Analytic tools Analytic tools Analytic tools Analysis not
process the data? integrated with commercially need to be possible
data available developed
Figure 5.8 – Feasibility assessment model for big data acquisition and analytics172
172
Source: Own illustration.
Evaluation of BDO potential 253
all data should be available at the same source, which is the case for tran-
sponder data or Google Trends. BDOs with multiple data sources, such as
mobile location data or clickstream data from meta-search engines, require
data aggregation, thus reducing the feasibility of the BDO. Finally, there
should be analytic tools to process the data. In an ideal case, the analytic
tool is integrated in the data set, which is true for the hotel rate intelligence
software. In the worst-case scenario, analytic tools cannot handle the data
meaningfully, e.g., when data quality is poor.
For 16 out of the 23 BDOs, it could be proven that data collection is tech-
nologically feasible. This proof stems from either a commercialized use or
scientific research using the data source. Three BDOs gave no proof of
technological feasibility. Extraction of exhibitor manufacturing sites (A9)
may be feasible, but there is no real-life example for such a scraped data
set. Similarly, there is no example for a real-time information platform for
traffic rights that connects transponder data with text mining from bilateral
agreements. The extended e-AWB suffers resistance from forwarders who
have blocked a technological proof of concept. However, all three BDOs
appear potentially feasible and are rated with a score of 0.
254 Big data opportunities for airline network planning
available data?
Total
Code Big data opportunity score
A1 Geo-spatial image analysis 2 2 1 1 1 7
A2 Real-time inflation forecasts 2 2 0 2 1 7
A3 Consumer confidence measure- 2 2 0 0 0 4
ment with social media
A4 Detailed demographic profiling 2 1 1 0 0 4
with social media mining
A5 General search engine data for 2 2 1 2 0 7
detailed O&D demand
A6 Clickstream data of meta-search 2 2 1 0 0 5
websites for flights
A7 Social media mining to determine 1 1 0 0 0 2
price elasticity
A8 Analysis of air cargo rates from 2 1 1 2 2 8
online freight portals
A9 Analysis of exhibitor data of in- 0 2 0 0 0 2
dustry fairs
A10 Shipping data from e-commerce 2 2 0 1 0 5
platforms
B1 Analysis of reviews in travel rating 2 2 0 1 0 5
communities
B2 Clickstream data from travel plat- 2 2 0 2 0 6
forms
B3 Analysis of local hotel prices 2 2 1 2 2 9
173
Source: Own illustration.
Evaluation of BDO potential 255
available data?
Total
Code Big data opportunity score
B4 Analysis of events in social net- 1 1 0 1 0 3
works
C1 Real-time flight tracking with tran- 2 2 1 2 0 7
sponder data
C2 Network-relevant competitor news 2 2 1 0 1 6
monitoring
D1 Mobile location data from 2 0 1 1 1 5
smartphones
D2 Extended electronic AWB with 0 1 2 0 0 3
shipper address
D3 Location-transmitting sensors for 2 1 0 0 0 3
cargo shipments
H1 Real-time slot trading platform 2 0 0 0 0 2
H2 Real-time traffic right information 0 1 0 0 0 1
platform
I1 Global meteorological warning 1 2 0 0 0 3
platform
I2 Global political risk assessments 1 2 1 0 1 5
Average score 1.57 1.52 0.52 0.74 0.39 4.74
Two BDOs suffer severe legal restrictions, and seven BDOs have minor
legal or political restrictions. A real-time slot trading platform relies on the
cooperation of national regulatory authorities, and so far, only free-market
economies have provided the necessary access to slot data. Transparency
in slots may not be desirable for countries that wish to protect their national
aviation industries or use slots as a means of political pressure. Mobile
location data suffers from severe legal restrictions, since all the data needs
256 Big data opportunities for airline network planning
Only the extended e-AWB can be counted as internal data, since it needs
to be transmitted for every shipment. If the real O&D were to be added as
a mandatory field, the BDA can use the internal AWB database as a pri-
mary data source. Further, 12 BDOs are not yet commercialized, which
requires bilateral negotiations with the current data owners to access these
data. Six BDOs have a single source of data, which reduces the effort of
data matching and processing. Another 12 BDOs work with dispersed and
unstructured data, which, in turn, makes complicated data matching and
processing even more complicated. Intelligence software for cargo rates
and hotel prices include analytic tools to draw useful information from given
data. Five other BDOs offer analytics as an add-on service that needs to
be procured. Integrated platform analytic tools, such as Google Trends or
Facebook Audience Insight, were not rated in this category, since they are
not customizable. The output data from Google Trends or Audience Insight
must be processed and analyzed subsequently for the specific BDO pur-
pose of airline NP.
If we contrast the potential and feasibility of all 23 BDOs, four clusters can
be derived (see Figure 5.9). Top priorities are the BDOs with high potential
(3.0 or above) and high feasibility (score of 5 or above). Six BDOs are
located in this cluster, including the two most feasible BDOs. Solutions for
cargo rate and hotel price analytics are readily available and need to be
only subscribed. Search engine and clickstream data is also already avail-
able, although not all meta-search websites have commercialized their
data yet. Mobile location data is being commercialized, and first pilot appli-
cations for public transportation do exist. Finally, shipping data from e-com-
merce platforms exist, but airlines need to negotiate with platform opera-
tors, such as Alibaba or JD.com, to access the data.
Evaluation of BDO potential 257
Long-term value creators are BDOs with high potential that currently have
limited implementation feasibility. Raw data on demographic profiling can
be accessed from Facebook, but there is no automated data transmission
and the queries need to be set manually. Furthermore, airlines can ap-
proach LinkedIn to drive the commercialization of business network data.
Event extractions from Facebook are complicated due to the discontinued
API, but third-party developers with full API access can potentially provide
a customized solution for airline network planners. Airlines should actively
lobby the IATA to include a mandatory real O&D field in the e-AWB. Simi-
larly, airlines should approach mobile network providers to commercialize
174
Source: Own illustration.
258 Big data opportunities for airline network planning
cargo location data in the same way as mobile location data, once Lo-
RaWANs are rolled out at a large scale. Finally, airlines can lobby for a
globally harmonized deregulation of slot trading rules to facilitate the crea-
tion of real-time slot trading platforms. Similarly, real-time traffic right plat-
forms will need access to bilateral agreements on a global scale.
BDOs with high feasibility but limited potential can serve as data add-ons,
which can be implemented quickly if a specific information need arises.
Geo-spatial image analysis and real-time inflation forecasts can enhance
economic growth forecasts, and travel trends can be identified by analyz-
ing clickstream data and reviews from travel platforms. Competitor moni-
toring with transponder data or media monitoring may be valuable in case
of increased volatility in the airlines industry. The services of global risk
assessment platforms may become more important if political instability
increases.
Four BDOs are potential value traps, since they are difficult to implement
and have a low potential to satisfy information needs in airline NP. Con-
sumer confidence measurement with social media is possible, but it is only
a very minor input assumption in the economic growth models that are
used by airlines. Furthermore, the data does not dramatically improve the
traditional survey-based consumer confidence indices. Moreover, social
media mining to determine price elasticity is a major technical challenge,
since many input data need to be connected to specific customer profiles.
Data privacy legislation further complicates this analysis. In reality, this
BDO may be more valuable for the P&RM department of an airline, as it
allows for individual pricing (if legally permitted). While data on industry
fairs and exhibitors is easily available, it is problematic to identify the cor-
responding manufacturing sites and predict trade flows. Airline will need to
match the data with their own bookings and invest heavily in data mining
to extract relevant trade flows. Finally, meteorological warning platforms
are the most relevant for flight operations but not for airline NP. Airlines
Evaluation of BDO potential 259
should restrain from investing in value traps, unless their primary purpose
is to serve a different function (e.g., flight operations).
The next chapter develops a cost and benefit evaluation scheme for the
top priorities and long-term value creator clusters. The BDOs in the data
add-on and value trap clusters are not included in the cost–benefit analy-
sis, as those data sources are most likely not the primary investment can-
didates for airline NP. This gives us 12 BDOs for further analysis, including
four cargo-specific BDOs (A8, A10, D2, D3), four passenger-specific BDOs
(A4, B3, B4, D1), and four general BDOs (A5, A6, H1, H2).
6 Financial impact of big data for airline network planning
Figure 6.1 – Logical process to derive cost and benefit estimate for BDOs175
175
Source: Own illustration.
The case study group primarily uses indirect KPIs as well as aggregated
route and network profitability to measure NP performance. The direct KPI
load factor, direct operating cost (DOC), and indirect operating cost (IOC)
are also used as secondary KPIs by some airlines. IOC and DOC are com-
bined in a direct cost KPI, since all airlines use them in this form only. Sim-
ilarly, network and route profitability are joined for the analysis. Figure 6.2
recapitulates the currently used KPIs from chapter 4.
Choice of appropriate BDOs based on KPIs 263
Figure 6.2 – Direct and indirect KPIs currently used by the case study group airlines176
In the last interview round, the participating airline network planners were
asked to assess the impact of the 12 BDOs selected in chapter 5. The
rating scale ranged from 0 (no impact) to 5 (very high impact), and each
primary or secondary KPI that the respective airline used was assessed.
The average scores across the case study group were calculated using an
equally weighted arithmetic mean of the answers177. The average scores
were used to produce spider graphs for each KPI, which visually indicate
the estimated impact by BDO (Figure 6.3 to Figure 6.9). Cargo-specific
BDOs were evaluated only by cargo airlines, while passenger-specific
BDOs were evaluated only by the FSCs, LCCs, and SCAs. General BDOs
were evaluated by all the applicable airlines.
176
Source: Own illustration.
177
Only answers from the airlines which use the respective KPI are included.
264 Financial impact of big data for airline network planning
Seven of the case study group airlines use route or network profitability as
primary or secondary KPI179. All of the business models are represented.
The four cargo-specific BDOs achieved the highest scores of 4.0, which
indicates that cargo airlines expect a very high impact of all BDOs on prof-
itability. Passenger airlines expect the highest profitability impact from ho-
tel price analysis (4.0) and mobile location data (3.7). General search en-
gine data received a high impact score on profitability across all airlines
(3.5). The two “planning constraint” BDOs were rated the lowest, with low
178
Source: Own illustration.
179
LCC 2 and CAR 3 do not use route or network profitability.
Choice of appropriate BDOs based on KPIs 265
and medium impact scores of 1.5 and 2.3 respectively. Besides the low
ranking of the “planning constraint” BDOs, there was no clear pattern
across the need for information clusters.
Load factor
Only CAR 1 uses load factor as a secondary KPI for the NP departments.
Thus, Figure 6.4 represents the answers of only this airline. Shipping data
from e-commerce platforms is the only BDO with an expected very high
impact on the load factor. E-commerce usually entails small merchandise
in boxes, which can be assimilated on pallets. Thus, a detailed forecast of
e-commerce shipments can improve load planning, thereby improving the
180
Source: Own illustration.
266 Financial impact of big data for airline network planning
load factor. Similarly, cargo airlines can use real-time location data for
cargo (impact score of 3.0) to predict late arrivals and fill the load space
with alternative cargo, if possible. None of the other BDOs are expected to
have significant impact on the load factor.
Forecast accuracy
The two FSCs and SCA 2 use forecast accuracy as a secondary KPI for
NP. These airlines have long planning cycles induced by their business
model and thus rely on the precision of mid- and long-term forecasts. Ac-
cordingly, general search engine data (4.0) and meta-search website data
(3.7) are expected to have the greatest impact on forecast accuracy. Local
hotel prices (3.3), events from social networks (3.0), and mobile location
data (3.0) may still have a high impact on forecast accuracy. The “planning
constraint” BDOs were ranked the lowest, whereas cargo-specific BDOs
were not ranked at all.
The pattern is not surprising since airlines focus on BDOs that contribute
primarily towards demand forecast and trend identification, thus strength-
ening the predictive power of forecasting.
Choice of appropriate BDOs based on KPIs 267
Schedule robustness
181
Source: Own illustration.
268 Financial impact of big data for airline network planning
for LCC 2. All other BDOs bear no to medium impact for schedule robust-
ness.
182
Source: Own illustration.
Choice of appropriate BDOs based on KPIs 269
Asset utilization
183
LCC 2 and SCA 2 use it as a primary KPI; FSC 2, LCC 1, SCA 1, and CAR 2 use it as a
secondary KPI.
184
Source: Own illustration.
270 Financial impact of big data for airline network planning
Passenger airlines rated local hotel price analysis the highest (3.5), as that
may enable the discovery of untapped revenue pockets and adjustment for
micro-seasonality. Real-time BDOs are not deemed relevant for asset uti-
lization across passenger and cargo airlines. The airline network planners
attested that several BDOs have a medium impact on asset utilization (A4,
A5, A6, B4, and H1), but there is no clear recognizable pattern. However,
cargo airlines expect more impact potential on asset utilization than pas-
senger airlines.
FTE efficiency
Five BDOs (A4, A5, A6, D1, and H2) received a medium impact score by
the airline network planners. Those BDOs can contribute to reducing time
for information gathering and processing, thereby making NP employees
more efficient in their daily routine.
Choice of appropriate BDOs based on KPIs 271
SCA 1, LCC 1, and CAR 3 use DOC and IOC as cost KPI for NP. In con-
trast to other KPIs, the “planning constraint” BDOs received high impact
scores. A real-time slot trading platform has very high potential for cost
reductions (4.0), and a real-time traffic right information platform has a high
potential (3.5). Real-time slot management and traffic right information can
help reduce delay-induced cost and fines for exceeding traffic rights or
missing assigned slots. Mobile location data achieved a medium impact
score of 2.0, whereas all the other BDOs were considered to have low or
no impact.
185
Source: Own illustration.
272 Financial impact of big data for airline network planning
186
Source: Own illustration.
Choice of appropriate BDOs based on KPIs 273
Schedule robustness
Forecast accuracy
Aircraft utilization
Route & network
FTE efficiency
Load factor
profitability
Direct cost
BDOs
Detailed demographic profiling with social
3 0 2 3 2 2 0
media mining
General search engine data for detailed
3 2 4 0 2 2 0
O&D demand
Clickstream data of meta-search websites
3 2 3 0 2 2 0
for flights
Scraping of published rates of cargo ship-
4 2 0 0 4 1 0
ping portals
Shipping data from e-commerce platforms 4 4 0 0 4 1 0
Legend: 4 very high - 3 high - 2 medium - 1 low - 0 very low/none (rating derived from
Figures 6.3-6.9)
187
Source: Own illustration.
274 Financial impact of big data for airline network planning
It is worth noting that the results of the impact assessment are not statisti-
cally significant, since the number of individual assessments is low due to
the limited size of the case study group. Including more airlines in a broader
survey will increase the external validity of the analysis and bridge the as-
sessment gaps, such as that of the load factor, which were assessed by
one cargo airline only. However, the result patterns confirmed the earlier
hypothesis that there is no one-size-fits-all solution for airlines when it
comes to choosing the most suitable BDO for airline NP.
Airlines must select the right KPI metrics to calculate the expected benefits
from BDO deployment. For most KPIs, there are multiple potential
measures, including static and dynamic metrics as well as metrics focusing
only on a KPI aspect that is relevant for NP. Dynamic metrics measure the
change in a specific KPI over time and are preferable to static measures,
which measure a KPI at a specific point of time188. Furthermore, the metrics
should fall in the domain of airline NP. External factors, such as fuel prices
or economic downturns, should be excluded from the NP-specific benefit
analysis to represent the actual impact of BDOs on NP.
Figure 6.10 collates a range of possible metrics for each of the seven KPIs
used by the case study group. Route profit, for example, can be calculated
using static metrics, such as the total or average route profitability for a
given period. This global static metric is influenced by several internal and
external factors (e.g., economic cycle, fuel prices) with limited influence by
the NP department. Alternatively, only the routes with long-term negative
performance can be used as an indicator for unsuccessful route decisions.
This metric is static but focuses on a remit that is more controllable by a
188
BDO impact should be assessed over an investment time period (e.g., 5 years) to inform
a business case calculation.
Choice of appropriate BDOs based on KPIs 275
The metric “profit delta from flight portfolio changes” compares the financial
performance of new flights with the financial performance of terminated
services. Ideally, the new flights should show a significantly higher profita-
bility than the ended flights, thus proving the value added by NP. Choosing
189
Otherwise, highly profitable routes can be replaced with other profitable routes, which will
not affect the total route profit for the airline.
190
Source: Own illustration.
276 Financial impact of big data for airline network planning
“profit delta from flight portfolio changes” or only the “new flight perfor-
mance” depends on the capacity development of the airline. An airline with
little capacity growth needs to replace its existing services if it wants to
pursue new route opportunities, while an airline with strong capacity growth
can simply add new flights without terminating the existing ones. For air-
lines with high capacity growth, the “new flight performance” metric is more
suitable. For all other airlines, the “profit delta from flight portfolio changes”
metric is preferable to measure the BDO impact on route profitability.
Unfortunately, this logic does not consider network effects, comprising up-
line and downline revenues. If an airline uses network profitability as a KPI
instead of route profitability, the extraction of the NP-specific impact is chal-
lenging191. One option to solve the problem is using the total network profit
as a global and static metric, acknowledging that the influence of NP is
limited. Alternatively, the “route portfolio delta” metric can be calculated as
well, omitting the network effects. The total network profit is chosen as a
metric for network profitability, as it is the only metric that specifically ad-
dresses network profitability.
191
Upline and downline revenues tend to display significant variance on many routes, so the
dynamic comparison is diffused (Sterzenbach et al., 2013, p. 393).
Choice of appropriate BDOs based on KPIs 277
Asset utilization is typically measured with the average block hours that
aircrafts fly per day. The objective is to minimize the unproductive time on
the ground and distribute the aircraft financing cost along more flight hours,
thereby reducing the indirect unit cost. The aircraft utilization can be differ-
entiated by aircraft type, as long-haul and short-haul aircrafts have signifi-
cantly different operation patterns. However, asset utilization cannot be in-
creased indefinitely, because maintenance schedules and turnaround
times are required in every rotation plan. For this reason, some airlines use
an alternative metric, called “idle hours.” “Idle hours” measure only the un-
productive time during which an aircraft is parked idle on the ground. NP
departments are responsible for the minimization of the unproductive
ground time with schedule optimization and find suitable flights instead.
Therefore, “idle time” is fully in the remit of NP, but it is still a static measure.
The metric can be dynamized by measuring only the change in average
idle time per aircraft over two different periods. These times are aggregated
and converted to a reasonable number of flights that can be operated in
this idle time to create a monetary benefit value. This metric is both dy-
namic and NP controlled. Hence, it is preferred to assess BDO value con-
tribution.