Samenvatting

Summary Supply Chain Data Analytics

Name: Summary Supply Chain Data Analytics
SKU: doc_1009042
Rating: 2.00 (1 reviews)
Author: goelin_zhang

Beoordeling

2,0

(1)

Verkocht

Pagina's

Geüpload op

06-03-2021

Geschreven in

2019/2020

Summary of lectures and literature. Supply Chain Data Analytics is focused on furnishing you with methodological and technical skills to conduct data driven research in academic and industry projects in the TSCM domain. The main topics of the course are time series forecasting, regression, classification and clustering. The software tool R will be used to process and analyse the data sets.

Meer zien Lees minder

Instelling

Vak

Oeps! We kunnen je document nu niet laden. Probeer het nog eens of neem contact op met support.

Meld schending auteursrecht

Gekoppeld boek

Kenneth C Lichtendahl Jr, Kenneth C Lichtendahl, Jr Practical Time Series Forecasting with R

Uitgave:Onbekend
ISBN:9780997847925
Druk:2

Geschreven voor

Instelling: Vrije Universiteit Amsterdam (VU)
Studie: Master Transport And Supply Chain Management
Vak: Supply Chain Data Analytics

Alle documenten voor dit vak (2)

Documentinformatie

Heel boek samengevat?: Ja
Geüpload op: 6 maart 2021
Aantal pagina's: 20
Geschreven in: 2019/2020
Type: Samenvatting

Onderwerpen

Voorbeeld van de inhoud

HC 3 Forecasting
t=1,2,3,… Index denoting time period of interest (t=1 is first period)

y1,y2,y3…,yn yt denotes value at time period t (vb. gem temperatuur; t=1 staat voor dag 1 y1 is gemiddelde temperatuur op dag 1)

Let op: Gender should be considered a factor in R instead of character, because gender is a categorical value.

Forecasting process
1. Forecast goal Definition; determine forecast goal.
- Descriptive 'Time series analysis’; describe patterns in historical data (trends, seasonal)
Predictive ‘Time series forecasting’; use to predict future values (can’t use future information).
- Forecast horizon & forecast updating; forecast use determines horizon (bv. 1 month
forecast for revenue management). Level of automation depends on task & use in practice
→ Roll forward forecast= refreshing of forecast based on new data
2. Get data: Data Collection & cleaning → data sources
3. Explore & Visualise Data; view data in R & check values (plot)

SYNTAX:
ts is a function to create time series objects start/end= begin date, frequentie
→ name.ts <- ts(filename$category, start = c(year, month), end = c(year, month), freq = #months)
plot(x, y, main="title", xlab=“X-ax naam", ylab=“y-ax naam”, xlim=c(xmin, xmax), ylim=c(ymin, ymax))

Time Series Components
Systematic; - Level; describes average value of the series in given data
- Trend; looking at past sales to determine possible trends to predict future (growth/decline of graph)
- Seasonality; predictable cyclic variation depending on the time within the year (seasonality)
Non systematic; - Noise; random variation that results from measurement error or other causes that are not accounted
Time series with additive components; yt= Level + Trend + Seasonality + Noise → Additive; values vary by constant amount
Time series with multiplicative component; yt= Level * Trend * Seasonality * Noise → Multiplicative; values vary by percentage

STL function captures 4 components

The time series components are level, noise,
trend, and seasonality. Level and noise are
always present. Trend is likely to be present.
Seasonality is likely to be present if the data
follows the same patterns (seizoenen)
4. Pre-Processing; only need a small data amount
- Obsolete or redundant fields
- Missing values
- Outliers; extreme values
- Values not consistent with policy
- Time span (irrelevant period or data)
- Unequally-spaced series; fout tijdinterval waarden
Missing values can affect ability to generate forecasts AND AFFECTS PERFORMANCE MEASURES

How to handle missing data?
- Ignore, forecasting method does not use the variable
- Omit missing value
- Replace missing value with a constant, mean, randomly generated value etc.
- Replace missing value with value based on other characteristics of the data
Remove data that don’t help with forecast (mistake / duplicates)

Imputing value; Use prediction / forecasting methods to impute the values
- Naive forecast (easiest)= predict value for next week depending on value of similar time span (this week). Ft+1=Yt
Seasonal naive forecast; Ft+k=Yt-M+k
→ Advantage; simplicity and ease of deployment
→ Disadvantage; it does not take trends & external information into account
→ Improve; Combine data-driven & model-based methods (average their Ft)
5. Partition Series; Workflow to create data partitions;
- Training set → contains data used to build various models
- Validation set → used to compare performance of each model; benchmark
- Test set → used to assess performance of chosen model with new data
Choose training/validation that mimics forecast horizon, this depends on:
- Forecast horizon
- Seasonality
- Length of series; not too long as longer period gives less recent info
- Underlying conditions affecting series

,Difference between data partitioning in cross sectional and time series data;Cross sectional data partitioning is usually done
randomly. In time series, a random partitioning does not mimic temporal uncertainty where we can use past to forecast future.

6. Apply Forecasting Method
What forecast methods are we going to use? Data-driven, model-based or judgmental? Combining or Ensembles?
Data driven methods; “learn” patterns from data (adjusts over Model based methods; use mathematical model to forecast.
time). Possible with naive forecast & smoothing methods. Works on small data sets and can include external
Advantageous when when structure of time series changes over information.Training data used to estimate parameters.
time and requires less user input and are easily automated. Estimated parameters are then used to generate forecasts.
Large time series is necessary for adequate learning. Possible with; regression, neural networks, machine learning
→ better for local (changing) patterns → better for series with global pattern
Wanneer data onvoorspelbaar is (fluctuatie), is data driven beter, Met noise en weinig signal, is model based beter omdat data
omdat het zich aanpast door de tijd heen. niet fluctueert en een mathematical functie kan gebruiken.
Judgemental Forecast; used when there is lack of good data. - Combining multiple forecast can improve predictiveness
Also used to adjust statistical forecast and to compare/combine - Two-level forecast: method 1 gives forecast , method 2 uses
data with. forecast errors (from 1) to generate future forecast errors.
→ methods; delphi method, scenario forecasting Ensembles; apply multiple forecasting methods to same series.
Average different forecast for higher precision.

7. Evaluate & Compare Performance
Model which fits data well does not necessarily forecast well
- Underfitting; Performs poorly on training data. Model is too simple.
→ Increase complexity by adding trend/seasonality
- Overfitting; Performs well on training data, but poor on validation
set (cannot generalize data).
→ Decrease model complexity by removing trend/seasonality
If forecasting model gives satisfactory performance on validation set; The
training and validation periods must be re-combined, and then the forecasting
model should be applied to the complete series before producing future forecasts.

8. Implement Forecast/System
should be applied to the complete series before producing future forecasts.

Predictive Accuracy measures: FORECAST ERROR=RESIDUALS
Positive forecast error —> systematically under forecasting
Forecast error = Actual value - Forecast value; Gives forecast difference at time t
Mean absolute error/deviation → Gives magnitude of average absolute forecasting error.
Disadvantage; forecast error is scale dependent
Average Error → Gives an indication whether forecast are averagely over/under forecasting
Disadvantage: forecast error is scale dependent
Mean absolute percentage error → Gives a percentage of forecasts deviation from the
actual values (if value is 0, MAPE cannot be calculated).
Advantage; scale independent.
Disadvantage; cannot use 0 value & puts heavier penalty on negatives
Mean squared error → Gives averaged squared error as forecasting error.
Advantage: sensitive to large errors, accepts 0.
Disadvantage; Hard to interpret, not scale independent
Root mean squared error → has same units as data series (model with lower RMSE for
training period is more useful for describing components of time series, model with lower
RMSE for validation period is better for forecasting)
Performance Charts and metrics are used to: 1. Evaluate predictive accuracy on validation period
2. Assess overfitting by comparing training & validation performance.
Example
MAE=1/3 * (|-1|+1+|-2|)=
Average error=1/3* (-1+1+-2)=
MAPE=1/3 * (|-1/1|+1/11+|-2/15|)= * 100=
RMSE= WORTEL (1/3 * ((-1)^2+(1^2)+(-2)^2))=
\

, HC 4
Smoothing method; based on averaging values over multiple periods to reduce noise
Estimating level; Naive forecasts, mean forecast, moving average (MA), and simple exponential smoothing (SES) are used to
forecast series with no observable trend or seasonality. Level estimated by previous values or averages.

1. Moving average

Centered moving average: based on a window W centered around time Yt
- Odd width: center window on time t, average the values in the window (5)
- Even width: take two most centered windows and average the values in (4)
Trailing moving average: based on a window from time Yt and backwards →

Window Width; Wider windows will expose more global trends, while narrow windows will reveal local trends.
Forecasting: use domain knowledge to determine the best window size.
Longest window → Over smoothing; you take average of entire sample (W=Sample size)
Smallest window → Under smoothing; apply naive forecast (W=1)
R—> You should de-trend as wel as
de-seasonalize the series before using
an MA to generate forecasts.

♦ Forecast jaar 20 = jaar 21+t als er geen waarden beschikbaar zijn voor jaar 20
♦ MA assumes no trend, so if trend continues to increase the predicted MA value will most likely under forecast

Link between MA & SES; In both methods the user must specify a single parameter. In moving average → window width (w).
In exponential smoothing → smoothing constant(α). Parameter determines importance of new information over older ones.
Relation between a window width (w) and smoothing coefficient (α) is approximately w= 2 / α - 1
α=2/(w+1) → low w gives high α

2. Simple exponential smoothing; use a weighted moving average (α) so that weights decrease exponentially in the past
Compared with MA, SES gives more weight to recent observations

→ yi= sum van de gegeven waarden
→ toekomstige voorspelde waarden blijven gelijk aan Lt +k wanneer
werkelijke(nieuwere) waarde niet wordt gegeven (yt+k)

Smoothing Constant Alpha → α determines how much weight is given to past
- α=1; past observations have no influence over forecast → Under smoothing
- α → 0; past observations have large influence on forecast = Over-smoothing
3. Adaptive Learning Process &
→ Toekomstige voorspelde waarden blijven gelijk aan Ft +k wanneer werkelijke waarde niet wordt gegeven (yt+k)

Differencing is removing a trend and/or seasonality from a time series by
taking the difference between two observations.
- Lag-1 difference: yt – yt-1 removes trends
- Lag-M difference: yt – yt-M removes seasonality, depending on timespan
(You compare winter 2017 with winter 2016 with lag -4, 4 seasons)
- Double-differencing: difference differenced serie to remove quadratic trend

♦ NOTE! Simple exponential smoothing or Holts Method is better if serie has a lot of noise and no seasonality!!!

€5,49

Krijg toegang tot het volledige document:

100% tevredenheidsgarantie

Direct beschikbaar na je betaling

Lees online óf als PDF

Geen vaste maandelijkse kosten

Maak kennis met de verkoper

goelin_zhang

3,8

(6)

Ook beschikbaar in voordeelbundel

Beoordelingen van geverifieerde kopers

Alle reviews worden weergegeven

mrhbout Transport And Supply Chain Management · 12 beoordelingen

3 jaar geleden

This summary is missing lots of information, and not thorough. Good for checking your knowledge, but not totally worth it

2,0

1 beoordelingen

Betrouwbare reviews op Stuvia

Alle beoordelingen zijn geschreven door echte Stuvia-gebruikers na geverifieerde aankopen.

Maak kennis met de verkoper

goelin_zhang Vrije Universiteit Amsterdam

Bekijk profiel

Volgen

Verkocht

Lid sinds

5 jaar

Aantal volgers

Documenten

Laatst verkocht

1 maand geleden

3,8

6 beoordelingen

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper goelin_zhang. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €5,49. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews) Afgelopen 30 dagen zijn er 58198 samenvattingen verkocht Opgericht in 2010, al 16 jaar dé plek om samenvattingen te kopen

Summary Supply Chain Data Analytics

Gekoppeld boek

Geschreven voor

Documentinformatie

Onderwerpen

Voorbeeld van de inhoud

Meer vakken binnen Vrije Universiteit Amsterdam (VU) > Master Transport And Supply Chain Management

Ook beschikbaar in voordeelbundel

Beoordelingen van geverifieerde kopers

Maak kennis met de verkoper

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Niet tevreden? Kies een ander document

Betaal zoals je wilt, start meteen met leren

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Tevredenheidsgarantie: hoe werkt dat?

Van wie koop ik deze samenvatting?

Zit ik meteen vast aan een abonnement?

Is Stuvia te vertrouwen?