程序代做CS代考 data mining Excel CORPFIN 2503 - Business Data Analytics

CORPFIN 2503 – Business Data Analytics

2021 S2, Workshop 2: Visual analytics and data mining

£ius

1 Time series stock market data

Let’s download time series stock market data for all stocks traded on ASX using

Eikon:

1. Search for Screener

2. Filter for stocks traded in Australia (country of exchange)

3. Click button `Launch Data Item Library’ (on the right of `Add Column’ bar)

4. Search for `price close’

5. Choose the following parameters: select `Series’, calendar month (CM), last

26 observations, ordered from oldest to newest

6. Click on Excel icon to save the data as Excel �le.

2 Lags and leads

Unfortunately, creating lagged and lead values of variables is not an easy task. Let’s

create lagged and lead values using the example below.

/* Creating data file: */

DATA work.citi;

SET SAShelp.Citimon;

RUN;

/* Creating another file with smaller number of observations: */

DATA work.citi_sh;

SET work.citi (keep = date FSPCOM LHUR EEC EXVUS);

IF FSPCOM ne .;

IF LHUR ne .;

IF EEC ne .;

IF EXVUS ne .;

RUN;

/**** Let’s create a lagged value for FSPCOM: 2-step procedure: ****/

/* 1) Sorting data by date: */

proc sort data=work.citi_sh;

by date;

run;

/* 2) Creating a lagged value using a command ‘LAG(…)’: */

data work.citi_sh;

set work.citi_sh;

lag_FSPCOM=LAG(FSPCOM);

run;

/**** Let’s create a lead value for FSPCOM: 2-step procedure: ****/

/* 1) Sorting data by date but in descending order: */

proc sort data=work.citi_sh;

by descending date;

run;

/* 2) Creating a lead value using a command ‘LAG(…)’: */

data work.citi_sh;

set work.citi_sh;

lead_FSPCOM=LAG(FSPCOM);

run;

/* Let’s sort the data in the original order (i.e., by date): */

proc sort data=work.citi_sh;

by date;

run;

3 Procedure CORR

SAS procedure CORR generates correlation coe�cients. The procedure allows to

display correlation matrix in the HTML window as well as to save them as a separate

�le. Let’s consider example below.

/* Let’s find the correlation matrix: */

PROC CORR DATA=work.citi_sh;

RUN;

/* Let’s create a new file with the correlation coeficients of */

/* lead_FSPCOM with the rest of variables (the command ‘OUTP=’ */

/* saves Pearson correlation coefficients as a new file): */

PROC CORR DATA=work.citi_sh

OUTP=work.corr_citi_sh;

WITH lead_FSPCOM;

RUN;

/* Removing redundant variables */

data work.corr_citi_sh;

set work.corr_citi_sh;

if _NAME_=”lead_FSPCOM”;

drop _NAME_;

run;

/* Transposing the data in order we could sort the correlation */

/* coefficients if needed */

proc transpose data=work.corr_citi_sh

out=work.corr_citi_sh2;

run;

/* Renaming the variable */

data work.corr_citi_sh2;

set work.corr_citi_sh2;

rename col1=corr;

run;

4 Various plots (in the lab and at home)

For this task, you should familiarize yourself with key plots available on SAS. Below,

I provide the necessary code.

/* Generating a new variable (return on index) */

data work.citi_sh;

set work.citi_sh;

return=FSPCOM/lag_FSPCOM-1;

run;

/* Option PLOTS generates stem-and-leaf, box and qq plots of the variable(s) */

proc univariate data=work.citi_sh PLOTS;

var return;

RUN;

/* Option HISTOGRAM generates histogram */

proc univariate data=work.citi_sh;

var return;

HISTOGRAM / NORMAL (COLOR=RED);

RUN;

/* Creating data file: */

DATA work.car_data;

SET SAShelp.Cars;

RUN;

/* Vertical bar chart */