程序代写代做代考 graph The University of Sydney School of Mathematics and Statistics

The University of Sydney School of Mathematics and Statistics
Computer Project
MATH2070/2970: Optimisation and Financial Mathematics
Web Page: http://www.maths.usyd.edu.au/u/IM/MATH2070/
Lecturer: Chunxi Jiao, Desmond Ng and Nicholas James (Computer Lab Designer)
Semester 2, 2020
Due on 11.59pm Friday 20th November via Turnitin.
Late assignments are not accepted without prior arrangement well before the deadline!
You must attach a scanned copy of the signed cover-sheet to the front of your assignment (see over)!
This is mostly a computational project so you must submit all computer programs with your project formulations, descriptions and outputs. Assessment will be based on: accuracy, programming and presentation.
MATH2070: Do all questions except Questions x for x ≥ 7. MATH2970: Do all questions.
This assignment involves analyzing real country index market data downloaded from Bloomberg. The file ‘Country Indices.xlsx’ which you can download from Ed Workpaces, contains the daily closing prices of equity indices of 20 countries in the worksheet named ‘Bloomberg Values’. Only use the data contained in this worksheet. We draw your attention to the following points relating to the data:
(a) Prices are recorded on a (business)-daily basis between 3/01/2000 – 8/10/2020 where dates are formatted using the DD/MM/YYYY convention.
(b) The price of each index is denominated in that country’s currency. (c) This price data includes five (5) periods of interest:
(i) The Global Financial Crisis (GFC). We roughly identify the GFC in this data as the period from 03/01/2007 – 31/05/2010, with the peak of the GFC (GFC Peak) as the period from 02/09/2008 – 01/06/2009.
(ii) The ongoing COVID-19 global pandemic (COVID-19) which began on 11/03/2020 according to the World Health Organization. Since the pandemic is ongoing we shall use the 31/08/2020 as the data cutoff point. Thus, for this assignment we shall identify COVID-19 in this data as the period from 11/03/2020 – 31/08/2020, with the peak of the COVID-19 (COVID-19 Peak) as the period from 11/03/2020 – 29/05/2020.
(iii) An interim market period (INTERIM) between post-GFC and pre-COVID-19. We shall identify this period in the data as as the period from 01/06/2010 – 10/03/2020.
There are two further items to note with this data:
(1) There is no data for the China equity index ‘SHSZ300’ prior to 04/01/2002.
(2) There is no data for the Mexico equity index ‘FTBIVA’ prior to 19/09/2003.
Both (1) and (2) do not affect this assignment since they are outside our scope of interest i.e., (c)(i) – (c)(iii).
Covariance and Correlation
1. Import the data into ipython as shown in computer labs. This question will investigate the correlation between the return rates of the country indices over each of the five periods of
Copyright ⃝c 2020 The University of Sydney 1

interest i.e., GFC, GFC Peak, INTERIM, COVID-19 and COVID-19 Peak. There are several choices when analyzing return rate data. A commonly used variable is the logarithmic change of price or the so called log return rate: Let Yti be the price at time t of the i-th country’s index for i = 1, 2, . . . , 20, then consider the log return rate (w.r.t. the natural base) given by
ξi=logYi−logYi ,fort≥1. t t t−1
(i) Justify the use of the log return rate. What are the advantages of using it?
(ii) Due to the different currency denominations pricing each country’s index, in order to ana- lyze across indices we must start each index from a common starting point. Thus, for the entire period of interest i.e., 03/01/2007 – 31/08/2020 rebase each country’s index to start at 100. Then plot on a single graph each country’s rebased index values for the entire period of interest. Use the following colours when plotting each period of interest: GFC (Blue), GFC Peak (Yellow), INTERIM (Green), COVID-19 (Orange), COVID-19 Peak (Red).
Where periods overlap, plot showing the most number of colours. For e.g., since GFC Peak sits within GFC the plot colours would be, Blue – Yellow – Blue.
(iii) Construct and visualize the correlation matrices for the five periods GFC, GFC Peak, INTERIM, COVID-19 and COVID-19 Peak. Comment on how the correlations between country indices change over these five periods.
(iv) Plot the histogram of the correlation coefficients ρij for 1 ≤ i, j ≤ 20 for the five periods GFC, GFC Peak, INTERIM, COVID-19 and COVID-19 Peak. Comment on your results.
Portfolio Theory In this question using your results from Question 1. you will construct the optimal portfolio P∗, the Minimum Variance Frontier (MVF), the Efficient Frontier (EF) and consider types of investors during the different market periods.
2. For the INTERIM period only i.e., from 01/06/2010 – 10/03/2020, carry out the following computational tasks to compute the optimal portfolio P∗, consisting of the 20 country indices for an agent who wishes to invest $1,000,000 with a risk-aversion coefficient of t = 0.20.
3. (i)
(i) (ii)
Compute the dollar amount invested in each country’s index and obtain the corresponding expected return, μ∗ and risk, σ∗ of the optimal portfolio, P∗.
Compute the μσ-plane graphical representation and include the following, all on the same graphical plot:
(a) All 20 country indices; and
(b) The Minimum Variance Frontier (MVF) and Efficient Frontier (EF). When displaying
your plot, use a t-range of |t| ≤ 0.40 for displaying your plot; and
(c) Generate a plot of n = 1000 random feasible portfolios with individual country in- dex weights satisfying |xni| ≤ 20 (for each of the i = 1, 2, …, 20 country indices) and
σn ≤ 0.10 for n = 1,2,…,1000.
You might notice that the random points occupy some region well-separated from the Minimum Variance Frontier (MVF) – comment on this observation and explain why this occurs. An explanation for this observation is a/the major part of the question; and
(d) Plot the indifference curve of an investor with t = 0.20 and their optimal portfolio P∗.
For the INTERIM period from 01/06/2010 – 10/03/2020, determine which investors short sell in a global investment market consisting only of the 20 country indices and which indices they short sell. Are there any country indices which no investors short sell or which all investors will short sell?
2

(ii) Repeat question 3(i) for the COVID-19 period from 11/03/2020 – 31/08/2020. Offer an explanation for any differences observed.
For questions 4. – 6. consider only the INTERIM period from 01/06/2010 – 10/03/2020.
4. Adding a Riskless Cash Fund and Constructing the Market Portfolio: Consider the position of an investor from the United States (US) investing in each country’s index. In addition to the country indices available for investment, a US riskless cash fund P0 is also available to this investor. The risk free interest rate on P0 was r0 = 0.05 before the GFC and was lowered to r0 = 0.0025 in December 2008, for both lending and borrowing. Assume that r0 = 0.0025 remains constant over the INTERIM period.
(i) Obtain the investor’s new allocation of their investment to the 21 available assets i.e., the 20 country indices plus a riskless cash fund, P0. State clearly the investor’s position in the riskless cash fund.
(ii) Describe in detail the Capital Market Line and the tangency portfolio. What can you say about the tangency portfolio? Explain your result.
(iii) Using the Gross Domestic Product (GDP) of each country devise a method of computing the market portfolio. Clearly describe and explain your methodology including the data sources used.
Note: It is strongly suggested that data sources from reputable public institutions and organisations be used to ensure the accuracy and correctness of results. GDP data is a key economic figure for nation states and is publicly available and widely reported. Suggested data sources include the IMF, World Bank, The Economist, OECD etc.
Capital Market Theory
5. The Capital Market Line: Generate a new μσ-plane graphical plot showing the riskless cash fund P0, tangency portfolio, market portfolio and the Capital Market Line relative to the risky EF. Calculate the investor’s new optimal portfolio. If the original indices have a net worth of $100 million, estimate (to the nearest $0.1 million) the total value of each index.
6. The Security Market Line: Compute the β’s of all 20 indices and any other relevant assets in this project and clearly display them on the Security Market Line. Comment on the result identifying the country indices with a β > 1 and those with a β < 1 and describe what action Portfolio Theory would recommend an investor to take. Principal Components Analysis (PCA) Often when dealing with very large data sets i.e., data sets with high dimensionality and many ober- vations, we want to know what are the underlying factors which produced that data in order to build predictive models to predict future observations. Furthermore, we want to know which subset of the underlying factors are responsible for the majority of the variation that we see in the data. This is so that, instead of building a model with many variables or factors accounting for every observation, we seek to reduce the dimensionality of the data to a subset of underlying factors and build models using those factors which explain the essence or most of the observations that we see. This is what we aim to do in this section with Principal Components Analysis or PCA, where you will implement PCA from scratch i.e., without using built-in function calls. In practical applications the singular value decomposition is the main tool for performing PCA given by the following theorem: 3 The Singular Value Decomposition (SVD) Let A be an m×n matrix with rank r. Then, there exists an m×n matrix Σ given by Σ=􏰀D 0􏰁, 00 where D is an r × r matrix with r ≤ min{m,n} (if r = m or n or both, some or all of the zero matrices do not appear) for which the diagonal entries in D are the first r singular values of A, σ1 ≥ σ2 ≥ ... ≥ σr > 0, and there exist an m×m orthogonal matrix U and an n×n orthogonal matrix V such that
A = UΣV ⊤.
The singular values of A are the square roots of the eigenvalues of A⊤A denoted by σ1,σ2,…,σn, arrangedindecreasingorder. Thatis,σi =√λi withσ1 ≥σ2 ≥…≥σn >0for1≤i≤n.
7.
(i) Prove the following theorem:
Suppose {v1,v2,…,vn} is an orthonormal basis of Rn consisting of eigenvectors of A⊤A, arranged so that the corresponding eigenvalues of A⊤A satisfy λ1 ≥ λ2 ≥ … ≥ λn, and suppose A has r nonzero singular values. Then {Av1,Av2,…,Avr} is an orthogonal basis for the column space, Col A and rank A = r.
(ii) Prove the singular value decomposition using the previous result.
Suppose now that you are given an N × t matrix of observational data [X1 X2 … Xt]
where each Xk is a N ×1 observation vector for k = 1,2,…,t. To prepare the data for PCA we first put it in mean-deviation form i.e., our data should have a sample mean of zero. Let the sample mean
be given by
and for k = 1,2,…,t let
X ̄ = 1 ( X 1 + X 2 + . . . + X t ) t
Xˆ k = X k − X ̄ . Then the following N × t matrix G given by
G=[Xˆ1 Xˆ2 … Xˆt],
has columns of observations in mean-deviation form. Lastly, define the N × N sample covariance
matrix by
S= 1 GG⊤. t−1
Let X be a vector varying over the set of observation vectors and denote the coordinates of X by x1,x2,…,xN. Then, the goal of PCA is to find an orthogonal N × N matrix P = [u1 u2 … uN] that determines a change of variable, X = P Y, or
 x1   y1   x2   y2 
 . =[u1 u2 … uN] . 
xN  yN 
with the property that the new variables y1, y2, . . . , yN are uncorrelated and are arranged in order of
decreasing variance. The unit eigenvectors u1, u2, . . . , uN of the covariance matrix S are called the 4

principal components of the data matrix of observations. The new uncorrelated variables y1, y2, . . . , yN can be determined using
X = PY ⇒ Y = P−1X = P⊤X.
You will now apply PCA to the five periods: GFC, GFC Peak, INTERIM, COVID-19 and COVID-19
Peak.
8. (i)
Apply PCA to the country indices data for each of the five periods by setting
A=√1 G⊤ t−1
and performing SVD without using a direct SVD function call. Show the full steps, code, any workings and outputs and present the following for each of the five periods:
• The sample covariance matrix S.
• The N = 20 eigenvalues of S.
• The principal components of the data.
Plot the ordered spectrum λi for i = 1, 2, . . . , 20 for the five periods. What observations can you make about how much of the total variance is explained by a subset of the principal components for each period?
Determine the minimum number of principal components needed to explain at least 95% of the variation in the data observed for the five periods. Offer an explanation for the differences (if any), in the minimum number of prinicpal components required over each period. What conclusions can you make about periods of crisis i.e., the GFC and COVID-19 periods compared to periods of non-crisis i.e., INTERIM?
Compare the percentage of variation that the largest principal component accounts for, during the five periods. Do countries behave more similarly during market crises or stable market conditions? Do you think this would be more or less pronounced if we were to look at portfolios of equities? Refer to systematic and unsystematic risks in your response.
During which market crisis would you have seen more benefits in portfolio diversification? Justify your answer by referring to the principal components seen in the different crisis periods.
(ii)
(iii)
(iv)
(v)
5

The University of Sydney School of Mathematics and Statistics
Assignment Cover Sheet
MATH2070/2970: Optimisation and Financial Mathematics
Web Page: http://www.maths.usyd.edu.au/u/IM/MATH2070/
Lecturer: Chunxi Jiao, Desmond Ng and Nicholas James (Computer Lab Designer)
Semester 2, 2020
Family Name …………………………………………………………………………
Given Names ……………………………….. SID ………………………………..
Legitimate cooperation between students on assignments is encouraged, since it can be a real aid to understanding. It is legitimate for students to discuss assignment questions at a general level, provided everybody involved makes some contribution. However, students must produce their own individual written solutions. Copying someone else’s work is plagiarism, and is unacceptable.
I certify that:
• I have read and understood the University of Sydney Student Plagiarism: Coursework Policy and Procedure at
http://sydney.edu.au/policies/showdoc.aspx?recnum=PDOC2012/254&RendNum=0.
• this assignment is all my own work, and that no part of this assignment has been copied from
another person.
• I have not allowed my work to be copied by another person.
Signature …………………………………. Date …………………………………. The University may impose severe penalties for plagiarism
This part to be completed by the marker: Grand total out of 40 ……………………………..
6