sedang Berlangsung

Write some Software


two data points x and y whose attributes are all numeric, the Euclidean distance kx − yk is a popular

measurement. To be consistent with the textbook, we assume a vector x by default is a row vector.

However, Euclidean distance is not effective if

• the ranges (scales) of attributes are different, or

• there exist correlations among attributes.

Mahalanobis distance has been introduced to address the above problems. Given a dataset whose

attributes are all numeric, we first construct the the covariance matrix Σ = (si,j )n×n, where an element

si,j is the covariance of i-th and j-th attributes, and n is the number of attributes. The Mahalanobis

distance of x and y is then defined as

M ahalanobis(x, y) = (x − y)Σ−(x − y)



where Σ

− is the inverse of the matrix Σ, and (x − y)


is the transpose of the row vector x − y.

In the part, you will first implement the above two distance measurements, and then give an experimental

evaluation of these measurements. Usually, evaluation is performed in a specific data mining


tasks, such as classification, clustering. Since we currently have not yet got into these topics, we will

simply check the consistency between two measurements based on random datasets. Specifically, you

will need to

1. generate a random dataset of two-attribute instances in Gaussian distribution

2. build the covariance matrix Σ

3. set counter = 0

4. for each instance,

(a) Compute the nearest neighbor using Euclidean distance

(b) Compute the nearest neighbor using Mahalanobis distance

(c) if the nearest neighbors are same, count++

5. output the consistency ratio (count / number-of-instances)

You will need to read the documentation of the packages [url removed, login to view] and [url removed, login to view], and

typically the class Matrix to complete this part. To generate data in Gaussian distribution, you may

refer to the standard Java class Random, and use the Java method nextGaussian().

Keahlian Pengumpulan Data, Java

Lihat lebih lanjut: software write mq4, software write chip epson, useful software write book, software write web specs, simply check booking php, software write edid, software write websites idea, software write book images, software write books, software write protection, free software write book, software write book, software write technical manual

Tentang Pemberi kerja:
( 9 ulasan ) Bowling green, United States

ID Proyek: #11729403

Diberikan kepada:


Simple project for me to complete. ................................................................ ......................................................................... ....................................

$40 USD dalam 1 hari
(74 Ulasan)

5 freelancer menawar pada rata-rata $44 untuk pekerjaan ini


Hello. Employer. I have read and understood the project. I'm an Expert in Data Structures and Algorithms. And I know well ; Java ,C/C++, Python ,PHP. I'm interested this project. So, firstly I want to discuss Lagi

$55 USD dalam 1 hari
(27 Ulasan)

Hi, I have worked with WEKA dataset (ARFF) processing in java using weka.jar library and can help you complete this assignment at my reduced price of $60 USD. With Regards, Koustav

$60 USD dalam 1 hari
(19 Ulasan)

Hello,We are a team of developers and do all work related to computer science( Software Engineering ),. We have experience in , JAVA,C# ,My SQL , website Design( wordpress ,PHP), Excel, ( vb ,(macro)), Research( biolog Lagi

$40 USD dalam 1 hari
(0 Ulasan)

i can do it for you and i will use java as the programming languagr to execute this mission

$25 USD dalam 2 hari
(0 Ulasan)