####################################################################################
## PROBLEM 1: Gradient Descent
## Gradient descent is a popular optimization technique for solving many
## machine learning problems. Here we use gradient descent to fit a line
## to a given set of 2-D points.
## ref: https://tinyurl.com/yc4jbjzs
## ref: https://spin.atomicobject.com/2014/06/24/gradient-descent-linear-regression/
##
##
## input: 2-D points in ./data/1_points.csv
##        a function for reading the points is provided
##
##
## your task: fill in the following functions:
##        evaluate_cost
##        evaluate_gradient
##        update_params
## NOTE: do NOT change the values of 'init_params' and 'max_iterations' in the optimizer
##
##
## output: cost after convergence (mean squared error, lower is better)
##
##
## NOTE: all required modules are imported. DO NOT import new modules.
## NOTE: references are given inline
## tested on Ubuntu14.04, 22Oct2017, Abhilash Srikantha
####################################################################################
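
# Recap of the gradient descent scheme implemented below:
#   model:    y = m*x + b
#   cost:     J(m,b) = (1/N) * sum_i (y_i - (m*x_i + b))^2
#   gradient: dJ/dm  = -(2/N) * sum_i x_i * (y_i - (m*x_i + b))
#             dJ/db  = -(2/N) * sum_i       (y_i - (m*x_i + b))
#   update:   m <- m - alpha * dJ/dm ,  b <- b - alpha * dJ/db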

import numpy as np
import matplotlib.pyplot as plt
import time

def load_data(fname):
    points = np.loadtxt(fname, delimiter=',')
    y_ = points[:,1]
    # append a column of ones to account for the intercept
    x_ = np.ones([len(y_),2])
    x_[:,0] = points[:,0]
    # display plot
    #plt.plot(x_[:,0], y_, 'ro')
    #plt.xlabel('x-axis')
    #plt.ylabel('y-axis')
    #plt.show()
    print('data loaded. x:{} y:{}'.format(x_.shape, y_.shape))
    return x_, y_

def evaluate_cost(x_,y_,params):
    # mean squared error of the line y = params[0]*x + params[1]
    tempcost = 0
    for i in range(len(y_)):
        tempcost += (y_[i] - ((params[0] * x_[i,0]) + params[1])) ** 2
    return tempcost / float(len(y_))

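# A vectorized equivalent of evaluate_cost, useful for cross-checking (a sketch;
# not required by the assignment, and it uses only the already-imported numpy):
def evaluate_cost_vectorized(x_, y_, params):
    residuals = y_ - (params[0] * x_[:,0] + params[1])  # per-point errors
    return np.mean(residuals ** 2)                      # mean squared error
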
def evaluate_gradient(x_,y_,params):
    # partial derivatives of the mean squared error w.r.t. slope m and intercept b
    m_gradient = 0
    b_gradient = 0
    N = float(len(y_))
    for i in range(len(y_)):
        m_gradient += -(2/N) * (x_[i,0] * (y_[i] - ((params[0] * x_[i,0]) + params[1])))
        b_gradient += -(2/N) * (y_[i] - ((params[0] * x_[i,0]) + params[1]))
    return [m_gradient,b_gradient]

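# Optional sanity check (an illustrative sketch, not part of the assignment):
# approximate each partial derivative with central finite differences on
# evaluate_cost and compare against the analytic gradient above. Handy when
# debugging a hand-derived gradient.
def check_gradient(x_, y_, params, h=1e-6):
    grad = evaluate_gradient(x_, y_, params)
    for k in range(2):
        bumped_up = list(params); bumped_up[k] += h
        bumped_dn = list(params); bumped_dn[k] -= h
        numeric = (evaluate_cost(x_, y_, bumped_up) -
                   evaluate_cost(x_, y_, bumped_dn)) / (2 * h)
        print('param {}: analytic {:.6f} numeric {:.6f}'.format(k, grad[k], numeric))
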
def update_params(old_params, grad, alpha):
    # take one gradient descent step with learning rate alpha
    new_m = old_params[0] - (alpha * grad[0])
    new_b = old_params[1] - (alpha * grad[1])
    return [new_m,new_b]

# initialize the optimizer
optimizer = {'init_params':np.array([4.5,2.0]),  # starting [m, b]
             'max_iterations':10000,             # upper bound on iterations
             'alpha':0.69908,                    # learning rate
             'eps':0.0000001,                    # convergence tolerance on the cost
             'inf':1e10}                         # fallback cost on failure

# load data
x_, y_ = load_data("./data/1_points.csv")

# time stamp
start = time.time()

try:
    # gradient descent
    params = optimizer['init_params']
    old_cost = 1e10
    for iter_ in range(optimizer['max_iterations']):
        # evaluate cost and gradient
        cost = evaluate_cost(x_,y_,params)
        grad = evaluate_gradient(x_,y_,params)
        # display progress every 10 iterations
        if(iter_ % 10 == 0):
            print('iter: {} cost: {} params: {}'.format(iter_, cost, params))
        # check convergence: stop once the cost changes by less than eps
        if(abs(old_cost - cost) < optimizer['eps']):
            break
        # update parameters
        params = update_params(params,grad,optimizer['alpha'])
        old_cost = cost
except Exception:
    # numerical blow-up (e.g. a diverging step) falls back to the sentinel cost
    cost = optimizer['inf']

# final output
print('time elapsed: {}'.format(time.time() - start))
print('cost at convergence: {} (lower is better)'.format(cost))
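
# Optional cross-check (an illustrative sketch, not part of the required output):
# the closed-form least-squares solution gives the parameters gradient descent
# should converge to. Since x_ already carries the intercept column, solving
# x_ @ [m, b] ~= y_ with np.linalg.lstsq yields [m, b] directly.
ls_params, _, _, _ = np.linalg.lstsq(x_, y_, rcond=None)
print('closed-form least-squares params (for comparison): {}'.format(ls_params))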