Cubic Regression (best Fit Line) In Javascript
Solution 1:
Here's a real, working bit of code to solve that cubic using the numeric.js library's uncmin
unconstrained minimiser as a least squares problem (jsbin here):
var data_x = [2,5,7,12,20,32,50];
var data_y = [5,10,15,20,25,30,35];
var cubic = function(params,x) {
return params[0] * x*x*x +
params[1] * x*x +
params[2] * x +
params[3];
};
var objective = function(params) {
var total = 0.0;
for(var i=0; i < data_x.length; ++i) {
var resultThisDatum = cubic(params, data_x[i]);
var delta = resultThisDatum - data_y[i];
total += (delta*delta);
}
return total;
};
var initial = [1,1,1,1];
var minimiser = numeric.uncmin(objective,initial);
console.log("initial:");
for(var j=0; j<initial.length; ++j) {
console.log(initial[j]);
}
console.log("minimiser:");
for(var j=0; j<minimiser.solution.length; ++j) {
console.log(minimiser.solution[j]);
}
I get the results:
0.0005750849851827991
-0.05886106462847641
2.1839575020602164
1.1276055079334206
To explain: we have a function 'cubic', which evaluates the general cubic function for a set of parameters params
and a value x
. This function is wrapped to create the objective function, which takes a set of params and runs each x value from our data set through the target function and calculates the sum of the squares. This function is passed to uncmin
from numeric.js with a set of initial values; uncmin
does the hard work and returns an object whose solution
property contains the optimised parameter set.
To do this without the global variables (naughty!), you can have an objective function factory thus:
var makeObjective = function(targetFunc,xlist,ylist) {
var objective = function(params) {
var total = 0.0;
for(var i=0; i < xlist.length; ++i) {
var resultThisDatum = targetFunc(params, xlist[i]);
var delta = resultThisDatum - ylist[i];
total += (delta*delta);
}
return total;
};
return objective;
};
Which you can use to manufacture objective functions:
var objective = makeObjective(cubic, data_x, data_y); // then carry on as before
Knowing how to do this practically would be of great help to a lot of people, so I'm glad this has come up.
Edit: Clarification on cubic
var cubic = function(params,x) {
returnparams[0] * x*x*x +
params[1] * x*x +
params[2] * x +
params[3];
};
Cubic is being defined as a function which takes an array of parameters params
and a value x
. Given params
, we can define a function f(x)
. For a cubic, that is f(x) = a x^3 + b x^2 + c x + d
so there are 4 parameters ([0]
to [3]
), and given those 4 param values we have a single function f(x)
with 1 input x
.
The code is structured to allow you to replace cubic
with another function of the same structure; it could be linear
with 2 parameters:
var linear = function(params, x) {
returnparams[0]*x + params[1];
};
The rest of the code will look at the length of params
in order to know how many parameters need modifying.
Note that this whole piece of code is trying to find the set of parameter values which produce a curve which best fits all the data; if you wanted to find a fit for the last 4 points of some data, you would pass only those values in data_x
and data_y
.
Solution 2:
I'd formulate this as a least squares problem. Let M be the n×4 matrix formed like this:
x_1^3 x_1^2 x_1 1
x_2^3 x_2^2 x_2 1
⋮ ⋮ ⋮
x_n^3 x_n^2 x_n 1
Then compute the 4×4 matrix A=M⋅M and the 4×1 column vector b=M⋅y and solve the linear system of equations Aξ=b. The resulting vector ξ will contain your coefficients a through d.
The above description makes it easy to understand what is going on, mathematically. For implementation, particularly for very large n, the above approach might however be infeasible. In those cases, you can build A and b directly, without explicitely constructing M. For example, A1,2=sum(x_i^3 * x_i^2 for all i)
. So you can iterate over all i and add the corresponding values to the corresponding matrix and vector entries.
Post a Comment for "Cubic Regression (best Fit Line) In Javascript"