Squeezing function defined over infinite range down to finite












3












$begingroup$


Abstract



What I'm generally trying to do is to squeeze the first derivative of the sigmoid function defined over infinite range into finite. I.e.



$$f(x)=frac{1}{1+e^{-x}}$$



                                       Sigmoid function



This function is heavily used as an activation function for neural networks. It's integral has close relation to another relu activation function which is also heavily used with neural networks:



$$int{f(x)}{dx}=ln(1+e^{-x})-ln(e^{-x})$$



                                       Relu



It's derivative is useful in modeling and is similar to normal distribution pdf function:



$$frac{d}{dx}f(x)=frac{e^{-x}}{(1+e^{-x})^{2}}=df(x)$$



                                       df



It is also simple to evaluate for computer, it's derivative is expressed in terms of function itself. But $df(x)$ is defined over infinite range which makes it unsuitable for some modeling tasks where the range should be finite. Taking into account the fact that the most of the values close to zero are located outside [-10..10] range, this is shown by the following equation:



$$int_{-infty}^{infty}{df(x)}{dx}-int_{-10}^{10}{df(x)}{dx}=1-frac{e^{10}-e^{-10}}{(1+e^{10})(1+e^{-10})}=0.00009079573740486882...$$



That means that all of the values outside [-10..10] would introduce minor effect to the shape of function when it's range is squeezed from infinite to finite.



So how do we squeeze the range? It can be done through the use of $arctanh(x)$:



                                       arctanh(x)



That goes as an argument to $df$ function. So when approaching +/-1 arctanh would asymptotically approach +/-$infty$ thus when used as an argument it will map the argument range from infinite to finite:



$$df(arctanh(x))=frac{sqrt{1-x^2}}{(1+frac{sqrt{1-x^2}}{x+1})^2(x+1)}$$



                                       df(arctanh(x))



while the function doesn't even look similar to what was originally intended it is easy to get the shape close through a factor before arctanh(x):



$$df(arctanh(x)*6)$$



yields to:



                                       df(6 arctanh(x))



thus getting the final equation equal to:



$$df(n*arctanh(x))=frac{(1-x^2)^{frac n2}}{(1+frac{(1-x^{2})^{frac n2}}{(x+1)^n})^2(x+1)^n}=F(x,n).$$



It's interesting to see how $4*F(x,pi)$ is similar to $frac{sin(xpi+frac{pi}{2})}{2}+frac 12$:



                                       enter image description here



Which could essentially serve as yet another way to approximate sine.



Problem



While playing with $F(x,n)$ I've noticed that it's hard to find values of $n$ that suffice some condition, for instance I want to find $n$ such that it best approximates $frac{sin(xpi+frac{pi}{2})}{2}+frac 12$. Let's define an error function of an approximation:



$$err(x, n) = (4·df(n·arctanh(x))-frac 12·sin(xpi+frac{pi}{2})-frac 12)^2$$



or



$$err(x, n) = (frac{(1-x^2)^{frac n2}}{(1+frac{(1-x^{2})^{frac n2}}{(x+1)^n})^2(x+1)^n}-(frac{sin(xpi+frac{pi}{2})}{2}+frac 12))^2.$$



                                       err(x, n)



Then we would need to find $n$ such that:



$$frac{d}{dn}int_{-1}^{1}{err(x, n)}{dx}=0.$$



Please note that $n$ is real here



This is where things start getting messy, an integral over $F(x, n)$ doesn't seem to have an analytical (closed) form nor it is clear how to approach differentiation of such integral due to different integration and differentiation variables. I don't understand how to differentiate such anintegral, so thought it would be better to ask here?



Thank you in advance!










share|cite|improve this question











$endgroup$








  • 1




    $begingroup$
    Differentiating with respect to $n$? Is $n$ a real number or natural number here?
    $endgroup$
    – LoveTooNap29
    Jan 2 at 17:56










  • $begingroup$
    $n$ is real number @LoveTooNap29
    $endgroup$
    – Lu4
    Jan 2 at 17:57










  • $begingroup$
    FWIW, derivatives under integral signs can be calculated using the Leibniz integral rule
    $endgroup$
    – David M.
    Jan 3 at 1:41


















3












$begingroup$


Abstract



What I'm generally trying to do is to squeeze the first derivative of the sigmoid function defined over infinite range into finite. I.e.



$$f(x)=frac{1}{1+e^{-x}}$$



                                       Sigmoid function



This function is heavily used as an activation function for neural networks. It's integral has close relation to another relu activation function which is also heavily used with neural networks:



$$int{f(x)}{dx}=ln(1+e^{-x})-ln(e^{-x})$$



                                       Relu



It's derivative is useful in modeling and is similar to normal distribution pdf function:



$$frac{d}{dx}f(x)=frac{e^{-x}}{(1+e^{-x})^{2}}=df(x)$$



                                       df



It is also simple to evaluate for computer, it's derivative is expressed in terms of function itself. But $df(x)$ is defined over infinite range which makes it unsuitable for some modeling tasks where the range should be finite. Taking into account the fact that the most of the values close to zero are located outside [-10..10] range, this is shown by the following equation:



$$int_{-infty}^{infty}{df(x)}{dx}-int_{-10}^{10}{df(x)}{dx}=1-frac{e^{10}-e^{-10}}{(1+e^{10})(1+e^{-10})}=0.00009079573740486882...$$



That means that all of the values outside [-10..10] would introduce minor effect to the shape of function when it's range is squeezed from infinite to finite.



So how do we squeeze the range? It can be done through the use of $arctanh(x)$:



                                       arctanh(x)



That goes as an argument to $df$ function. So when approaching +/-1 arctanh would asymptotically approach +/-$infty$ thus when used as an argument it will map the argument range from infinite to finite:



$$df(arctanh(x))=frac{sqrt{1-x^2}}{(1+frac{sqrt{1-x^2}}{x+1})^2(x+1)}$$



                                       df(arctanh(x))



while the function doesn't even look similar to what was originally intended it is easy to get the shape close through a factor before arctanh(x):



$$df(arctanh(x)*6)$$



yields to:



                                       df(6 arctanh(x))



thus getting the final equation equal to:



$$df(n*arctanh(x))=frac{(1-x^2)^{frac n2}}{(1+frac{(1-x^{2})^{frac n2}}{(x+1)^n})^2(x+1)^n}=F(x,n).$$



It's interesting to see how $4*F(x,pi)$ is similar to $frac{sin(xpi+frac{pi}{2})}{2}+frac 12$:



                                       enter image description here



Which could essentially serve as yet another way to approximate sine.



Problem



While playing with $F(x,n)$ I've noticed that it's hard to find values of $n$ that suffice some condition, for instance I want to find $n$ such that it best approximates $frac{sin(xpi+frac{pi}{2})}{2}+frac 12$. Let's define an error function of an approximation:



$$err(x, n) = (4·df(n·arctanh(x))-frac 12·sin(xpi+frac{pi}{2})-frac 12)^2$$



or



$$err(x, n) = (frac{(1-x^2)^{frac n2}}{(1+frac{(1-x^{2})^{frac n2}}{(x+1)^n})^2(x+1)^n}-(frac{sin(xpi+frac{pi}{2})}{2}+frac 12))^2.$$



                                       err(x, n)



Then we would need to find $n$ such that:



$$frac{d}{dn}int_{-1}^{1}{err(x, n)}{dx}=0.$$



Please note that $n$ is real here



This is where things start getting messy, an integral over $F(x, n)$ doesn't seem to have an analytical (closed) form nor it is clear how to approach differentiation of such integral due to different integration and differentiation variables. I don't understand how to differentiate such anintegral, so thought it would be better to ask here?



Thank you in advance!










share|cite|improve this question











$endgroup$








  • 1




    $begingroup$
    Differentiating with respect to $n$? Is $n$ a real number or natural number here?
    $endgroup$
    – LoveTooNap29
    Jan 2 at 17:56










  • $begingroup$
    $n$ is real number @LoveTooNap29
    $endgroup$
    – Lu4
    Jan 2 at 17:57










  • $begingroup$
    FWIW, derivatives under integral signs can be calculated using the Leibniz integral rule
    $endgroup$
    – David M.
    Jan 3 at 1:41
















3












3








3


0



$begingroup$


Abstract



What I'm generally trying to do is to squeeze the first derivative of the sigmoid function defined over infinite range into finite. I.e.



$$f(x)=frac{1}{1+e^{-x}}$$



                                       Sigmoid function



This function is heavily used as an activation function for neural networks. It's integral has close relation to another relu activation function which is also heavily used with neural networks:



$$int{f(x)}{dx}=ln(1+e^{-x})-ln(e^{-x})$$



                                       Relu



It's derivative is useful in modeling and is similar to normal distribution pdf function:



$$frac{d}{dx}f(x)=frac{e^{-x}}{(1+e^{-x})^{2}}=df(x)$$



                                       df



It is also simple to evaluate for computer, it's derivative is expressed in terms of function itself. But $df(x)$ is defined over infinite range which makes it unsuitable for some modeling tasks where the range should be finite. Taking into account the fact that the most of the values close to zero are located outside [-10..10] range, this is shown by the following equation:



$$int_{-infty}^{infty}{df(x)}{dx}-int_{-10}^{10}{df(x)}{dx}=1-frac{e^{10}-e^{-10}}{(1+e^{10})(1+e^{-10})}=0.00009079573740486882...$$



That means that all of the values outside [-10..10] would introduce minor effect to the shape of function when it's range is squeezed from infinite to finite.



So how do we squeeze the range? It can be done through the use of $arctanh(x)$:



                                       arctanh(x)



That goes as an argument to $df$ function. So when approaching +/-1 arctanh would asymptotically approach +/-$infty$ thus when used as an argument it will map the argument range from infinite to finite:



$$df(arctanh(x))=frac{sqrt{1-x^2}}{(1+frac{sqrt{1-x^2}}{x+1})^2(x+1)}$$



                                       df(arctanh(x))



while the function doesn't even look similar to what was originally intended it is easy to get the shape close through a factor before arctanh(x):



$$df(arctanh(x)*6)$$



yields to:



                                       df(6 arctanh(x))



thus getting the final equation equal to:



$$df(n*arctanh(x))=frac{(1-x^2)^{frac n2}}{(1+frac{(1-x^{2})^{frac n2}}{(x+1)^n})^2(x+1)^n}=F(x,n).$$



It's interesting to see how $4*F(x,pi)$ is similar to $frac{sin(xpi+frac{pi}{2})}{2}+frac 12$:



                                       enter image description here



Which could essentially serve as yet another way to approximate sine.



Problem



While playing with $F(x,n)$ I've noticed that it's hard to find values of $n$ that suffice some condition, for instance I want to find $n$ such that it best approximates $frac{sin(xpi+frac{pi}{2})}{2}+frac 12$. Let's define an error function of an approximation:



$$err(x, n) = (4·df(n·arctanh(x))-frac 12·sin(xpi+frac{pi}{2})-frac 12)^2$$



or



$$err(x, n) = (frac{(1-x^2)^{frac n2}}{(1+frac{(1-x^{2})^{frac n2}}{(x+1)^n})^2(x+1)^n}-(frac{sin(xpi+frac{pi}{2})}{2}+frac 12))^2.$$



                                       err(x, n)



Then we would need to find $n$ such that:



$$frac{d}{dn}int_{-1}^{1}{err(x, n)}{dx}=0.$$



Please note that $n$ is real here



This is where things start getting messy, an integral over $F(x, n)$ doesn't seem to have an analytical (closed) form nor it is clear how to approach differentiation of such integral due to different integration and differentiation variables. I don't understand how to differentiate such anintegral, so thought it would be better to ask here?



Thank you in advance!










share|cite|improve this question











$endgroup$




Abstract



What I'm generally trying to do is to squeeze the first derivative of the sigmoid function defined over infinite range into finite. I.e.



$$f(x)=frac{1}{1+e^{-x}}$$



                                       Sigmoid function



This function is heavily used as an activation function for neural networks. It's integral has close relation to another relu activation function which is also heavily used with neural networks:



$$int{f(x)}{dx}=ln(1+e^{-x})-ln(e^{-x})$$



                                       Relu



It's derivative is useful in modeling and is similar to normal distribution pdf function:



$$frac{d}{dx}f(x)=frac{e^{-x}}{(1+e^{-x})^{2}}=df(x)$$



                                       df



It is also simple to evaluate for computer, it's derivative is expressed in terms of function itself. But $df(x)$ is defined over infinite range which makes it unsuitable for some modeling tasks where the range should be finite. Taking into account the fact that the most of the values close to zero are located outside [-10..10] range, this is shown by the following equation:



$$int_{-infty}^{infty}{df(x)}{dx}-int_{-10}^{10}{df(x)}{dx}=1-frac{e^{10}-e^{-10}}{(1+e^{10})(1+e^{-10})}=0.00009079573740486882...$$



That means that all of the values outside [-10..10] would introduce minor effect to the shape of function when it's range is squeezed from infinite to finite.



So how do we squeeze the range? It can be done through the use of $arctanh(x)$:



                                       arctanh(x)



That goes as an argument to $df$ function. So when approaching +/-1 arctanh would asymptotically approach +/-$infty$ thus when used as an argument it will map the argument range from infinite to finite:



$$df(arctanh(x))=frac{sqrt{1-x^2}}{(1+frac{sqrt{1-x^2}}{x+1})^2(x+1)}$$



                                       df(arctanh(x))



while the function doesn't even look similar to what was originally intended it is easy to get the shape close through a factor before arctanh(x):



$$df(arctanh(x)*6)$$



yields to:



                                       df(6 arctanh(x))



thus getting the final equation equal to:



$$df(n*arctanh(x))=frac{(1-x^2)^{frac n2}}{(1+frac{(1-x^{2})^{frac n2}}{(x+1)^n})^2(x+1)^n}=F(x,n).$$



It's interesting to see how $4*F(x,pi)$ is similar to $frac{sin(xpi+frac{pi}{2})}{2}+frac 12$:



                                       enter image description here



Which could essentially serve as yet another way to approximate sine.



Problem



While playing with $F(x,n)$ I've noticed that it's hard to find values of $n$ that suffice some condition, for instance I want to find $n$ such that it best approximates $frac{sin(xpi+frac{pi}{2})}{2}+frac 12$. Let's define an error function of an approximation:



$$err(x, n) = (4·df(n·arctanh(x))-frac 12·sin(xpi+frac{pi}{2})-frac 12)^2$$



or



$$err(x, n) = (frac{(1-x^2)^{frac n2}}{(1+frac{(1-x^{2})^{frac n2}}{(x+1)^n})^2(x+1)^n}-(frac{sin(xpi+frac{pi}{2})}{2}+frac 12))^2.$$



                                       err(x, n)



Then we would need to find $n$ such that:



$$frac{d}{dn}int_{-1}^{1}{err(x, n)}{dx}=0.$$



Please note that $n$ is real here



This is where things start getting messy, an integral over $F(x, n)$ doesn't seem to have an analytical (closed) form nor it is clear how to approach differentiation of such integral due to different integration and differentiation variables. I don't understand how to differentiate such anintegral, so thought it would be better to ask here?



Thank you in advance!







integration derivatives optimization






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Jan 2 at 18:08









amWhy

1




1










asked Jan 2 at 17:45









Lu4Lu4

1285




1285








  • 1




    $begingroup$
    Differentiating with respect to $n$? Is $n$ a real number or natural number here?
    $endgroup$
    – LoveTooNap29
    Jan 2 at 17:56










  • $begingroup$
    $n$ is real number @LoveTooNap29
    $endgroup$
    – Lu4
    Jan 2 at 17:57










  • $begingroup$
    FWIW, derivatives under integral signs can be calculated using the Leibniz integral rule
    $endgroup$
    – David M.
    Jan 3 at 1:41
















  • 1




    $begingroup$
    Differentiating with respect to $n$? Is $n$ a real number or natural number here?
    $endgroup$
    – LoveTooNap29
    Jan 2 at 17:56










  • $begingroup$
    $n$ is real number @LoveTooNap29
    $endgroup$
    – Lu4
    Jan 2 at 17:57










  • $begingroup$
    FWIW, derivatives under integral signs can be calculated using the Leibniz integral rule
    $endgroup$
    – David M.
    Jan 3 at 1:41










1




1




$begingroup$
Differentiating with respect to $n$? Is $n$ a real number or natural number here?
$endgroup$
– LoveTooNap29
Jan 2 at 17:56




$begingroup$
Differentiating with respect to $n$? Is $n$ a real number or natural number here?
$endgroup$
– LoveTooNap29
Jan 2 at 17:56












$begingroup$
$n$ is real number @LoveTooNap29
$endgroup$
– Lu4
Jan 2 at 17:57




$begingroup$
$n$ is real number @LoveTooNap29
$endgroup$
– Lu4
Jan 2 at 17:57












$begingroup$
FWIW, derivatives under integral signs can be calculated using the Leibniz integral rule
$endgroup$
– David M.
Jan 3 at 1:41






$begingroup$
FWIW, derivatives under integral signs can be calculated using the Leibniz integral rule
$endgroup$
– David M.
Jan 3 at 1:41












0






active

oldest

votes











Your Answer





StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "69"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3059745%2fsqueezing-function-defined-over-infinite-range-down-to-finite%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes
















draft saved

draft discarded




















































Thanks for contributing an answer to Mathematics Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3059745%2fsqueezing-function-defined-over-infinite-range-down-to-finite%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Human spaceflight

Can not write log (Is /dev/pts mounted?) - openpty in Ubuntu-on-Windows?

張江高科駅