Squeezing function defined over infinite range down to finite

Abstract

I am trying to squeeze the first derivative of the sigmoid function, which is defined over an infinite range, into a finite range. The sigmoid function is:

$$f(x)=\frac{1}{1+e^{-x}}$$

[Figure: sigmoid function]

This function is heavily used as an activation function in neural networks. Its integral is closely related to ReLU, another activation function that is heavily used in neural networks:

$$\int f(x)\,dx=\ln(1+e^{-x})-\ln(e^{-x})+C$$

[Figure: ReLU]
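A short worked step, added for clarity, makes the connection explicit: the two logarithms combine into the function usually called softplus, which behaves like ReLU, $\max(0,x)$, away from the origin.

$$\ln(1+e^{-x})-\ln(e^{-x})=\ln\frac{1+e^{-x}}{e^{-x}}=\ln(1+e^{x})$$

For large positive $x$ this is approximately $x$, and for large negative $x$ it tends to $0$.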

Its derivative is useful in modeling and has a bell shape similar to the pdf of the normal distribution:

$$\frac{d}{dx}f(x)=\frac{e^{-x}}{(1+e^{-x})^{2}}=df(x)$$
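As a worked step (added here, using only the definitions above), the derivative can be rewritten in terms of $f$ itself, since $1-f(x)=\frac{e^{-x}}{1+e^{-x}}$:

$$df(x)=\frac{1}{1+e^{-x}}\cdot\frac{e^{-x}}{1+e^{-x}}=f(x)\,(1-f(x))$$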

It is also cheap to evaluate on a computer, since the derivative can be expressed in terms of the function itself. But $df(x)$ is defined over an infinite range, which makes it unsuitable for some modeling tasks where the range should be finite. Note, however, that the values of $df(x)$ outside the range [-10..10] are all close to zero, as the following equation shows:

$$\int_{-\infty}^{\infty}df(x)\,dx-\int_{-10}^{10}df(x)\,dx=1-\frac{e^{10}-e^{-10}}{(1+e^{10})(1+e^{-10})}=0.00009079573740486882\ldots$$

That means the values outside [-10..10] have only a minor effect on the shape of the function when its range is squeezed from infinite to finite.
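The tail estimate can be reproduced numerically. The following is a minimal Python sketch using only the standard library; the function names `f` and `tail_mass` are chosen here for illustration and are not part of the original post.

```python
import math

def f(x):
    """Sigmoid function 1 / (1 + e^{-x})."""
    return 1.0 / (1.0 + math.exp(-x))

def tail_mass(a):
    """Area under df outside [-a, a].

    The total area under df is f(+inf) - f(-inf) = 1, and the area
    over [-a, a] is f(a) - f(-a) because df is the derivative of f.
    """
    return 1.0 - (f(a) - f(-a))

print(tail_mass(10.0))  # roughly 9.08e-05
```

By the symmetry $f(-a)=1-f(a)$, the same quantity equals $\frac{2}{1+e^{a}}$, which is a convenient cross-check.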

So how do we squeeze the range? It can be done with $\operatorname{arctanh}(x)$:

[Figure: arctanh(x)]

The result of $\operatorname{arctanh}$ is passed as the argument to $df$. As $x$ approaches $\pm 1$, $\operatorname{arctanh}(x)$ asymptotically approaches $\pm\infty$, so using it as the argument maps the infinite range of $df$ onto the finite interval $(-1, 1)$:

$$df(\operatorname{arctanh}(x))=\frac{\sqrt{1-x^2}}{\left(1+\frac{\sqrt{1-x^2}}{x+1}\right)^2(x+1)}$$

[Figure: df(arctanh(x))]
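The formula follows from the logarithmic form of the inverse hyperbolic tangent. A short derivation, added as a check:

$$\operatorname{arctanh}(x)=\frac12\ln\frac{1+x}{1-x}\quad\Rightarrow\quad e^{-\operatorname{arctanh}(x)}=\sqrt{\frac{1-x}{1+x}}=\frac{\sqrt{1-x^2}}{x+1},\qquad |x|<1$$

Substituting this for $e^{-x}$ in $df(x)=\frac{e^{-x}}{(1+e^{-x})^2}$ gives the expression above.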

While this function does not look much like what was originally intended, it is easy to bring the shape closer by putting a factor in front of $\operatorname{arctanh}(x)$:

$$df(6\operatorname{arctanh}(x))$$

which yields:

[Figure: df(6 arctanh(x))]

This gives the final equation:

$$df(n\operatorname{arctanh}(x))=\frac{(1-x^2)^{\frac n2}}{\left(1+\frac{(1-x^{2})^{\frac n2}}{(x+1)^n}\right)^2(x+1)^n}=F(x,n).$$
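To check that the closed form agrees with the composition it was derived from, here is a minimal Python sketch. The names `df`, `F_closed` and `F_composed` are illustrative choices, and the evaluation is restricted to $-1<x<1$, where $\operatorname{arctanh}$ is finite.

```python
import math

def df(t):
    """Derivative of the sigmoid, e^{-t} / (1 + e^{-t})^2."""
    e = math.exp(-t)
    return e / (1.0 + e) ** 2

def F_closed(x, n):
    """Closed form of F(x, n) from the equation above, for -1 < x < 1."""
    # r is the repeated sub-expression (1 - x^2)^(n/2) / (x + 1)^n
    r = (1.0 - x * x) ** (n / 2.0) / (x + 1.0) ** n
    return r / (1.0 + r) ** 2

def F_composed(x, n):
    """F(x, n) evaluated directly as df(n * arctanh(x))."""
    return df(n * math.atanh(x))

# Print both evaluations side by side for the factor 6 used earlier.
for x in (-0.9, -0.5, 0.0, 0.5, 0.9):
    print(x, F_closed(x, 6.0), F_composed(x, 6.0))
```

Both columns should agree up to floating-point rounding, and at $x=0$ both give $F(0,n)=\frac14$ for any $n$.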

It is interesting to see how similar $4F(x,\pi)$ is to $\frac{\sin(x\pi+\frac{\pi}{2})}{2}+\frac 12$:

[Figure: $4F(x,\pi)$ compared with $\frac{\sin(x\pi+\frac{\pi}{2})}{2}+\frac 12$]

This could essentially serve as yet another way to approximate sine.
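Both sides simplify, which may make the comparison easier to reason about. The identities below are standard; the case $n=2$ is included only as an illustration.

$$4F(x,n)=\frac{4e^{-t}}{(1+e^{-t})^{2}}=\frac{1}{\cosh^{2}(t/2)},\quad t=n\operatorname{arctanh}(x),\qquad \frac{\sin(x\pi+\frac{\pi}{2})}{2}+\frac12=\frac{1+\cos(\pi x)}{2}=\cos^{2}\left(\frac{\pi x}{2}\right)$$

For $n=2$ the first expression reduces to $1-x^{2}$, because $\cosh^{2}(\operatorname{arctanh}(x))=\frac{1}{1-x^{2}}$.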

Problem

While playing with $F(x,n)$ I noticed that it is hard to find values of $n$ that satisfy a given condition. For instance, I want to find the $n$ for which $4F(x,n)$ best approximates $\frac{\sin(x\pi+\frac{\pi}{2})}{2}+\frac 12$. Let's define the error function of the approximation:

$$err(x, n) = \left(4\cdot df(n\cdot\operatorname{arctanh}(x))-\frac 12\cdot\sin\left(x\pi+\frac{\pi}{2}\right)-\frac 12\right)^2$$

$$err(x, n) = \left(\frac{4(1-x^2)^{\frac n2}}{\left(1+\frac{(1-x^{2})^{\frac n2}}{(x+1)^n}\right)^2(x+1)^n}-\left(\frac{\sin(x\pi+\frac{\pi}{2})}{2}+\frac 12\right)\right)^2.$$

[Figure: err(x, n)]

Then we need to find $n$ such that:

$$\frac{d}{dn}\int_{-1}^{1}err(x, n)\,dx=0.$$

Please note that $n$ is real here.
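Without a closed form, the stationary point can at least be located numerically. The sketch below is one possible approach, not a definitive method: it approximates the integral with composite Simpson's rule and minimizes over $n$ with a golden-section search. The helper names, the bracket $[1, 6]$, the panel count and the tolerance are all assumptions made for illustration, and the search assumes the integrated error has a single minimum inside the bracket.

```python
import math

def four_F(x, n):
    """4*F(x, n), written as 1 / cosh^2(n * arctanh(x) / 2).

    For n > 0 the limit at x = +-1 is 0, so the endpoints are
    handled separately to avoid evaluating atanh(+-1).
    """
    if abs(x) >= 1.0:
        return 0.0
    return 1.0 / math.cosh(0.5 * n * math.atanh(x)) ** 2

def target(x):
    """The function to approximate: sin(pi*x + pi/2)/2 + 1/2."""
    return 0.5 * math.sin(math.pi * x + 0.5 * math.pi) + 0.5

def total_error(n, panels=2000):
    """Composite Simpson approximation of the integral of err(x, n) over [-1, 1]."""
    h = 2.0 / panels
    total = 0.0
    for i in range(panels + 1):
        x = -1.0 + i * h
        weight = 1 if i in (0, panels) else (4 if i % 2 else 2)
        total += weight * (four_F(x, n) - target(x)) ** 2
    return total * h / 3.0

def golden_section_min(func, lo, hi, tol=1e-6):
    """Golden-section search; assumes func has a single minimum on [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    fc, fd = func(c), func(d)
    while hi - lo > tol:
        if fc < fd:
            # The minimum lies in [lo, d]; reuse c as the new upper probe.
            hi, d, fd = d, c, fc
            c = hi - inv_phi * (hi - lo)
            fc = func(c)
        else:
            # The minimum lies in [c, hi]; reuse d as the new lower probe.
            lo, c, fc = c, d, fd
            d = lo + inv_phi * (hi - lo)
            fd = func(d)
    return 0.5 * (lo + hi)

best_n = golden_section_min(total_error, 1.0, 6.0)
print(best_n, total_error(best_n), total_error(math.pi))
```

The last line prints the located $n$ together with the integrated error at that $n$ and at $n=\pi$, so the two can be compared directly. A derivative-free search is used here because it needs only values of the integral, not the derivative with respect to $n$.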

This is where things get messy: an integral over $F(x, n)$ does not seem to have an analytical (closed) form, nor is it clear how to approach differentiating such an integral, since the integration variable ($x$) and the differentiation variable ($n$) are different. How should such an integral be differentiated?

Thank you in advance!

edited Jan 2 at 18:08 by amWhy

asked Jan 2 at 17:45 by Lu4

Differentiating with respect to $n$? Is $n$ a real number or natural number here?
– LoveTooNap29
Jan 2 at 17:56

$n$ is real number @LoveTooNap29
– Lu4
Jan 2 at 17:57

FWIW, derivatives under integral signs can be calculated using the Leibniz integral rule
– David M.
Jan 3 at 1:41
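The Leibniz integral rule mentioned in the last comment can be sketched for this problem as follows. This assumes $n>0$, so that the integrand and its derivative with respect to $n$ stay bounded on $[-1,1]$. Since the limits of integration do not depend on $n$, no boundary terms appear and the derivative moves inside the integral:

$$\frac{d}{dn}\int_{-1}^{1}err(x,n)\,dx=\int_{-1}^{1}\frac{\partial}{\partial n}err(x,n)\,dx=\int_{-1}^{1}2\left(4F(x,n)-\frac{\sin(x\pi+\frac{\pi}{2})}{2}-\frac12\right)\cdot 4\,\frac{\partial F}{\partial n}(x,n)\,dx$$

$$\frac{\partial F}{\partial n}(x,n)=\operatorname{arctanh}(x)\,df'(n\operatorname{arctanh}(x)),\qquad df'(t)=\frac{e^{-t}(e^{-t}-1)}{(1+e^{-t})^{3}}$$

Setting the right-hand side of the first equation to zero gives a condition on $n$ that would most likely still have to be solved numerically.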




Tags: integration, derivatives, optimization

