论文笔记之：Optical Flow Estimation using a Spatial Pyramid Network

2021-07-19 18:05

阅读：1169

标签：war font size filter ges nts eth enter pre

　　Optical Flow Estimation using a Spatial Pyramid Network

spynet

　　本文将经典的 spatial-pyramid formulation 和 deep learning 的方法相结合，以一种 coarse to fine approach，进行光流的计算。This estiamates large motions in a coarse to fine approach by warping one image of a pair at each pyramid level by the current flow estimate and compute an update to the flow.

　　我们利用 CNN 来进行每一层 flow 的更新，而不是传统方法中目标函数的最小化。与 FlowNet 相比，本文的方法不需要处理 large motions；这些已经在 pyramid 中处理了。该方法的主要优势有：

　　1. our Spatial Pyramid Network is much simpler and 96% smaller than FlowNet in terms of model parameters.

　　2. since the flow at each pyramid level is small (1 pixel), a convolutional approach applied to pairs of warped images is appropriate.

　　3. unlike FlowNet, the learned convolution filters appear similar to classical spatio-temporal filters, giving insight into the method and how to improve it.

　　现有方法存在的主要问题：

　　将两张图直接 stack大一起，放到 CNN 当中。当两帧图像之间的 motion 大于 one or a few pixels， spatial-temporal convolutional filters 将不会收到有效的相应。也就是说，if a convolutional window in one image does not overlap with related image pixels at the next time instant, no meaningful temporal filter can be learned.

　　这里需要解决两个关键性的问题：1. 长期依赖的问题；　　2. detailed, sub-pixel, optical flow and precise motion boundaries。FlowNet 是尝试在一个网络中解决这两个问题，而该方法则是用 CNN 来解决第二个问题，用现有的方法来解决第一个问题。

　　Approach：

　　本文用 spatial pyramid 的方式，from coarse to fine 的方法来解决 large motion的问题。

上一篇：什么是CSS hack?

下一篇：Mule ESB-3.Build a webservice proxy

文章来自：搜素材网的编程语言模块，转载请注明文章出处。
文章标题：论文笔记之：Optical Flow Estimation using a Spatial Pyramid Network
文章链接：http://soscw.com/essay/106358.html

亲，登录后才可以留言！

论文笔记之：Optical Flow Estimation using a Spatial Pyramid Network

评论

热门文章

推荐文章

最新文章

置顶文章