Abstract: In this article, we propose a general triple-channel hybrid spatial (image-based), temporal (motion-based), and depth-contour (distance-based) architecture from RGB or RGB-D cameras ...