Deep Learning Applications

Oct 9, 2015


Applications

DeepFix: A Fully Convolutional Neural Network for predicting Human Eye Fixations

Some like it hot - visual guidance for preference prediction

Deep Learning Algorithms with Applications to Video Analytics for A Smart City: A Survey

Deep Relative Attributes

Deep-Spying: Spying using Smartwatch and Deep Learning

Camera identification with deep convolutional networks

An Analysis of Deep Neural Network Models for Practical Applications

8 Inspirational Applications of Deep Learning

16 Open Source Deep Learning Models Running as Microservices

Makeup like a superstar: Deep Localized Makeup Transfer Network

Deep Cascaded Bi-Network for Face Hallucination

DeepWarp: Photorealistic Image Resynthesis for Gaze Manipulation

Autoencoding Blade Runner

A guy trained a machine to “watch” Blade Runner. Then things got seriously sci-fi.

http://www.vox.com/2016/6/1/11787262/blade-runner-neural-network-encoding

Deep Convolution Networks for Compression Artifacts Reduction

Deep GDashboard: Visualizing and Understanding Genomic Sequences Using Deep Neural Networks

Instagram photos reveal predictive markers of depression

How an Algorithm Learned to Identify Depressed Individuals by Studying Their Instagram Photos

IM2CAD

Fast, Lean, and Accurate: Modeling Password Guessability Using Neural Networks

Defeating Image Obfuscation with Deep Learning

Detecting Music BPM using Neural Networks

Generative Visual Manipulation on the Natural Image Manifold

Deep Impression: Audiovisual Deep Residual Networks for Multimodal Apparent Personality Trait Recognition

Deep Gold: Using Convolution Networks to Find Minerals

Predicting First Impressions with Deep Learning

Judging a Book By its Cover

Image Credibility Analysis with Effective Domain Transferred Deep Networks

A novel image tag completion method based on convolutional neural network

Learning Two-Branch Neural Networks for Image-Text Matching Tasks

https://arxiv.org/abs/1704.03470

Age Estimation

Deeply-Learned Feature for Age Estimation

Age and Gender Classification using Convolutional Neural Networks

Group-Aware Deep Feature Learning For Facial Age Estimation

Local Deep Neural Networks for Age and Gender Classification

https://arxiv.org/abs/1703.08497

Face Aging

Recurrent Face Aging

Face Aging With Conditional Generative Adversarial Networks

Emotion Recognition / Expression Recognition

Real-time emotion recognition for gaming using deep convolutional network features

Emotion Recognition in the Wild via Convolutional Neural Networks and Mapped Binary Patterns

DeXpression: Deep Convolutional Neural Network for Expression Recognition

DEX: Deep EXpectation of apparent age from a single image

EmotioNet: An accurate, real-time algorithm for the automatic annotation of a million facial expressions in the wild

How Deep Neural Networks Can Improve Emotion Recognition on Video Data

Peak-Piloted Deep Network for Facial Expression Recognition

Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution

A Recursive Framework for Expression Recognition: From Web Images to Deep Models to Game Dataset

FaceNet2ExpNet: Regularizing a Deep Face Recognition Net for Expression Recognition

EmotionNet Challenge

Baseline CNN structure analysis for facial expression recognition

Facial Expression Recognition using Convolutional Neural Networks: State of the Art

DAGER: Deep Age, Gender and Emotion Recognition Using Convolutional Neural Network

Deep generative-contrastive networks for facial expression recognition

https://arxiv.org/abs/1703.07140

Convolutional Neural Networks for Facial Expression Recognition

https://arxiv.org/abs/1704.06756

End-to-End Multimodal Emotion Recognition using Deep Neural Networks

Spatial-Temporal Recurrent Neural Network for Emotion Recognition

https://arxiv.org/abs/1705.04515

Facial Emotion Detection Using Convolutional Neural Networks and Representational Autoencoder Units

https://arxiv.org/abs/1706.01509

Attribution Prediction

PANDA: Pose Aligned Networks for Deep Attribute Modeling

Predicting psychological attributions from face photographs with a deep neural network

Learning Human Identity from Motion Patterns

Pose Estimation

DeepPose: Human Pose Estimation via Deep Neural Networks

Heterogeneous multi-task learning for human pose estimation with deep convolutional neural network

Flowing ConvNets for Human Pose Estimation in Videos

Structured Feature Learning for Pose Estimation

Convolutional Pose Machines

Model-based Deep Hand Pose Estimation

Stacked Hourglass Networks for Human Pose Estimation

Chained Predictions Using Convolutional Neural Networks

DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model

Real-time Human Pose Estimation from Video with Convolutional Neural Networks

Region Ensemble Network: Improving Convolutional Network for Hand Pose Estimation

Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources

Adversarial PoseNet: A Structure-aware Convolutional Network for Human Pose Estimation

Human Pose Detection Mining Body Language from Videos

OpenPose: A Real-Time Multi-Person Keypoint Detection And Multi-Threading C++ Library

Learning Feature Pyramids for Human Pose Estimation

Multi-Context Attention for Human Pose Estimation

Human Pose Estimation with TensorFlow

https://github.com/eldar/pose-tensorflow

Sentiment Prediction

From Pixels to Sentiment: Fine-tuning CNNs for Visual Sentiment Prediction

Predict Sentiment From Movie Reviews Using Deep Learning

Neural Sentiment Classification with User and Product Attention

Place Recognition

NetVLAD: CNN architecture for weakly supervised place recognition

PlaNet - Photo Geolocation with Convolutional Neural Networks

Visual place recognition using landmark distribution descriptors

Low-effort place recognition with WiFi fingerprints using deep learning

Deep Learning Features at Scale for Visual Place Recognition

Place recognition: An Overview of Vision Perspective

https://arxiv.org/abs/1707.03470

Camera Relocalization

PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization

Modelling Uncertainty in Deep Learning for Camera Relocalization

Random Forests versus Neural Networks - What’s Best for Camera Relocalization?

Deep Convolutional Neural Network for 6-DOF Image Localization

Image-based Localization with Spatial LSTMs

VidLoc: 6-DoF Video-Clip Relocalization

Towards CNN Map Compression for camera relocalisation

Camera Relocalization by Computing Pairwise Relative Poses Using Convolutional Neural Network

Counting Objects

Towards perspective-free object counting with deep learning

Using Convolutional Neural Networks to Count Palm Trees in Satellite Images

Count-ception: Counting by Fully Convolutional Redundant Counting

https://arxiv.org/abs/1703.08710

Counting Objects with Faster R-CNN

Drone-based Object Counting by Spatially Regularized Regional Proposal Network

https://arxiv.org/abs/1707.05972

FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras

Crowd Counting / Crowd Analysis

Large scale crowd analysis based on convolutional neural network

Deep People Counting in Extremely Dense Crowds

Crossing-line Crowd Counting with Two-phase Deep Neural Networks

Cross-scene Crowd Counting via Deep Convolutional Neural Networks

Single-Image Crowd Counting via Multi-Column Convolutional Neural Network

CrowdNet: A Deep Convolutional Network for Dense Crowd Counting

Crowd Counting by Adapting Convolutional Neural Networks with Side Information

Fully Convolutional Crowd Counting On Highly Congested Scenes

Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction

Multi-scale Convolutional Neural Networks for Crowd Counting

Mixture of Counting CNNs: Adaptive Integration of CNNs Specialized to Specific Appearance for Crowd Counting

https://arxiv.org/abs/1703.09393

Beyond Counting: Comparisons of Density Maps for Crowd Analysis Tasks - Counting, Detection, and Tracking

https://arxiv.org/abs/1705.10118

ResnetCrowd: A Residual Deep Learning Architecture for Crowd Counting, Violent Behaviour Detection and Crowd Density Level Classification

Image Crowd Counting Using Convolutional Neural Network and Markov Random Field

A Survey of Recent Advances in CNN-based Single Image Crowd Counting and Density Estimation

https://arxiv.org/abs/1707.01202

Spatiotemporal Modeling for Crowd Counting in Videos

CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting

Switching Convolutional Neural Network for Crowd Counting

Generating High-Quality Crowd Density Maps using Contextual Pyramid CNNs

Activity Recognition

Implementing a CNN for Human Activity Recognition in Tensorflow

Concurrent Activity Recognition with Multimodal CNN-LSTM Structure

CERN: Confidence-Energy Recurrent Network for Group Activity Recognition

Deploying Tensorflow model on Android device for Human Activity Recognition

Music Classification / Sound Classification

Explaining Deep Convolutional Neural Networks on Music Classification

Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

Convolutional Recurrent Neural Networks for Music Classification

CNN Architectures for Large-Scale Audio Classification

SoundNet: Learning Sound Representations from Unlabeled Video

Deep Learning ‘ahem’ detector

GenreFromAudio: Finding the genre of a song with Deep Learning

TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition

On the Robustness of Deep Convolutional Neural Networks for Music Classification

NSFW Detection / Classification

Nipple Detection using Convolutional Neural Network

Applying deep learning to classify pornographic images and videos

Moderate, Filter, or Curate Adult Content with Clarifai’s NSFW Model

What Convolutional Neural Networks Look at When They See Nudity

Open Sourcing a Deep Learning Solution for Detecting NSFW Images

Miles Deep - AI Porn Video Editor

Image Reconstruction / Inpainting

Context Encoders: Feature Learning by Inpainting

Semantic Image Inpainting with Perceptual and Contextual Losses

High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis

Face Image Reconstruction from Deep Templates

https://www.arxiv.org/abs/1703.00832

Image Restoration

Image Restoration Using Very Deep Convolutional Encoder-Decoder Networks with Symmetric Skip Connections

Image Restoration Using Convolutional Auto-encoders with Symmetric Skip Connections

Image Completion with Deep Learning in TensorFlow

Deeply Aggregated Alternating Minimization for Image Restoration

A New Convolutional Network-in-Network Structure and Its Applications in Skin Detection, Semantic Segmentation, and Artifact Reduction

Generative Face Completion

MemNet: A Persistent Memory Network for Image Restoration

Image Super-Resolution

Super-Resolution.Benckmark

Image Super-Resolution Using Deep Convolutional Networks

Learning a Deep Convolutional Network for Image Super-Resolution

Shepard Convolutional Neural Networks

Bidirectional Recurrent Convolutional Networks for Multi-Frame Super-Resolution

Deeply-Recursive Convolutional Network for Image Super-Resolution

Accurate Image Super-Resolution Using Very Deep Convolutional Networks

Super-Resolution with Deep Convolutional Sufficient Statistics

Deep Depth Super-Resolution: Learning Depth Super-Resolution using Deep Convolutional Neural Network

Local- and Holistic- Structure Preserving Image Super Resolution via Deep Joint Component Learning

End-to-End Image Super-Resolution via Deep and Shallow Convolutional Networks

Accelerating the Super-Resolution Convolutional Neural Network

srez: Image super-resolution through deep learning

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network

Is the deconvolution layer the same as a convolutional layer?

  • intro: A note on Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network.
  • arxiv: http://arxiv.org/abs/1609.07009

Amortised MAP Inference for Image Super-resolution

Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation

Super-Resolution on Satellite Imagery using Deep Learning

Neural Enhance: Super Resolution for images using deep learning.

Texture Enhancement via High-Resolution Style Transfer for Single-Image Super-Resolution

EnhanceNet: Single Image Super-Resolution through Automated Texture Synthesis

Learning a Mixture of Deep Networks for Single Image Super-Resolution

Dual Recovery Network with Online Compensation for Image Super-Resolution

Super-resolution Using Constrained Deep Texture Synthesis

Pixel Recursive Super Resolution

GUN: Gradual Upsampling Network for single image super-resolution

Single Image Super-resolution with a Parameter Economic Residual-like Convolutional Neural Network

Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution

Single Image Super-Resolution Using Multi-Scale Convolutional Neural Network

Super-Resolution via Deep Learning

High-Quality Face Image SR Using Conditional Generative Adversarial Networks

https://arxiv.org/abs/1707.00737

Enhanced Deep Residual Networks for Single Image Super-Resolution

Fast and Accurate Image Super Resolution by Deep CNN with Skip Connection and Network in Network

Single Image Super-Resolution with Dilated Convolution based Multi-Scale Information Learning Inception Module

Attention-Aware Face Hallucination via Deep Reinforcement Learning

https://arxiv.org/abs/1708.03132

Video Super-resolution

Detail-revealing Deep Video Super-resolution

End-to-End Learning of Video Super-Resolution with Motion Compensation

Image Denoising

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising

Medical image denoising using convolutional denoising autoencoders

Rectifier Neural Network with a Dual-Pathway Architecture for Image Denoising

Non-Local Color Image Denoising with Convolutional Neural Networks

Joint Visual Denoising and Classification using Deep Learning

Deep Convolutional Denoising of Low-Light Images

Deep Class Aware Denoising

End-to-End Learning for Structured Prediction Energy Networks

Block-Matching Convolutional Neural Network for Image Denoising

https://arxiv.org/abs/1704.00524

When Image Denoising Meets High-Level Vision Tasks: A Deep Learning Approach

https://arxiv.org/abs/1706.04284

Wide Inference Network for Image Denoising

https://arxiv.org/abs/1707.05414

Learning Pixel-Distribution Prior with Wider Convolution for Image Denoising

Image Denoising via CNNs: An Adversarial Approach

Image Haze Removal

DehazeNet: An End-to-End System for Single Image Haze Removal

An All-in-One Network for Dehazing and Beyond

Joint Transmission Map Estimation and Dehazing using Deep Networks

https://arxiv.org/abs/1708.00581

Image Rain Removal / De-raining

Clearing the Skies: A deep network architecture for single-image rain removal

Joint Rain Detection and Removal via Iterative Region Dependent Multi-Task Learning

Image De-raining Using a Conditional Generative Adversarial Network

Fence Removal

Deep learning based fence segmentation and removal from an image using a video sequence

Blur Detection and Removal

Learning to Deblur

Learning a Convolutional Neural Network for Non-uniform Motion Blur Removal

End-to-End Learning for Image Burst Deblurring

Deep Video Deblurring

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

From Motion Blur to Motion Flow: a Deep Learning Solution for Removing Heterogeneous Motion Blur

Motion Deblurring in the Wild

Deep Face Deblurring

https://arxiv.org/abs/1704.08772

Learning Blind Motion Deblurring

Image Compression

An image compression and encryption scheme based on deep learning

Full Resolution Image Compression with Recurrent Neural Networks

Image Compression with Neural Networks

Lossy Image Compression With Compressive Autoencoders

End-to-end Optimized Image Compression

CAS-CNN: A Deep Convolutional Neural Network for Image Compression Artifact Suppression

Semantic Perceptual Image Compression using Deep Convolution Networks

Generative Compression

Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks

https://arxiv.org/abs/1703.10114

Learning Convolutional Networks for Content-weighted Image Compression

https://arxiv.org/abs/1703.10553

Real-Time Adaptive Image Compression

Image Quality Assessment

Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

Image Matting

Deep Image Matting

Fast Deep Matting for Portrait Animation on Mobile Phone

  • intro: ACM Multimedia Conference (MM) 2017
  • intro: does not need any interaction and can realize real-time matting with 15 fps
  • arxiv: https://arxiv.org/abs/1707.08289

Image Blending

GP-GAN: Towards Realistic High-Resolution Image Blending

Image Enhancement

Deep Bilateral Learning for Real-Time Image Enhancement

Aesthetic-Driven Image Enhancement by Adversarial Learning

Abnormality Detection / Anomaly Detection

Toward a Taxonomy and Computational Models of Abnormalities in Images

Depth Prediction / Depth Estimation

Deep Convolutional Neural Fields for Depth Estimation from a Single Image

Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue

Depth from a Single Image by Harmonizing Overcomplete Local Network Predictions

Deeper Depth Prediction with Fully Convolutional Residual Networks

Single image depth estimation by dilated deep residual convolutional neural network and soft-weight-sum inference

https://arxiv.org/abs/1705.00534

Monocular Depth Estimation with Hierarchical Fusion of Dilated CNNs and Soft-Weighted-Sum Inference

Texture Synthesis

Texture Synthesis Using Convolutional Neural Networks

Texture Networks: Feed-forward Synthesis of Textures and Stylized Images

Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks

Texture Synthesis with Spatial Generative Adversarial Networks

Improved Texture Networks: Maximizing Quality and Diversity in Feed-forward Stylization and Texture Synthesis

Deep TEN: Texture Encoding Network

Diversified Texture Synthesis with Feed-forward Networks

Image Synthesis

Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis

Generative Adversarial Text to Image Synthesis

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

Image Tagging

Fast Zero-Shot Image Tagging

Flexible Image Tagging with Fast0Tag

Sampled Image Tagging and Retrieval Methods on User Generated Content

Image Matching

Learning Fine-grained Image Similarity with Deep Ranking

Learning to compare image patches via convolutional neural networks

MatchNet: Unifying Feature and Metric Learning for Patch-Based Matching

Fashion Style in 128 Floats

Fully-Trainable Deep Matching

Local Similarity-Aware Deep Feature Embedding

Convolutional neural network architecture for geometric matching

Image Editing

Neural Photo Editing with Introspective Adversarial Networks

Deep Feature Interpolation for Image Content Changes

Invertible Conditional GANs for image editing

Semantic Facial Expression Editing using Autoencoded Flow

Face Swap

Fast Face-swap Using Convolutional Neural Networks

Face Editing

Neural Face Editing with Intrinsic Image Disentangling

Music Tagging

Automatic tagging using deep convolutional neural networks

Music tagging and feature extraction with MusicTaggerCRNN

https://keras.io/applications/#music-tagging-and-feature-extraction-with-musictaggercrnn

Action Recognition

Single Image Action Recognition by Predicting Space-Time Saliency

https://arxiv.org/abs/1705.04641

CTR Prediction

Deep CTR Prediction in Display Advertising

DeepFM: A Factorization-Machine based Neural Network for CTR Prediction

Deep Interest Network for Click-Through Rate Prediction

Cryptography

Learning to Protect Communications with Adversarial Neural Cryptography

Adversarial Neural Cryptography in Theano

Embedding Watermarks into Deep Neural Networks

Cyber Security

Collection of Deep Learning Cyber Security Research Papers

Lip Reading

LipNet: Sentence-level Lipreading

LipNet: End-to-End Sentence-level Lipreading

Lip Reading Sentences in the Wild

Combining Residual Networks with LSTMs for Lipreading

Event Recognition

Better Exploiting OS-CNNs for Better Event Recognition in Images

Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images

IOD-CNN: Integrating Object Detection Networks for Event Recognition

https://arxiv.org/abs/1703.07431

Others

Selfai: Predicting Facial Beauty in Selfies

Selfai: A Method for Understanding Beauty in Selfies

Deep Learning Enables You to Hide Screen when Your Boss is Approaching

Blogs

40 Ways Deep Learning is Eating the World

https://medium.com/intuitionmachine/the-ultimate-deep-learning-applications-list-434d1425da1d#.rxq8xvbfz

Applications

http://www.deeplearningpatterns.com/doku.php/applications

Systematic Approach To Applications Of Deep Learning

https://gettocode.com/2016/11/25/systematic-approach-to-applications-of-deep-learning/

Resources

Deep Learning Gallery - a curated collection of deep learning projects

http://deeplearninggallery.com/