Abstract

The bokeh effect in photography has gained wide popularity with improvements in smartphone cameras, as it draws attention to the subject of an image and enhances the overall quality of the photo. Such effects are generally achieved with dual-lens cameras that auto-focus on the subject; smartphones with a single lens must instead rely on software to generate the effect. This paper proposes a deep learning pipeline that generates depth-aware segmentation maps of human images via segmentation and depth estimation networks. We present a concatenation-based decoder for segmentation, applying and experimenting with features learned by state-of-the-art encoder architectures; further, we concatenate the encodings of two prominent encoders to form an ensemble model for learning segments. We then combine a prominent depth estimation architecture with our segmentation results to generate depth-aware segmentation maps, yielding photos with more focus on human subjects in which the out-of-focus regions appear blurred. The methodology produces compelling bokeh effects, comparable to shots taken with a dual-lens mobile camera or a DSLR. In our human segmentation experiments, we report benchmark results with our best-performing model: training on the Supervisely Persons dataset achieved an IoU score of 95.88%, while training the same network on the EG1800 dataset achieved a state-of-the-art IoU of 96.89%. The final segmentation model thus produces highly accurate segmentation maps suitable for our task.
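The final compositing step described above — keeping the segmented human subject sharp while blurring the background in proportion to estimated depth — can be sketched in pure NumPy. This is a minimal illustration, not the paper's implementation: the function names (`box_blur`, `apply_bokeh`), the box-blur kernel, and the linear depth-weighted blend are all illustrative assumptions; the paper's networks would supply `subject_mask` and `depth`.

```python
import numpy as np


def box_blur(img, k):
    """Separable box blur over a (2k+1)x(2k+1) window, pure NumPy.

    img: HxWxC float array. Edge padding avoids darkened borders.
    """
    w = 2 * k + 1
    out = img
    for axis in (0, 1):
        pad_width = [(0, 0)] * out.ndim
        pad_width[axis] = (k, k)
        p = np.pad(out, pad_width, mode="edge")
        # Prepend a zero slab so cs[j] is the sum of the first j elements.
        zero = np.zeros_like(np.take(p, [0], axis=axis))
        cs = np.concatenate([zero, p.cumsum(axis=axis)], axis=axis)
        n = img.shape[axis]
        # Windowed mean via difference of cumulative sums.
        out = (np.take(cs, range(w, w + n), axis=axis)
               - np.take(cs, range(0, n), axis=axis)) / w
    return out


def apply_bokeh(image, subject_mask, depth, k=6):
    """Composite a sharp subject over a depth-weighted blurred background.

    image:        HxWx3 floats in [0, 1]
    subject_mask: HxW floats, 1 = subject, 0 = background
    depth:        HxW floats, larger = farther from the camera
    """
    # Normalize depth to [0, 1]; farther pixels receive stronger blur.
    d = (depth - depth.min()) / (depth.max() - depth.min() + 1e-8)
    blurred = box_blur(image, k)
    background = d[..., None] * blurred + (1 - d[..., None]) * image
    m = subject_mask[..., None]
    return m * image + (1 - m) * background
```

A production pipeline would typically use a disc-shaped (aperture-like) kernel with per-pixel radius rather than a single box blur, but the mask/depth compositing logic is the same.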

Details

Title
HISNet: a Human Image Segmentation Network aiding bokeh effect generation
Author
Gupta, Shaurya 1 ; Vishwakarma, Dinesh Kumar 1

1 Delhi Technological University, Biometric Research Laboratory, Department of Information Technology, Delhi, India (GRID:grid.440678.9) (ISNI:0000 0001 0674 5044)
Pages
12469-12492
Publication year
2023
Publication date
Mar 2023
Publisher
Springer Nature B.V.
ISSN
1380-7501
e-ISSN
1573-7721
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2781944842
Copyright
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.