Dipesh Gyawali

Region-Transformer: Self-Attention Region Based Class-Agnostic Point Cloud Segmentation

Mar 03, 2024

Dipesh Gyawali, Jian Zhang, BB Karki

Abstract: Point cloud segmentation, which helps us understand the environment of specific structures and objects, can be performed in class-specific and class-agnostic ways. We propose a novel region-based transformer model called Region-Transformer for performing class-agnostic point cloud segmentation. The model utilizes a region-growth approach and a self-attention mechanism to iteratively expand or contract a region by adding or removing points. It is trained on simulated point clouds with instance labels only, avoiding semantic labels. Attention-based networks have succeeded in many previous point cloud segmentation methods. However, the performance gain from combining a region-growth approach with attention-based networks has yet to be explored. To our knowledge, we are the first to use a self-attention mechanism in a region-growth approach. By introducing self-attention to region growth, which can exploit the local contextual information of neighboring points, our experiments demonstrate that the Region-Transformer model outperforms previous class-agnostic and class-specific methods on indoor datasets in terms of clustering metrics. The model generalizes well to large-scale scenes. Key advantages include capturing long-range dependencies through self-attention, avoiding the need for semantic labels during training, and applicability to a variable number of objects. The Region-Transformer model represents a promising approach for flexible point cloud segmentation with applications in robotics, digital twinning, and autonomous vehicles.

* 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, Volume 4: VISAPP, pp. 341-348, 2024, Rome, Italy
* 8 pages, 5 figures, 3 tables
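
The region-growth idea described in the abstract above can be illustrated with a minimal sketch. This is not the Region-Transformer model: the paper's learned self-attention network is replaced here by a hypothetical `accept` callback, only region expansion is shown (not contraction), and the names `segment`, `radius`, and `accept` are assumptions made for illustration. The output is one instance id per point, with no semantic classes involved.

```python
from collections import deque
from math import dist


def segment(points, radius, accept=lambda points, region, j: True):
    """Class-agnostic region growing over a list of 3D points.

    `accept(points, region, j)` is a hypothetical stand-in for a learned
    model that decides whether candidate point j joins the current region.
    Returns a list of instance ids, one per point.
    """
    labels = [-1] * len(points)  # -1 means "not yet assigned"
    next_id = 0
    for seed in range(len(points)):
        if labels[seed] != -1:
            continue
        region = {seed}
        frontier = deque([seed])
        # Breadth-first expansion: visit neighbors of every point in the region.
        while frontier:
            i = frontier.popleft()
            for j in range(len(points)):
                if labels[j] != -1 or j in region:
                    continue
                near = dist(points[i], points[j]) <= radius
                if near and accept(points, region, j):
                    region.add(j)
                    frontier.append(j)
        for i in region:
            labels[i] = next_id
        next_id += 1
    return labels


cloud = [(0, 0, 0), (0.1, 0, 0), (0.2, 0, 0), (5, 5, 5), (5.1, 5, 5)]
print(segment(cloud, radius=0.5))
```

The neighbor search here is a brute-force scan, which is quadratic in the number of points; a spatial index would be the usual choice for large scenes.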

Comparative Analysis of CPU and GPU Profiling for Deep Learning Models

Sep 05, 2023

Dipesh Gyawali

Abstract: Deep learning (DL) and machine learning (ML) applications have been increasing rapidly. Massive amounts of data are being generated over the internet, from which ML and DL algorithms can derive meaningful results. Hardware resources and open-source libraries have made it easy to implement these algorithms. TensorFlow and PyTorch are among the leading frameworks for implementing ML projects. Using these frameworks, we can trace the operations executed on both the GPU and the CPU to analyze resource allocation and consumption. This paper presents the time and memory allocation of the CPU and GPU while training deep neural networks using PyTorch. The analysis shows that the GPU has a lower running time than the CPU for deep neural networks. For a simpler network, the GPU offers little improvement over the CPU.

* 6 pages, 11 figures
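
As a rough illustration of what measuring time and memory allocation for a workload involves, the sketch below uses only the Python standard library. It is not the paper's setup: the paper profiles PyTorch training on CPU and GPU, whereas this stand-in times a small pure-Python matrix multiply on the CPU and reports the peak memory traced by `tracemalloc`. The helper names `profile` and `matmul` are assumptions for illustration.

```python
import time
import tracemalloc


def profile(fn, *args, **kwargs):
    """Run fn once and return (result, elapsed_seconds, peak_bytes)."""
    tracemalloc.start()
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    elapsed = time.perf_counter() - start
    _, peak = tracemalloc.get_traced_memory()  # (current, peak) in bytes
    tracemalloc.stop()
    return result, elapsed, peak


def matmul(a, b):
    """Naive matrix product of two lists of rows (stand-in workload)."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


result, elapsed, peak = profile(matmul, [[1, 2], [3, 4]], [[5, 6], [7, 8]])
print(result, f"{elapsed:.6f} s", f"{peak} bytes peak")
```

`tracemalloc` only sees allocations made through Python's allocator, so it would not account for GPU memory or for buffers allocated by native libraries.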

Age Range Estimation using MTCNN and VGG-Face Model

Apr 17, 2021

Dipesh Gyawali, Prashanga Pokharel, Ashutosh Chauhan, Subodh Chandra Shakya

Abstract: The convolutional neural network (CNN) has amazed us with its usage in several applications. Age range estimation using CNNs is an emerging topic because of its applications in a myriad of areas, which makes it an active area of research aimed at improving estimation accuracy. In our proposed work, a deep CNN model is used to identify people's age range. First, we extracted only face images from the image dataset using MTCNN to remove unnecessary features other than the face from each image. Second, we used the random crop technique for data augmentation to improve model performance. We used the concept of transfer learning in our research. A pretrained face recognition model, i.e., VGG-Face, is used to build our model for identifying age range, and its performance is evaluated on the Adience Benchmark to confirm the efficacy of our work. Performance on the test set exceeded the existing state of the art by substantial margins.

* 11th IEEE International Conference on Computing, Communication and Networking Technologies (ICCCNT), 2020
* 6 pages, 10 figures
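
The random crop augmentation mentioned in the abstract above can be sketched in a few lines. This is a minimal illustration on a nested-list image, not the authors' pipeline; the function name `random_crop`, its parameters, and the toy 4x4 image are assumptions.

```python
import random


def random_crop(image, crop_h, crop_w, rng=random):
    """Return a crop_h x crop_w window taken at a random position.

    `image` is a list of equal-length rows; `rng` may be the `random`
    module or a seeded `random.Random` instance for reproducibility.
    """
    h, w = len(image), len(image[0])
    if crop_h > h or crop_w > w:
        raise ValueError("crop size must not exceed image size")
    top = rng.randint(0, h - crop_h)    # randint is inclusive on both ends
    left = rng.randint(0, w - crop_w)
    return [row[left:left + crop_w] for row in image[top:top + crop_h]]


# Toy 4x4 "image" whose pixel value encodes its position (row * 4 + col).
image = [[r * 4 + c for c in range(4)] for r in range(4)]
for row in random_crop(image, 3, 3, rng=random.Random(0)):
    print(row)
```

Applying a fresh crop each time an image is drawn during training yields slightly different views of the same face, which is the usual motivation for this kind of augmentation.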

Comparative Analysis of Multiple Deep CNN Models for Waste Classification

Apr 05, 2020

Dipesh Gyawali, Alok Regmi, Aatish Shakya, Ashish Gautam, Surendra Shrestha

Abstract: Waste is wealth in the wrong place. Our research focuses on analyzing possibilities for automatic waste sorting and collection in a way that supports further recycling. Various approaches to managing waste are in practice, but they are inefficient and require human intervention. Automatic waste segregation would fill this gap. The project tested well-known deep learning network architectures for waste classification with a dataset combining our own collected images and TrashNet. A convolutional neural network is used for image classification. Hardware built in the form of a dustbin is used to segregate the waste into different compartments. By removing the human effort involved in segregating waste products, the study would save precious time and introduce automation in the area of waste management. Municipal solid waste is a huge, renewable source of energy. The situation is a win-win for government, society, and industrialists. With fine-tuning of the ResNet18 network, the best validation accuracy was found to be 87.8%.

* 6 pages, 13 figures
