استخراج نقشه‌ دو‌بعدی فضاهای داخلی ساختمان با روش‌های یادگیری عمیق از ابرنقاط لیزر ‏اسکنر‌های همراه

حیدری مظفر, مرتضی; دالوند, زهرا

doi:10.22061/jrsgr.2025.12127.1100

تعداد نشریات	11
تعداد شماره‌ها	228
تعداد مقالات	2,322
تعداد مشاهده مقاله	3,684,725
تعداد دریافت فایل اصل مقاله	2,705,882

	استخراج نقشه‌ دو‌بعدی فضاهای داخلی ساختمان با روش‌های یادگیری عمیق از ابرنقاط لیزر ‏اسکنر‌های همراه
پژوهش های سنجش از دور و اطلاعات مکانی
مقالات آماده انتشار، پذیرفته شده، انتشار آنلاین از تاریخ 18 شهریور 1404 اصل مقاله (1.65 M)
نوع مقاله: مقاله پژوهشی
شناسه دیجیتال (DOI): 10.22061/jrsgr.2025.12127.1100
نویسندگان
مرتضی حیدری مظفر^* ¹؛ زهرا دالوند²
¹گروه مهندسی عمران، دانشکده فنی و مهندسی، دانشگاه بوعلی سینا، همدان، ایران
²گروه کامپیوتر، دانشکده مهندسی، دانشگاه بوعلی سینا
تاریخ دریافت: 14 اردیبهشت 1404، تاریخ بازنگری: 11 مرداد 1404، تاریخ پذیرش: 15 شهریور 1404
چکیده
پیشینه و اهداف: استخراج نقشه‌ دوبعدی فضاهای داخلی ساختمان در حوزه‌های مختلف ازجمله معماری، نقشه‌برداری، مدل‌سازی اطلاعات ساختمان (BIM)، رباتیک و واقعیت مجازی نقش حیاتی دارد. اسکنرهای لیزری همراه، ساختار هندسی محیط را با دقت میلی‌متری اندازه‌گیری کرده و نتایج حاصل را به صورت داده‌های ابرنقاط ثبت می‌کنند. داده‌های ابرنقاط، منبع غنی اطلاعات برای استخراج نقشه دوبعدی فضاهای داخلی ساختمان هستند. با‌‌این‌حال، عواملی مانند نویز ناشی از بازتاب‌های سطحی، انسداد دید توسط اجسام درون ساختمان و تراکم غیریکنواخت نقاط، پردازش این نوع داده‌ را با چالش مواجه می‌کنند. در ابتدا استخراج نقشه‌ دوبعدی ساختمان، با تکیه بر روش‌های کلاسیک هندسی انجام می‌شد و در سال‌های اخیر، روش‌های مبتنی بر یادگیری عمیق به‌دلیل توانایی بالا در درک الگوهای پیچیده و مقاومت در برابر نویز به‌طور فزاینده‌ای مورد توجه قرار گرفته‌اند. هدف این پژوهش، ارائه یک چارچوب موثر برای استخراج نقشه‌ دوبعدی فضاهای داخلی ساختمان از داده‌های ابرنقاط با استفاده از روش‌های یادگیری عمیق و مقایسه عملکرد آن با روش‌های کلاسیک است. روش‌ها‌: در این پژوهش، یک چارچوب موثر برای استخراج نقشه دوبعدی فضاهای داخلی ساختمان از داده‌های ابرنقاط پیشنهاد شده است که شامل سه گام متوالی پیش‌پردازش داده‌ها، پیاده‌سازی و ارزیابی نهایی مدل‌ها است. این چارچوب امکان مقایسه مستقیم روش‌های کلاسیک و یادگیری عمیق را در یک بستر مشترک فراهم می‌کند. داده‌های ابرنقاط دارای ساختار گسسته و غیر‌ساخت‌یافته هستند و پردازش مستقیم آن‌ها دشوار است. در گام پیش‌پردازش داده‌ها، ابتدا با نگاشت ابرنقاط به فضای دوبعدی، تصاویر تراکم تولید شد تا پیچیدگی محاسباتی کاهش یابد. درگام دوم، دو مدل یادگیری عمیق U-Net و Pix2Pix و الگوریتم کلاسیک تبدیل هاف پیاده‌سازی شد و تصاویر تراکم به‌عنوان ورودی مشترک این روش‌ها استفاده شد. در گام سوم به منظور ارزیابی عملکرد روش پیشنهادی، آزمایش‌ها روی مجموعه‌داده‌های دردسترس FloorNet و Structure3D انجام شد. داده‌های ورودی به سه مجموعه داده آموزشی، اعتبارسنجی و آزمون تقسیم شده‌اند و به ‌منظور ارتقای تعمیم‌پذیری، تکنیک‌های داده‌افزایی اعمال شده است. ارزیابی عملکرد مدل‌ها با استفاده از معیارهای Dice Score و IoU انجام شد. یافته‌ها: مدل‌های یادگیری عمیق در نمونه‌های فاقد انسداد، عملکرد قابل قبولی داشته و دقت بیش از ۹۰٪ نشان دادند. به‌ویژه، مدل U-Net در مجموعه داده Structure3D به دقت ۹۷٪ در معیار Dice Score دست یافت. با‌این‌حال، مدل‌ها درنمونه‌هایی که شامل عارضه انسداد بودند، نتوانستند نقشه را به طور کامل استخراج کنند. در مقابل، الگوریتم تبدیل هاف در تشخیص خطوط عملکرد قابل قبول داشت اما به‌دلیل عدم تشخیص ساختار توپولوژیک، در تولید خروجی‌های منسجم و قابل‌استفاده برای مدل‌سازی نقشه‌های داخلی محدودیت دارد. هم‌چنین آزمون و خطا برای تنظیم مقادیر پارامترها، باعث افزایش قابل توجه زمان اجرای الگوریتم شد. نتیجه‌گیری: یافته‌های این پژوهش نشان داد که روش‌های یادگیری عمیق، در صورت وجود داده‌های کامل، قادر به استخراج دقیق و ساخت‌یافته نقشه‌ دوبعدی از داده‌های ابرنقاط هستند اما در شرایط واقعی که عارضه انسداد اجتناب‌ناپذیر است، توسعه‌ مدل‌های مقاوم‌ نسبت به داده ناقص ضروری است. برای این منظور بهره‌گیری از معماری‌های ترکیبی، استفاده از منابع اطلاعاتی مکمل نظیر تصاویر RGB یا داده‌های عمق مسیر پژوهش‌های آینده است. چارچوب پیشنهادی این پژوهش، گامی مؤثر در جهت مقایسه‌ی نظام‌مند روش‌های استخراج نقشه دوبعدی و ایجاد بستر توسعه برای مدل‌های پیشرفته‌تر در کاربردهای واقعی تلقی می‌شود.
کلیدواژه‌ها
نقشه دوبعدی؛ ابر نقاط؛ تصویر تراکم؛ یادگیری عمیق‎ ‎‏ ‏
عنوان مقاله [English]
‎2D Floorplan Extraction of Indoor Spaces from Mobile Laser Scanner Point Clouds ‎Using Deep Learning Methods
نویسندگان [English]
M. Heidarimozaffar¹؛ Z. Dalvand²
¹Department of Civil Engineering, Faculty of Engineering, Bu-Ali Sina University, Hamadan, Iran
²Department of Computer Science, Faculty of Engineering, Bu-Ali Sina University, Hamedan, Iran
چکیده [English]
Background and Objectives: The extraction of 2D floorplans of building interiors plays a vital role in various domains, including architecture, surveying, building information modeling (BIM), robotics, and virtual reality. Mobile laser scanners capture the geometric structure of indoor environments with millimeter-level accuracy and record the results as point cloud data. Point clouds are a rich source of information for generating 2D floorplans of indoor spaces. However, surface reflection noise, occlusions caused by indoor objects, and the non-uniform density of points pose significant challenges for processing such data. Initially, 2D floorplan extraction relied on classical geometric methods. In recent years, however, deep learning-based approaches have gained increasing attention due to their strong ability to understand complex patterns and their robustness to noise. The main objective of this study is to present an effective framework for extracting 2D floorplans of building interiors from point cloud data using deep learning methods and to compare its performance with that of classical techniques. Methods: In this study, an effective framework is proposed for extracting 2D floorplans of indoor building spaces from point cloud data, consisting of three sequential steps: data preprocessing, model implementation, and final evaluation. This framework enables a direct comparison between classical methods and deep learning approaches within a unified setting. Point cloud data are inherently discrete and unstructured, making direct processing challenging. In the preprocessing step, point clouds were projected onto a 2D space to generate density images, thereby reducing computational complexity. In the second step, two deep learning models, U-Net and Pix2Pix, as well as the classical Hough Transform algorithm, were implemented, with the density images serving as a common input for all methods. In the third step, the proposed framework was evaluated using publicly available datasets, including FloorNet and Structure3D. The input data were split into training, validation, and test sets, and data augmentation techniques were applied to improve model generalization. The performance of the models was assessed using the Dice Score and Intersection over Union (IoU) metrics. Findings: Deep learning models demonstrated satisfactory performance on samples without occlusions, achieving accuracy levels above 90%. In particular, the U-Net model achieved a Dice Score of 97% on the Structure3D dataset. However, in samples containing occlusions, the models were unable to fully extract the floorplans. In contrast, the Hough Transform algorithm performed reasonably well in line detection but exhibited limitations in generating coherent and topologically valid outputs suitable for indoor map modeling due to its inability to capture topological structure. Moreover, the trial-and-error process required to tune the algorithm’s parameters significantly increased its runtime. Conclusion: The findings of this study indicate that deep learning methods, when provided with complete data, are capable of accurately and structurally extracting 2D floorplans from point clouds. However, under real-world conditions where occlusion is inevitable, developing models that are robust to incomplete data becomes essential. To address this challenge, future research directions include employing hybrid architectures and incorporating complementary data sources such as RGB images or depth maps. The proposed framework in this study serves as an effective step toward the systematic comparison of 2D floorplan extraction methods and provides a foundation for developing more advanced models suitable for real-world applications.
کلیدواژه‌ها [English]
Floorplan, Point Clouds, Density IMAGE, Deep Learning

مراجع
[1] Poux F. The smart point cloud: Structuring 3D intelligent point data [dissertation]: Universite de Liege (Belgium); 2019. doi:10.13140/RG.2.2.20457.75367. [2] Heidari Mozaffar M, Varshosaz M, Saadatseresht M. Combining 2D and 3D Edges for Detecting Occluded Regions in Terrestrial Laser Scanner Point Clouds. Journal of Soft Computing and Information Technology. 2015;4(4):64-76. [3] Westoby M, Brasington J, Glasser N, Hambrey M, Reynolds M. ‘Structure-from-Motion’photogrammetry: A low-cost, effective tool for geoscience applications. Geomorphology. 2012;179:300-314. doi:10.1016/j.geomorph.2012.08.021. [4] McManamon P. Lidar technologies and Systems. USA: Society of Photo-Optical Instrumentation Engineers (SPIE); 2019. doi:10.1117/3.2518254 [5] Riveiro B, Lindenbergh R. Laser scanning: an emerging technology in structural engineering. China: CRC Press/Balkema; 2019. doi:10.1201/9781351018869 [6] Heidari Mozaffar M, Hossein SA. Extracting FaçadePoints of Urban Buildings from Mobile Laser Scanner Point Clouds. Remote Sensing and Geospatial Information Research. 2024;2(2):179-186. doi:10.22061/jrsgr.2023.1990. [7] Lu Q, Lee S. A semi-automatic approach to detect structural components from CAD drawings for constructing as-is BIM objects. Computing in Civil Engineering 2017. p. 84-91. doi:10.1061/9780784480823.011. [8] Kumar A, Masane Y, Dhakhwa S. Disaster management plan for residential-cum-commercial property in Mumbai: a case study. Environment, Development and Sustainability. 2021;23(8):12598-12614. doi:10.1007/s10668-020-01129-7. [9] Gerstweiler G, Furlan L, Timofeev M, Kaufmann H. Extraction of structural and semantic data from 2D floor plans for interactive and immersive VR real estate exploration. Technologies. 2018;6(4):101. doi:10.3390/technologies6040101. [10] Pexman K. Automated Floor Plan and Building Model Creation for Cultural Heritage Buildings from Laser Scanner Data. 2021. [11] Guo Y, Wang H, Hu Q, Liu H, Liu L, Bennamoun M. Deep learning for 3d point clouds: A survey. IEEE transactions on pattern analysis and machine intelligence. 2020;43(12):4338-4364. doi:10.1109/TPAMI.2020.3005434. [12] Golparvar F, Mohammadzadeh A. Semantic Segmentation of point cloud data acquired from Industrial Sites using Terrestrial Laser Scanner based on Graph Methods. Iran: K. N. Toosi University; 2021. [13] Pizarro P, Hitschfeld N, Sipiran I, Saavedra J. Automatic floor plan analysis and recognition. Automation in Construction. 2022;140:104-348. doi:10.1016/j.autcon.2022.104348. [14] Cai R, Li H, Xie J, Jin X. Accurate floorplan reconstruction using geometric priors. Computers & Graphics. 2022;102:360-369. doi:10.1016/j.cag.2021.10.011. [15] Fang H, Lafarge F, Pan C, Huang H. Floorplan generation from 3D point clouds: A space partitioning approach. ISPRS Journal of Photogrammetry and Remote Sensing. 2021;175:44-55. doi:10.1016/j.isprsjprs.2021.02.012. [16] Alpaydin E. Introduction to machine learning. Cambridge, MA: MIT press; 2020. doi:10.7551/mitpress/13811.001.0001 [17] Kim M, Lee D, Kim T, Oh S, Cho H. Automated extraction of geometric primitives with solid lines from unstructured point clouds for creating digital buildings models. Automation in Construction. 2023;145:104642. doi:10.1016/j.autcon.2022.104642. [18] Chen J, Qian Y, Furukawa Y. Heat: Holistic edge attention transformer for structured reconstruction. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition; 2022. doi:10.1109/CVPR52688.2022.00377. [19] Su J, Tung K, Peng C, Wonka P, Chu H. Slibo-net: Floorplan reconstruction via slicing box representation with local geometry regularization. Advances in Neural Information Processing Systems. 2023;36:48781-48792. [20] Yue Y, Kontogianni T, Schindler K, Engelmann F. Connecting the dots: Floorplan reconstruction using two-level queries. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; 2023. [21] Zheng J, Zhang J, Li J, Tang R, Gao S, Zhou Z. Structured3d: A large photo-realistic dataset for structured 3d modeling. Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part IX 16; 2020: Springer. doi:10.1007/978-3-030-58545-7_30. [22] Liu C, Wu J, Furukawa Y. Floornet: A unified framework for floorplan reconstruction from 3d scans. Proceedings of the European conference on computer vision (ECCV); 2018. Cham: Springer. doi:10.13140/RG.2.2.20457.75367. [23] Aggarwal N, Karl W. Line detection in images through regularized Hough transform. IEEE transactions on image processing. 2006;15(3):582-591. doi:10.1109/TIP.2005.863021. [24] Yang R, Deng C, Yu K, Li Z, Pan L. A new way for cartesian coordinate transformation and its precision evaluation. Remote Sensing. 2022;14(4):864. doi:10.3390/rs14040864. [25] Ronneberger O, Fischer P, Brox T. U-net: Convolutional networks for biomedical image segmentation. Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III 18; 2015. Cham: Springer. doi:10.1007/978-3-319-24574-4_28. [26] Goodfellow I, Bengio Y, Courville A, Bengio Y. Deep learning. USA: MIT press Cambridge; 2016. doi:10.4258/hir.2016.22.4.351 [27] Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research. 2014;15(1):1929-1958. doi:10.5555/2627435.2670313. [28] Isola P, Zhu J, Zhou T, Efros A. Image-to-image translation with conditional adversarial networks. Proceedings of the IEEE conference on computer vision and pattern recognition; 2017. doi:10.1109/CVPR.2017.632. [29] Nambiyar A, Kannan R. Defective Edge Detection Using Cascaded Ensemble Canny Operator. arXiv preprint arXiv:241114868. 2024. doi:10.48550/arXiv.2411.14868. [30] Sathyanarayanan S, Tantri B. Confusion matrix-based performance evaluation metrics. African Journal of Biomedical Research. 2024:4023-4031. doi:10.53555/AJBR.v27i4S.4345. [31] Heidari Mozaffar M, Varshosaz M, Saadatseresht M. Occlusion Area as Suitable Guidance for Terrestrial Laser Scanner Localization. ISSGE. 2020;10(2):199-207. doi:20.1001.1.2322102.1399.10.2.15.8.
آمار تعداد مشاهده مقاله: 153 تعداد دریافت فایل اصل مقاله: 86

سامانه مدیریت نشریات علمی. طراحی و پیاده سازی از سیناوب

پیوندهای مفید

آمار

استخراج نقشه‌ دو‌بعدی فضاهای داخلی ساختمان با روش‌های یادگیری عمیق از ابرنقاط لیزر ‏اسکنر‌های همراه