Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,142 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Research Articles by Furkan Gözükara
|
| 2 |
+
|
| 3 |
+
A curated collection of research articles and theses by **Furkan Gözükara** and collaborators, spanning **2012-2025**.
|
| 4 |
+
|
| 5 |
+
- Furkan Gözükara on X : https://x.com/FurkanGozukara
|
| 6 |
+
- Furkan Gözükara on Google Scholar : https://scholar.google.com/citations?view_op=list_works&hl=en&hl=en&user=_2_KAUsAAAAJ
|
| 7 |
+
- Furkan Gözükara on LinkedIn : https://www.linkedin.com/in/furkangozukara/
|
| 8 |
+
- Furkan Gözükara on YouTube : https://www.youtube.com/SECourses
|
| 9 |
+
- Furkan Gözükara on Medium : https://medium.com/@furkangozukara
|
| 10 |
+
|
| 11 |
+
## At a Glance
|
| 12 |
+
|
| 13 |
+
- **10 works** across journal articles, an MSc thesis, and a PhD thesis
|
| 14 |
+
- Core themes: **product search**, **record linkage**, **focused web crawling**, **sentiment analysis**, **cyber forensics**, and **human-computer interaction**
|
| 15 |
+
- Includes both **method papers** and **full-system theses** that connect crawling, normalization, matching, ranking, and evaluation
|
| 16 |
+
|
| 17 |
+
## Research Themes
|
| 18 |
+
|
| 19 |
+
- E-commerce search, comparison shopping, and product intelligence
|
| 20 |
+
- Product identity clustering, record linkage, and noisy-data normalization
|
| 21 |
+
- Focused web crawling and large-scale data extraction
|
| 22 |
+
- Sentiment analysis for Turkish and English text
|
| 23 |
+
- Cyber forensics and evidentiary risk analysis
|
| 24 |
+
- Air-writing recognition and human-computer interaction
|
| 25 |
+
|
| 26 |
+
## Quick Index
|
| 27 |
+
|
| 28 |
+
| Year | Title | PDF | Type | Venue / Source | Focus |
|
| 29 |
+
|---|---|---|---|---|---|
|
| 30 |
+
| 2025 | [Letter and Person Recognition in Freeform Air-Writing Using Machine Learning Algorithms](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/Letter_and_Person_Recognition_in_Freeform_Air-Writing_Using_Machine_Learning_Algorithms.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Letter_and_Person_Recognition_in_Freeform_Air-Writing_Using_Machine_Learning_Algorithms.pdf) | Journal article | IEEE Access, Vol. 13 | Air-writing, person recognition |
|
| 31 |
+
| 2021 | [An Incremental Hierarchical Clustering Based System For Record Linkage In E-Commerce Domain](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/An_Incremental_Hierarchical_Clustering_Based_System_For_Record_Linkage_In_E-Commerce_Domain.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/An_Incremental_Hierarchical_Clustering_Based_System_For_Record_Linkage_In_E-Commerce_Domain.pdf) | Journal article | The Computer Journal *(uploaded PDF is an advance-article version)* | Record linkage, product matching |
|
| 32 |
+
| 2021 | [Challenges and Possible Severe Legal Consequences of Application Users Identification from CNG-Logs](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/Challenges_And_Possible_Severe_Legal_Consequences_Of_Application_Users_Identification_From_Cng-Logs.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Challenges_And_Possible_Severe_Legal_Consequences_Of_Application_Users_Identification_From_Cng-Logs.pdf) | Journal article | Forensic Science International: Digital Investigation, Vol. 39 | CGNAT / cyber forensics |
|
| 33 |
+
| 2017 | [Efficient Feature Selection for Product Labeling over Unstructured Data](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/Efficient_Feature_Selection_For_Product_Labeling_Over_Unstructured_Data.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Efficient_Feature_Selection_For_Product_Labeling_Over_Unstructured_Data.pdf) | Journal article | IJACSA, Vol. 8, No. 7 | Feature selection, clustering |
|
| 34 |
+
| 2017 | [Focused Web Crawler Development Challenges: ECCrawler](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/Focused_Web_Crawler_Development_Challenges_Eccrawler.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Focused_Web_Crawler_Development_Challenges_Eccrawler.pdf) | Journal article | International Journal of Computer Science and Engineering, Vol. 6, Issue 1 | Focused crawling, systems engineering |
|
| 35 |
+
| 2016 | [An Experimental Investigation of Document Vector Computation Methods for Sentiment Analysis of Turkish and English Reviews](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/An_Experimental_Investigation_Of_Document_Vector_Computation_Methods_For_Sentiment_Analysis_Of_Turkish_And_English_Reviews.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/An_Experimental_Investigation_Of_Document_Vector_Computation_Methods_For_Sentiment_Analysis_Of_Turkish_And_English_Reviews.pdf) | Journal article | Çukurova University Journal of the Faculty of Engineering and Architecture, Vol. 31, No. 2 | Sentiment analysis |
|
| 36 |
+
| 2016 | [A Product Search Engine Supporting "Best Product" Queries](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/A_Product_Search_Engine_Supporting_Best_Product_Queries.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/A_Product_Search_Engine_Supporting_Best_Product_Queries.pdf) | Journal article | Çukurova University Journal of the Faculty of Engineering and Architecture, Vol. 31, Special Issue 2 | Product ranking, query processing |
|
| 37 |
+
| 2016 | [Product Search Engine Using Product Name Recognition and Sentiment Analysis](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/Product_Search_Engine_Using_Product_Name_Recognition_And_Sentiment_Analysis.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Product_Search_Engine_Using_Product_Name_Recognition_And_Sentiment_Analysis.pdf) | PhD thesis | Çukurova University | Full product-search-engine architecture |
|
| 38 |
+
| 2015 | [New Metrics for Clustering of Identical Products over Imperfect Data](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/New_Metrics_For_Clustering_Of_Identical_Products_Over_Imperfect_Data.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/New_Metrics_For_Clustering_Of_Identical_Products_Over_Imperfect_Data.pdf) | Journal article | Turkish Journal of Electrical Engineering and Computer Sciences, Vol. 23, No. 4 | Similarity metrics, evaluation |
|
| 39 |
+
| 2012 | [Fiyat Karşılaştırmalı Ürün Arama Motoru Geliştirme](https://huggingface.co/MonsterMMORPG/ResearchArticles/blob/main/Fiyat_Kar%C5%9F%C4%B1la%C5%9Ft%C4%B1rmal%C4%B1_%C3%9Cr%C3%BCn_Arama_Motoru_Geli%C5%9Ftirme.pdf) | [PDF](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Fiyat_Kar%C5%9F%C4%B1la%C5%9Ft%C4%B1rmal%C4%B1_%C3%9Cr%C3%BCn_Arama_Motoru_Geli%C5%9Ftirme.pdf) | MSc thesis | Mersin University | Price-comparison search engine |
|
| 40 |
+
|
| 41 |
+
## Detailed Timeline
|
| 42 |
+
|
| 43 |
+
<a id="2025-air-writing"></a>
|
| 44 |
+
|
| 45 |
+
### 2025 - [Letter and Person Recognition in Freeform Air-Writing Using Machine Learning Algorithms](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Letter_and_Person_Recognition_in_Freeform_Air-Writing_Using_Machine_Learning_Algorithms.pdf)
|
| 46 |
+
|
| 47 |
+
**Type:** Journal article
|
| 48 |
+
**Venue:** IEEE Access, Vol. 13
|
| 49 |
+
**Focus:** Air-writing, letter recognition, person recognition, IMU-based interaction
|
| 50 |
+
|
| 51 |
+
This paper introduces a wearable-glove pipeline for **freeform air-writing analysis** that jointly models **letter recognition** and **writer recognition**. It uses IMU signals, Fourier and wavelet feature extraction, and multiple machine-learning baselines, while also contributing a **public Turkish alphabet air-writing dataset**. The study reports that **SubSpace KNN** performs best under the tested settings.
|
| 52 |
+
|
| 53 |
+
<a id="2021-record-linkage"></a>
|
| 54 |
+
|
| 55 |
+
### 2021 - [An Incremental Hierarchical Clustering Based System For Record Linkage In E-Commerce Domain](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/An_Incremental_Hierarchical_Clustering_Based_System_For_Record_Linkage_In_E-Commerce_Domain.pdf)
|
| 56 |
+
|
| 57 |
+
**Type:** Journal article
|
| 58 |
+
**Venue:** The Computer Journal *(the uploaded PDF is an advance-article version dated 2021 rather than a later issue-formatted PDF)*
|
| 59 |
+
**Focus:** Record linkage, incremental clustering, product-title matching
|
| 60 |
+
|
| 61 |
+
This work presents a **dynamic / incremental Hierarchical Agglomerative Clustering (HAC)** system for grouping identical products crawled from different e-commerce websites. The method uses **bag-of-words title representations**, **domain-specific matching / filtering**, and **ELKI-based evaluation**, and reports **96.25% F-measure** on the experimental setup. The paper also emphasizes **dataset release** and evaluation reproducibility.
|
| 62 |
+
|
| 63 |
+
<a id="2021-cgn-logs"></a>
|
| 64 |
+
|
| 65 |
+
### 2021 - [Challenges and Possible Severe Legal Consequences of Application Users Identification from CNG-Logs](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Challenges_And_Possible_Severe_Legal_Consequences_Of_Application_Users_Identification_From_Cng-Logs.pdf)
|
| 66 |
+
|
| 67 |
+
**Type:** Journal article
|
| 68 |
+
**Venue:** Forensic Science International: Digital Investigation, Vol. 39
|
| 69 |
+
**Focus:** CGNAT, reverse tracking, cyber forensics, evidentiary risk
|
| 70 |
+
|
| 71 |
+
This paper studies how **carrier-grade NAT / CGNAT logs** can be misused in reverse-tracking workflows and how such misuse can lead to **false attribution** in criminal investigations. Using the **ByLock** case context in Turkey and a comparison with **EncroChat**, it analyzes the technical and legal consequences of flawed identification pipelines.
|
| 72 |
+
|
| 73 |
+
<a id="2017-feature-selection"></a>
|
| 74 |
+
|
| 75 |
+
### 2017 - [Efficient Feature Selection for Product Labeling over Unstructured Data](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Efficient_Feature_Selection_For_Product_Labeling_Over_Unstructured_Data.pdf)
|
| 76 |
+
|
| 77 |
+
**Type:** Journal article
|
| 78 |
+
**Venue:** International Journal of Advanced Computer Science and Applications (IJACSA), Vol. 8, No. 7
|
| 79 |
+
**Focus:** Feature selection, product labeling, clustering under unstructured data
|
| 80 |
+
|
| 81 |
+
This study proposes a **feature-selection algorithm** for labeling identical products collected from noisy, heterogeneous web sources. The paper frames product labeling as a **clustering problem over unstructured feature vectors** and shows that the proposed method improves clustering quality compared with baseline approaches.
|
| 82 |
+
|
| 83 |
+
<a id="2017-eccrawler"></a>
|
| 84 |
+
|
| 85 |
+
### 2017 - [Focused Web Crawler Development Challenges: ECCrawler](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Focused_Web_Crawler_Development_Challenges_Eccrawler.pdf)
|
| 86 |
+
|
| 87 |
+
**Type:** Journal article
|
| 88 |
+
**Venue:** International Journal of Computer Science and Engineering, Vol. 6, Issue 1
|
| 89 |
+
**Focus:** Focused crawling, multithreading, .NET systems engineering
|
| 90 |
+
|
| 91 |
+
This paper documents the engineering of **EcCrawler**, a hand-crafted focused crawler for e-commerce websites built with **C#**, **.NET 4.5**, and **MS-SQL Server 2014**. It focuses on practical implementation topics such as **threading**, **exception handling**, **HTTP compression**, **duplicate handling**, and **database communication**, and reports **over 400% crawling-speed improvement** and **over 100% UI-responsiveness improvement** from the proposed optimizations.
|
| 92 |
+
|
| 93 |
+
<a id="2016-document-vectors"></a>
|
| 94 |
+
|
| 95 |
+
### 2016 - [An Experimental Investigation of Document Vector Computation Methods for Sentiment Analysis of Turkish and English Reviews](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/An_Experimental_Investigation_Of_Document_Vector_Computation_Methods_For_Sentiment_Analysis_Of_Turkish_And_English_Reviews.pdf)
|
| 96 |
+
|
| 97 |
+
**Type:** Journal article
|
| 98 |
+
**Venue:** Çukurova University Journal of the Faculty of Engineering and Architecture, Vol. 31, No. 2
|
| 99 |
+
**Focus:** Sentiment analysis, vectorization, feature selection, Turkish and English reviews
|
| 100 |
+
|
| 101 |
+
This article compares **document-vector construction choices** for sentiment analysis, including **TF / TF-IDF variants**, **tokenization**, **feature selection**, **preprocessing**, and **vector normalization** under an **SVM** classifier. On the collected Turkish product-reviews dataset, it reports a best result of **91.33% accuracy**.
|
| 102 |
+
|
| 103 |
+
<a id="2016-best-product-queries"></a>
|
| 104 |
+
|
| 105 |
+
### 2016 - [A Product Search Engine Supporting "Best Product" Queries](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/A_Product_Search_Engine_Supporting_Best_Product_Queries.pdf)
|
| 106 |
+
|
| 107 |
+
**Type:** Journal article
|
| 108 |
+
**Venue:** Çukurova University Journal of the Faculty of Engineering and Architecture, Vol. 31, Special Issue 2
|
| 109 |
+
**Focus:** Product ranking, comparison shopping, query processing
|
| 110 |
+
|
| 111 |
+
This work presents a product-search-engine system that supports **"find the best products for a given category"** queries. The system integrates a **focused crawler**, **record linkage**, **sentiment analysis**, and a **query engine**, and reports **96.25% F-measure** in record linkage together with **100% precision** in the evaluated most-related-products search setting.
|
| 112 |
+
|
| 113 |
+
<a id="2016-product-search-thesis"></a>
|
| 114 |
+
|
| 115 |
+
### 2016 - [Product Search Engine Using Product Name Recognition and Sentiment Analysis](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Product_Search_Engine_Using_Product_Name_Recognition_And_Sentiment_Analysis.pdf)
|
| 116 |
+
|
| 117 |
+
**Type:** PhD thesis
|
| 118 |
+
**Institution:** Çukurova University, Department of Computer Engineering
|
| 119 |
+
**Focus:** End-to-end product search engine architecture
|
| 120 |
+
|
| 121 |
+
This dissertation brings the main threads of the repository together into a **full product search engine**: focused crawling, product-name matching / record linkage, sentiment analysis, and a user-facing search system. The abstract reports **472% crawler performance boost**, **91.08% sentiment-analysis accuracy**, **96.25% F-measure** for record linkage, and **100% precision** for most-related-products search in the thesis setup.
|
| 122 |
+
|
| 123 |
+
<a id="2015-clustering-metrics"></a>
|
| 124 |
+
|
| 125 |
+
### 2015 - [New Metrics for Clustering of Identical Products over Imperfect Data](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/New_Metrics_For_Clustering_Of_Identical_Products_Over_Imperfect_Data.pdf)
|
| 126 |
+
|
| 127 |
+
**Type:** Journal article
|
| 128 |
+
**Venue:** Turkish Journal of Electrical Engineering and Computer Sciences, Vol. 23, No. 4
|
| 129 |
+
**Focus:** Similarity metrics, performance metrics, imperfect web-crawled product data
|
| 130 |
+
|
| 131 |
+
This paper formalizes **product identity-clustering** for web-crawled commercial products described by noisy, incomplete, and structurally inconsistent data. It proposes **new similarity metrics** and **new evaluation metrics** for this setting and shows that legacy measures such as Euclidean and cosine similarity are weaker on the tested product-clustering problem.
|
| 132 |
+
|
| 133 |
+
<a id="2012-msc-thesis"></a>
|
| 134 |
+
|
| 135 |
+
### 2012 - [Fiyat Karşılaştırmalı Ürün Arama Motoru Geliştirme](https://huggingface.co/MonsterMMORPG/ResearchArticles/resolve/main/Fiyat_Kar%C5%9F%C4%B1la%C5%9Ft%C4%B1rmal%C4%B1_%C3%9Cr%C3%BCn_Arama_Motoru_Geli%C5%9Ftirme.pdf)
|
| 136 |
+
|
| 137 |
+
**Type:** MSc thesis
|
| 138 |
+
**Institution:** Mersin University, Department of Computer Engineering
|
| 139 |
+
**Focus:** Price-comparison search, normalization, feature extraction, clustering
|
| 140 |
+
|
| 141 |
+
This master's thesis lays the early foundation for a **price-comparison product search engine**. It covers **focused collection of product data**, **noise removal / normalization**, **feature-vector extraction**, and **clustering of identical products** across sources, and it also includes an English abstract under the title **"Developing Product Price Comparison Search Engine."**
|
| 142 |
+
|