|
|
| --- |
| tags: |
| - bertopic |
| library_name: bertopic |
| pipeline_tag: text-classification |
| --- |
| |
| # TopicModelling |
|
|
| This is a [BERTopic](https://github.com/MaartenGr/BERTopic) model. |
| BERTopic is a flexible, modular topic modeling framework that generates easily interpretable topics from large datasets. |
|
|
| ## Usage |
|
|
| To use this model, please install BERTopic: |
|
|
| ``` |
| pip install -U bertopic |
| ``` |
|
|
| You can use the model as follows: |
|
|
| ```python |
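| from bertopic import BERTopic |
| |
| # Download the model from the Hugging Face Hub and load it |
| topic_model = BERTopic.load("heyitskim1912/TopicModelling") |
| |
| # Returns a DataFrame with one row per topic |
| topic_model.get_topic_info() |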
| ``` |
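
To assign topics to new text, a fitted BERTopic model exposes a `transform` method that returns predicted topic IDs together with probabilities. The following is a minimal sketch and not part of the original card: the helper names `assign_topics` and `demo` and the example sentence are hypothetical, and it assumes the loaded model can embed new documents (depending on how the model was saved, this may require passing an `embedding_model` argument to `BERTopic.load`).

```python
def assign_topics(topic_model, docs):
    """Pair each document with the topic ID predicted by the model.

    `topic_model` is expected to be a fitted BERTopic instance and `docs`
    a list of strings. Topic -1 is BERTopic's outlier topic.
    """
    topics, _probabilities = topic_model.transform(docs)
    return [(doc, int(topic)) for doc, topic in zip(docs, topics)]


def demo():
    # Imported here so assign_topics can be reused without loading a model.
    from bertopic import BERTopic

    topic_model = BERTopic.load("heyitskim1912/TopicModelling")
    # Hypothetical input sentence, not taken from the training data.
    docs = ["Operating margin was lower than expected this quarter."]
    for doc, topic in assign_topics(topic_model, docs):
        print(topic, doc)
```

Calling `demo()` downloads the model and prints the predicted topic ID next to each document; the IDs can be looked up in the topic overview.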
|
|
| ## Topic overview |
|
|
| * Number of topics: 36 (including the outlier topic, ID -1) |
| * Number of training documents: 1845 |
|
|
| <details> |
| <summary>Click here for an overview of all topics.</summary> |
| |
| | Topic ID | Topic Keywords | Topic Frequency | Label | |
| |----------|----------------|-----------------|-------| |
| | -1 | growth rate - immersive - subscriber - future - long term | 10 | Outliers | |
| | 0 | expense growth - quarter versus - operating margin - lower expected - headwind | 564 | Decreased Operating Income and Higher Expenses | |
| | 1 | going forward - pricing actions - we've - latin america - subscriber base | 153 | Cost Drivers and Pricing Strategy | |
| | 2 | distributed computing - process automation - windows 11 - power platform - active users | 146 | Digital Transformation | |
| | 3 | strong prior - continued strength - benefiting - revenue increased - driven growth | 128 | Revenue Growth and Performance | |
| | 4 | revenue growth - growth cable - growth sequentially - operating results - increased versus | 123 | Operating Results and Revenue Performance | |
| | 5 | growth strategy - profitability - efficiencies - resilience - significant progress | 66 | Resilience and Growth Strategy in Challenging Times | |
| | 6 | innovate - diversify - expanding opportunity - leveraging - customization | 50 | Driving Growth through Innovation and Expansion | |
| | 7 | personalizing experiences - transformative - experiences make - unparalleled - human connection | 48 | Transforming Entertainment through Unparalleled Storytelling | |
| | 8 | growth continued - strong job - demand strong - revenue growth - momentum microsoft | 45 | Sustained Growth and Engagement in Gaming and Office Consumer | |
| | 9 | guidance provided - year 2022 - integration costs - reopening - based current | 31 | Forward-looking Statements and Cautionary Statements | |
| | 10 | unwavering - reinvention - human connection - strategic investments - leadership team | 29 | Commitment to Reinvention and Partner Support | |
| | 11 | disneyland paris - entire quarter - including shanghai - navigating - limited number | 27 | Impact of Pandemic on Disney Theme Parks | |
| | 12 | staffing issues - shutdowns china - quarter diluted - onset pandemic - impacting | 26 | Supply Chain Disruptions and Inflationary Costs during Omicron Variant | |
| | 13 | strong demand - really strong - growth driving - great content - opportunity market | 26 | Strong Demand and Increased Per Capita Spending | |
| | 14 | improvement cloud - increased slightly - improvements azure - margin businesses - revenue mix | 25 | Improved Gross Margin Percentage in Cloud Services | |
| | 15 | star wars - pixar - jungle cruise - franchises including - disney day | 25 | Upcoming Content Expansion and Exciting Releases | |
| | 16 | consistently strong - continued growth - strong revenue - strong quarter - stellar performance | 25 | Revenue Growth and Performance | |
| | 17 | optimize - believe prudent - singularly focused - enhancements - executing reinvention | 24 | Commitment to Reinvention and Partner Support | |
| | 18 | customer experience - consumer products - iced coffee - coffee innovation - selling iced | 24 | Digital Engagement | |
| | 19 | strong food - diverse customer - strategically manage - growth digital - marketing solutions | 21 | Audience Engagement | |
| | 20 | espn advertising - continue perform - entertainment titles - nba finals - general entertainment | 20 | Sports Broadcasting and Streaming Success | |
| | 21 | innovation driving - growth continued - drive healthy - advanced security - demand microsoft | 19 | Microsoft Windows Commercial Products and Cloud Services Growth | |
| | 22 | revenue growth - segment revenue - enterprise services - productivity business - personal computing | 18 | Microsoft Revenue Outlook by Business Segments | |
| | 23 | customizing - experience possible - connect enables - feel good - loyalty program | 18 | Digital Rewards and Enhanced Customer Experience | |
| | 24 | strong annuity - significant growth - strong execution - demand strong - increased commitment | 17 | Microsoft Windows Commercial Products and Cloud Services Growth | |
| | 25 | driven growth - rewards program - growth asia - new stores - business q4 | 17 | Expansion in China | |
| | 26 | million subscriptions - million subscribers - sales mix - paid subscribers - q1 results | 16 | Continuous Growth of Subscriptions and ESPN+ | |
| | 27 | intensify - remain stable - stronger expected - fy2022 - anticipated half | 14 | Continuous Growth of Subscriptions and ESPN+ | |
| | 28 | increased slightly - increased constant - income increased - increased 22 - margin dollars | 14 | Significant Growth in Operating Income and Expenses | |
| | 29 | resonating - windows 11 - disneyland paris - alluded - advertisers | 14 | Successful Launch of Disney+ Ad-Supported Subscription | |
| | 30 | growing demand - increase versus - support growing - simplifying - fiscal 2023 | 13 | Investment in Cloud Services to Meet Growing Demand | |
| | 31 | experiences make - accretive business - integrations teams - connect enables - metrics | 13 | Reinventing Retail Partner Experience for Growth | |
| | 32 | excellence innovation - strong momentum - innovation audience - storytelling excellence - leading benefits | 13 | Driving Performance and Innovation through Strategic Investments | |
| | 33 | metrics - non gaap - currency dollars - fiscal 2022 - constant currency | 12 | Currency-Based Outlook for Performance Forecasting | |
| | 34 | robust demand - strong performance - driven growth - growth global - revenue increased | 11 | Strong Revenue Growth in Channel Development | |
| |
| </details> |
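
The frequencies in the overview table can be used to see how unevenly documents are spread across topics. The sketch below copies the Topic Frequency column into a plain dictionary (the names `topic_frequency` and `topic_share` are illustrative, not part of BERTopic) and computes each topic's share of the training documents.

```python
# Topic frequencies copied from the overview table, keyed by topic ID.
topic_frequency = {
    -1: 10, 0: 564, 1: 153, 2: 146, 3: 128, 4: 123, 5: 66, 6: 50, 7: 48,
    8: 45, 9: 31, 10: 29, 11: 27, 12: 26, 13: 26, 14: 25, 15: 25, 16: 25,
    17: 24, 18: 24, 19: 21, 20: 20, 21: 19, 22: 18, 23: 18, 24: 17, 25: 17,
    26: 16, 27: 14, 28: 14, 29: 14, 30: 13, 31: 13, 32: 13, 33: 12, 34: 11,
}


def topic_share(topic_id, frequencies=topic_frequency):
    """Return the fraction of training documents assigned to `topic_id`."""
    return frequencies[topic_id] / sum(frequencies.values())


total_documents = sum(topic_frequency.values())
print(total_documents)                   # 1845, matching the card
print(round(100 * topic_share(0), 1))    # 30.6 (% of documents in topic 0)
print(round(100 * topic_share(-1), 1))   # 0.5 (% of documents marked as outliers)
```

Topic 0 alone covers roughly 30% of the documents, while most other topics hold fewer than 30 documents each.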
|
|
| ## Training hyperparameters |
|
|
| * calculate_probabilities: True |
| * language: None |
| * low_memory: False |
| * min_topic_size: 10 |
| * n_gram_range: (1, 1) |
| * nr_topics: None |
| * seed_topic_list: None |
| * top_n_words: 10 |
| * verbose: True |
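
The hyperparameters above correspond to keyword arguments of the `BERTopic` constructor. As a hedged sketch, a model with the same settings could be configured as shown below. The names `HYPERPARAMETERS` and `build_topic_model` are illustrative, and the card does not list the embedding model, UMAP settings, or HDBSCAN settings, so this sketch falls back on BERTopic's defaults for those and will not necessarily reproduce the published topics.

```python
# Hyperparameters as listed on this card.
HYPERPARAMETERS = {
    "calculate_probabilities": True,
    "language": None,
    "low_memory": False,
    "min_topic_size": 10,
    "n_gram_range": (1, 1),
    "nr_topics": None,
    "seed_topic_list": None,
    "top_n_words": 10,
    "verbose": True,
}


def build_topic_model():
    """Create an unfitted BERTopic model with the card's hyperparameters."""
    # Imported lazily so the settings can be inspected without bertopic installed.
    from bertopic import BERTopic

    return BERTopic(**HYPERPARAMETERS)
```

The returned model is untrained; fitting it requires your own documents, for example `build_topic_model().fit_transform(docs)`.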
| |
| ## Framework versions |
| |
| * Numpy: 1.22.4 |
| * HDBSCAN: 0.8.29 |
| * UMAP: 0.5.3 |
| * Pandas: 1.5.3 |
| * Scikit-Learn: 1.2.2 |
| * Sentence-transformers: 2.2.2 |
| * Transformers: 4.30.2 |
| * Numba: 0.56.4 |
| * Plotly: 5.13.1 |
| * Python: 3.10.12 |
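
Loading a saved model can be sensitive to library versions, so it may help to compare the local environment with the versions listed above. The following sketch uses the standard library's `importlib.metadata`. The distribution names (for example `umap-learn` and `scikit-learn`) are the usual PyPI names and are an assumption, since the card lists only display names; `CARD_VERSIONS` and `compare_versions` are illustrative names.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed on this card, keyed by assumed PyPI distribution name.
CARD_VERSIONS = {
    "numpy": "1.22.4",
    "hdbscan": "0.8.29",
    "umap-learn": "0.5.3",
    "pandas": "1.5.3",
    "scikit-learn": "1.2.2",
    "sentence-transformers": "2.2.2",
    "transformers": "4.30.2",
    "numba": "0.56.4",
    "plotly": "5.13.1",
}


def compare_versions(expected):
    """Map each package whose installed version differs to (expected, installed).

    `installed` is None when the package is not installed at all.
    """
    differences = {}
    for package, wanted in expected.items():
        try:
            installed = version(package)
        except PackageNotFoundError:
            installed = None
        if installed != wanted:
            differences[package] = (wanted, installed)
    return differences


for package, (wanted, installed) in compare_versions(CARD_VERSIONS).items():
    print(f"{package}: card lists {wanted}, installed {installed or 'not installed'}")
```

A version difference does not necessarily mean the model will fail to load; the report is only a starting point for debugging.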
| |