Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysis

Detalhes bibliográficos
Autor(a) principal: Saura, José Ramón
Data de Publicação: 2021
Outros Autores: Reyes-Menéndez, Ana, deMatos, Nelson, Correia, Marisol B.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.1/17205
Resumo: The startup business ecosystem in India has experienced exponential growth. The amount of investment in Indian startups in the last decade demonstrates the strong interest of the technology industry to these business models based on innovation. In this context, the present study aims to identify investment opportunities for investors in Indian startups by identifying key indicators that characterize the startup ecosystem in India. To this end, a three steps data mining method is developed using data mining techniques. First, a sentiment analysis (SA), a machine learning approach that classifies the topics into groups expressing feelings, is applied to a dataset. Next, we develop a Latent Dirichlet Allocation (LDA) model, a topic-modeling technique that divides the sample of n = 14.531 tweets from Twitter into topics, using user-generated content (UGC) as data. Finally, in order to identify the characteristics of each topic we apply textual analysis (TA) to identify key indicators. The originality of the present study lies in the methodological process used for data analysis. Our results also contribute to the literature on startups. The results demonstrate that the Indian startup ecosystem is influenced by areas such as fintech, innovation, crowdfunding, hardware, funds, competition, artificial intelligence, augmented reality and electronic commerce. Of note, in view of the exploratory approach of the present study, the results and implications should be taken as descriptive, rather than determining for future investments in the Indian startup ecosystem.
id RCAP_99fdcf7d4ecd4d242d53e1f64861766d
oai_identifier_str oai:sapientia.ualg.pt:10400.1/17205
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysisIdentificação de oportunidades de negócios de startups da UGC no twitter bate-papo: uma análise exploratóriaStartups opportunitiesUser-generated contentSentiment analysisElectronic commerceThe startup business ecosystem in India has experienced exponential growth. The amount of investment in Indian startups in the last decade demonstrates the strong interest of the technology industry to these business models based on innovation. In this context, the present study aims to identify investment opportunities for investors in Indian startups by identifying key indicators that characterize the startup ecosystem in India. To this end, a three steps data mining method is developed using data mining techniques. First, a sentiment analysis (SA), a machine learning approach that classifies the topics into groups expressing feelings, is applied to a dataset. Next, we develop a Latent Dirichlet Allocation (LDA) model, a topic-modeling technique that divides the sample of n = 14.531 tweets from Twitter into topics, using user-generated content (UGC) as data. Finally, in order to identify the characteristics of each topic we apply textual analysis (TA) to identify key indicators. The originality of the present study lies in the methodological process used for data analysis. Our results also contribute to the literature on startups. The results demonstrate that the Indian startup ecosystem is influenced by areas such as fintech, innovation, crowdfunding, hardware, funds, competition, artificial intelligence, augmented reality and electronic commerce. Of note, in view of the exploratory approach of the present study, the results and implications should be taken as descriptive, rather than determining for future investments in the Indian startup ecosystem.UIDB/04470/2020, UIDB/04020/2020MDPISapientiaSaura, José RamónReyes-Menéndez, AnadeMatos, NelsonCorreia, Marisol B.2021-10-07T19:53:34Z2021-092021-09-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.1/17205eng0718-187610.3390/jtaer16060108info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-24T10:29:20Zoai:sapientia.ualg.pt:10400.1/17205Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T20:07:13.719383Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysis
Identificação de oportunidades de negócios de startups da UGC no twitter bate-papo: uma análise exploratória
title Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysis
spellingShingle Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysis
Saura, José Ramón
Startups opportunities
User-generated content
Sentiment analysis
Electronic commerce
title_short Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysis
title_full Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysis
title_fullStr Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysis
title_full_unstemmed Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysis
title_sort Identifying startups business opportunities from UGC on twitter chatting: an exploratory analysis
author Saura, José Ramón
author_facet Saura, José Ramón
Reyes-Menéndez, Ana
deMatos, Nelson
Correia, Marisol B.
author_role author
author2 Reyes-Menéndez, Ana
deMatos, Nelson
Correia, Marisol B.
author2_role author
author
author
dc.contributor.none.fl_str_mv Sapientia
dc.contributor.author.fl_str_mv Saura, José Ramón
Reyes-Menéndez, Ana
deMatos, Nelson
Correia, Marisol B.
dc.subject.por.fl_str_mv Startups opportunities
User-generated content
Sentiment analysis
Electronic commerce
topic Startups opportunities
User-generated content
Sentiment analysis
Electronic commerce
description The startup business ecosystem in India has experienced exponential growth. The amount of investment in Indian startups in the last decade demonstrates the strong interest of the technology industry to these business models based on innovation. In this context, the present study aims to identify investment opportunities for investors in Indian startups by identifying key indicators that characterize the startup ecosystem in India. To this end, a three steps data mining method is developed using data mining techniques. First, a sentiment analysis (SA), a machine learning approach that classifies the topics into groups expressing feelings, is applied to a dataset. Next, we develop a Latent Dirichlet Allocation (LDA) model, a topic-modeling technique that divides the sample of n = 14.531 tweets from Twitter into topics, using user-generated content (UGC) as data. Finally, in order to identify the characteristics of each topic we apply textual analysis (TA) to identify key indicators. The originality of the present study lies in the methodological process used for data analysis. Our results also contribute to the literature on startups. The results demonstrate that the Indian startup ecosystem is influenced by areas such as fintech, innovation, crowdfunding, hardware, funds, competition, artificial intelligence, augmented reality and electronic commerce. Of note, in view of the exploratory approach of the present study, the results and implications should be taken as descriptive, rather than determining for future investments in the Indian startup ecosystem.
publishDate 2021
dc.date.none.fl_str_mv 2021-10-07T19:53:34Z
2021-09
2021-09-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.1/17205
url http://hdl.handle.net/10400.1/17205
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 0718-1876
10.3390/jtaer16060108
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv MDPI
publisher.none.fl_str_mv MDPI
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799133316622843904