Channel: Dictionary-based text analysis- dealing with length - Data Science Stack Exchange

Dictionary-based text analysis- dealing with length

January 21, 2024, 6:04 am

≪ Previous: Answer by Erwan for Dictionary-based text analysis- dealing with length

I am working on an analysis using a dictionary-based text-as-data approach. I have a dataset of texts (n=1200), and I am applying a dictionary of 50 words (I tokenize the text with each word being one token). The texts greatly vary in terms of length, so I try to take length into consideration in my models. I first tried to divide the dictionary count in each text by text length (dictionary count/text length = k). Because I use a regression model, I then take the square root of k to normalize the data (which I use as a dependent variable). In a second model, I did not divide the dictionary count by text length, but I controlled for length as a predictor in a linear regression model (I still take the square root of the dictionary count). The results across these models are substantially different (Especially in terms of statistical significance). I am struggling to decide which model is better, as I could not locate the papers on the subject matter in my field (political science) or elsewhere. Any suggestions?

↧

↧

Latest Images

Eco Data 4/26/24

Eco Data 4/26/24

April 25, 2024, 5:00 pm

‘Pay day every day’ may become Shangri-La Group, BPOs’ secret to happy employees

April 25, 2024, 5:51 am

Nonprofit donates custom home in this East Bay city for Marine injured in...

Nonprofit donates custom home in this East Bay city for Marine injured in...

April 23, 2024, 7:00 am

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

April 22, 2024, 6:00 am

Ukraine bans military from online gambling amid addiction concerns

Ukraine bans military from online gambling amid addiction concerns

April 22, 2024, 5:17 am

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

April 20, 2024, 8:08 pm

OCBC Bank Singapore Offers Up to 2.8% p.a. Fixed Deposit Promotion from 21...

April 20, 2024, 12:38 pm

National Poetry Month 2024: Maxine Starr

National Poetry Month 2024: Maxine Starr

April 19, 2024, 9:56 am

Vegan Chicken Pot Pie

Vegan Chicken Pot Pie

April 19, 2024, 9:18 am

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

April 19, 2024, 7:03 am

Trending Articles

A Wall Street guide to watches

August 5, 2015, 7:32 am

Who Is Junior Pope?| Biography| Profile| History Of Nollywood Actor “Pope...

July 26, 2017, 8:45 am

Consuelo Ortiga y Rey: The "Crush ng Bayan" in Rizal's Time

August 4, 2013, 11:32 pm

Gangland murders in Dublin (1990-94)

April 17, 2020, 1:54 am

Guntur District Police Officers Mobile Numbers

April 17, 2017, 2:10 am

Pengalaman Rawatan di Klinik Dr. Ko

October 15, 2021, 7:41 am

100+ Short Whatsapp Status in English | Short Status Quotes Words

March 22, 2017, 12:27 am

Happy Birthday Wishes for Bhabhi in Hindi & English |हैप्पी बर्थडे भाभी

March 13, 2020, 3:01 am

CINTA JANGAN PERGI (1 - 26 TAMAT)

January 30, 2014, 7:31 am

Bar Rescue - The Prime Bar (WildeFire Bistro) Update

September 15, 2019, 6:50 am

Who Is Jennifer Hines? Bryan Olesen Wife Is Mother Of 3 Kids

March 5, 2024, 2:19 am

AUDIO | Diamond Platnumz ft Mugabe - LawaMa | Download

July 25, 2014, 8:00 am

Tuck Mill sells for £1.4 million

April 15, 2013, 5:22 am

NAT, NCAE, LAPG, SREYA, ELNA and PHIL-IR Materials and Reviewers

February 27, 2017, 6:16 pm

Read GOS (Generic Object Service) Picture Attachments and Display it into...

February 14, 2014, 1:08 pm

Romantic And Impressive Birthday Wishes For Girlfriend - Best Birthday Wishes...

January 30, 2020, 8:41 am

Actress Piumi Botheju New hot and Sexy Photo

February 24, 2013, 8:50 pm

Gulabi kallu Lyrics and translation | GAV / Govindhudu andhari vadele (2014)

September 16, 2014, 6:33 am

Mothey Mandal Sarpanch Wardmumber Mobile Numbers List Part I Nalgonda...

April 19, 2017, 9:30 am

Windows 11 Highly Compressed ISO - 10 MB

July 1, 2021, 2:00 pm

More Pages to Explore .....

Latest Images

Eco Data 4/26/24

Eco Data 4/26/24

April 25, 2024, 5:00 pm

‘Pay day every day’ may become Shangri-La Group, BPOs’ secret to happy employees

April 25, 2024, 5:51 am

Nonprofit donates custom home in this East Bay city for Marine injured in...

Nonprofit donates custom home in this East Bay city for Marine injured in...

April 23, 2024, 7:00 am

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

New private rooms on Tokaido Shinkansen change the way we travel from Tokyo...

April 22, 2024, 6:00 am

Ukraine bans military from online gambling amid addiction concerns

Ukraine bans military from online gambling amid addiction concerns

April 22, 2024, 5:17 am

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

ಮಂಡ್ಯದಿಂದ ಸುಮಲತಾ ದೂರ; ಹೆಚ್‌ಡಿಕೆ ಪರ ಪ್ರಚಾರಕ್ಕಿಳಿಯದ ಸಂಸದೆ –ಬರ್ತಾರೆ ನೋಡೋಣ ಎಂದ...

April 20, 2024, 8:08 pm

OCBC Bank Singapore Offers Up to 2.8% p.a. Fixed Deposit Promotion from 21...

April 20, 2024, 12:38 pm

National Poetry Month 2024: Maxine Starr

National Poetry Month 2024: Maxine Starr

April 19, 2024, 9:56 am

Vegan Chicken Pot Pie

Vegan Chicken Pot Pie

April 19, 2024, 9:18 am

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

Firefox UX: On Purpose: Collectively Defining Our Team’s Mission Statement

April 19, 2024, 7:03 am

© 2024 //www.rssing.com