u/First-Ad-862

Need to analyze pdf research papers for word frequencies. I'm pretty green when it comes to R studio and have only used it for statistics using an excel file so I'm super confused on how to change the pdf file to a text file for data extraction. I understand that the library(tm) is used for this, but I'm having a hard time finding resources on how to change the document and filter for word frequency with some words being viewed as multi-word units (i.e "climate change" over "climate" and "change").

reddit.com
u/First-Ad-862 — 1 month ago