tm.plugin.webmining-package | Retrieve structured, textual data from various web sources |
assignValues | Extract Main HTML Content from DOM |
auth.google.reader | Authentification and token retrieval from the Google Reader web service. |
calcDensity | Extract Main HTML Content from DOM |
corpus.update | Update/Extend 'WebCorpus' with new feed items. |
corpus.update.WebCorpus | Update/Extend 'WebCorpus' with new feed items. |
encloseHTML | Enclose Text Content in HTML tags |
encloseHTML.character | Enclose Text Content in HTML tags |
encloseHTML.PlainTextDocument | Enclose Text Content in HTML tags |
extract | Extract main content from 'TextDocument's. |
extract.PlainTextDocument | Extract main content from 'TextDocument's. |
extractContentDOM | Extract Main HTML Content from DOM |
extractHTMLStrip | Simply strip HTML Tags from Document |
feedquery | Buildup string for feedquery. |
getEmpty | Retrieve Empty Corpus Elements through '$postFUN'. |
getEmpty.WebCorpus | Retrieve Empty Corpus Elements through '$postFUN'. |
getLinkContent | Get main content for corpus items, specified by links. |
getMainText | Extract Main HTML Content from DOM |
getURL | Copy of RCurl:::getURL() including a little bugfix for the .encoding parameter. |
GoogleBlogSearchSource | Get feed data from Google Blog Search (<URL: http://www.google.com/blogsearch>). |
GoogleFinanceSource | Get feed Meta Data from Google Finance. |
GoogleNewsSource | Get feed data from Google News Search <URL: http://news.google.com/> |
GoogleReaderSource | Retrieve feeds through the Google Reader API. |
json_content | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
NYTimesSource | Get feed data from NYTimes Article Search (<URL: http://developer.nytimes.com/docs/read/article_search_api>). |
parse | Wrapper/Convenience function to ensure right encoding for different Platforms |
rbloggers | WebCorpus retrieved from the Google Reader API for the R-Bloggers blog consisting only of meta data (no main content available). Length of retrieved corpus is 1000. |
readGoogle | Get feed Meta Data from Google Finance. |
readGoogleBlogSearch | Get feed data from Google Blog Search (<URL: http://www.google.com/blogsearch>). |
readGoogleReader | Retrieve feeds through the Google Reader API. |
readNYTimes | Get feed data from NYTimes Article Search (<URL: http://developer.nytimes.com/docs/read/article_search_api>). |
readReutersNews | Get feed data from Reuters News RSS feed channels. Reuters provides numerous feed |
readTwitter | Get feed data from Twitter Search API (<URL: https://dev.twitter.com/docs/api/1/get/search>). |
readWeb | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
readWebHTML | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
readWebJSON | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
readWebXML | Read content from WebXMLSource/WebHTMLSource/WebJSONSource. |
readYahoo | Get feed data from Yahoo! Finance. |
readYahooInplay | Get News from Yahoo Inplay. |
removeNonASCII | Remove non-ASCII characters from Text. |
removeNonASCII.PlainTextDocument | Remove non-ASCII characters from Text. |
removeTags | Extract Main HTML Content from DOM |
ReutersNewsSource | Get feed data from Reuters News RSS feed channels. Reuters provides numerous feed |
source.update | Update WebXMLSource/WebHTMLSource/WebJSONSource |
source.update.WebHTMLSource | Update WebXMLSource/WebHTMLSource/WebJSONSource |
source.update.WebJSONSource | Update WebXMLSource/WebHTMLSource/WebJSONSource |
source.update.WebXMLSource | Update WebXMLSource/WebHTMLSource/WebJSONSource |
tm.plugin.webmining | Retrieve structured, textual data from various web sources |
trimWhiteSpaces | Trim White Spaces from Text Document. |
TwitterSource | Get feed data from Twitter Search API (<URL: https://dev.twitter.com/docs/api/1/get/search>). |
WebCorpus | WebCorpus constructor function. |
webmining | Retrieve structured, textual data from various web sources |
WebSource | Read Web Content and respective Link Content from feedurls. |
YahooFinanceSource | Get feed data from Yahoo! Finance. |
YahooInplaySource | Get News from Yahoo Inplay. |
yahoonews | WebCorpus retrieved from Yahoo! News for the search term "Microsoft" through the YahooNewsSource. Length of retrieved corpus is 20. |
YahooNewsSource | Get feed data from Yahoo! News (<URL: http://news.yahoo.com/>). |