Robot | Path | Permission |
GoogleBot | / | ✔ |
BingBot | / | ✔ |
BaiduSpider | / | ✔ |
YandexBot | / | ✔ |
Title | Full-text data from English-Corpora.org: billions of words of downloadable |
Description | Full-text corpus Full-text data from large online |
Keywords | English corpus corpora word frequency genres n-grams collocates |
WebSite | corpusdata.org |
Host IP | 216.239.38.21 |
Location | United States |
Site | Rank |
US$3,209,501
Last updated: 2023-05-06 05:40:48
corpusdata.org has Semrush global rank of 3,297,804. corpusdata.org has an estimated worth of US$ 3,209,501, based on its estimated Ads revenue. corpusdata.org receives approximately 370,327 unique visitors each day. Its web server is located in United States, with IP address 216.239.38.21. According to SiteAdvisor, corpusdata.org is safe to visit. |
Purchase/Sale Value | US$3,209,501 |
Daily Ads Revenue | US$2,963 |
Monthly Ads Revenue | US$88,879 |
Yearly Ads Revenue | US$1,066,542 |
Daily Unique Visitors | 24,689 |
Note: All traffic and earnings values are estimates. |
Host | Type | TTL | Data |
corpusdata.org. | A | 3600 | IP: 216.239.38.21 |
corpusdata.org. | A | 3600 | IP: 216.239.32.21 |
corpusdata.org. | A | 3600 | IP: 216.239.34.21 |
corpusdata.org. | A | 3600 | IP: 216.239.36.21 |
corpusdata.org. | AAAA | 3600 | IPV6: 2001:4860:4802:34::15 |
corpusdata.org. | AAAA | 3600 | IPV6: 2001:4860:4802:38::15 |
corpusdata.org. | AAAA | 3600 | IPV6: 2001:4860:4802:36::15 |
corpusdata.org. | AAAA | 3600 | IPV6: 2001:4860:4802:32::15 |
corpusdata.org. | NS | 21600 | NS Record: ns-cloud-e3.googledomains.com. |
corpusdata.org. | NS | 21600 | NS Record: ns-cloud-e2.googledomains.com. |
corpusdata.org. | NS | 21600 | NS Record: ns-cloud-e4.googledomains.com. |
corpusdata.org. | NS | 21600 | NS Record: ns-cloud-e1.googledomains.com. |
corpusdata.org. | MX | 3600 | MX Record: 10 alt3.aspmx.l.google.com. |
corpusdata.org. | MX | 3600 | MX Record: 5 alt2.aspmx.l.google.com. |
corpusdata.org. | MX | 3600 | MX Record: 10 alt4.aspmx.l.google.com. |
corpusdata.org. | MX | 3600 | MX Record: 1 aspmx.l.google.com. |
corpusdata.org. | MX | 3600 | MX Record: 5 alt1.aspmx.l.google.com. |
corpusdata.org. | TXT | 3600 | TXT Record: v=spf1 include:_spf.google.com ~all |
Full-text corpus data introduction Overview Using the data Limitations (10/200) format/samples Overview Database/SQL corpora related sites English-Corpora.org Word frequency Collocates N-grams WordAndPhrase Academic vocabulary get data Purchase data Samples: 1-10 million words This site contains downloadable, full-text corpus data from ten large corpora of English -- iWeb , COCA , COHA , NOW , Coronavirus , GloWbE , TV Corpus , Movies Corpus , SOAP Corpus , Wikipedia -- as well as the Corpus del Espaol and the Corpus do Portugus . The data is being used at hundreds of universities throughout the world, as well as in a wide range of companies (Amazon, Apple, Samsung, IBM, Netflix, Allstate Insurance, Capital One, Educational Testing Services, Oxford University Press, Dictionary.com, Grammarly, Sketch Engine, an extremely large US-based social media company, and many others) . With this full-text data, you have the actual corpora on your computer , and you can use the data in any way |
HTTP/1.1 302 Found Location: http://www.corpusdata.org Date: Tue, 26 Oct 2021 20:35:58 GMT Content-Type: text/html; charset=UTF-8 Server: ghs Content-Length: 222 X-XSS-Protection: 0 X-Frame-Options: SAMEORIGIN HTTP/1.1 301 Moved Permanently Content-Length: 150 Content-Type: text/html; charset=UTF-8 Location: https://www.corpusdata.org/ Server: Microsoft-IIS/10.0 Date: Tue, 26 Oct 2021 20:35:58 GMT HTTP/2 200 cache-control: private content-length: 44529 content-type: text/html server: Microsoft-IIS/10.0 set-cookie: ASPSESSIONIDAUTCBTDQ=NMPPPEHDLENBMIMOPGBOALHM; secure; path=/ date: Tue, 26 Oct 2021 20:35:58 GMT |
Domain Name: CORPUSDATA.ORG Registry Domain ID: D402200000001912673-LROR Registrar WHOIS Server: whois.google.com Registrar URL: https://domains.google.com Updated Date: 2019-06-06T00:20:38Z Creation Date: 2017-03-29T16:04:04Z Registry Expiry Date: 2029-03-29T16:04:04Z Registrar: Google LLC Registrar IANA ID: 895 Registrar Abuse Contact Email: registrar-abuse@google.com Registrar Abuse Contact Phone: +1.8772376466 Domain Status: clientTransferProhibited https://icann.org/epp#clientTransferProhibited Registrant Organization: Contact Privacy Inc. Customer 1244281699 Registrant State/Province: ON Registrant Country: CA Name Server: NS-CLOUD-E1.GOOGLEDOMAINS.COM Name Server: NS-CLOUD-E2.GOOGLEDOMAINS.COM Name Server: NS-CLOUD-E3.GOOGLEDOMAINS.COM Name Server: NS-CLOUD-E4.GOOGLEDOMAINS.COM DNSSEC: unsigned URL of the ICANN Whois Inaccuracy Complaint Form https://www.icann.org/wicf/) >>> Last update of WHOIS database: 2021-09-14T19:54:16Z <<< |