Top Domains by Extracted Triples for Extractor html-mf-hresume


Back to Statistics

This page contains the list of top domains using the Microformats hResume of the extraction of December 2014 of the Web Data Commons project. The page shows the top domains employing Microformats hResume within their websites, ordered by the number of triples found in the crawl corpus.


  1. newcastle.edu.au (442,603 triples)
  2. whatclinic.com (6,598 triples)
  3. eurecom.fr (5,267 triples)
  4. signpostmarv.name (756 triples)
  5. indeed.com (580 triples)
  6. nyu.edu (358 triples)
  7. handi-cv.com (283 triples)
  8. webdirections.org (192 triples)
  9. memory-alpha.org (184 triples)
  10. cadremploi.fr (178 triples)
  11. chadlindstrom.ca (160 triples)
  12. github.io (148 triples)
  13. ntoll.org (148 triples)
  14. justinhileman.com (142 triples)
  15. dotrob.com (133 triples)
  16. changelog.ca (122 triples)
  17. rogerrohrbach.com (114 triples)
  18. paulschmidt.org (110 triples)
  19. morero.se (96 triples)
  20. dustyjewett.com (93 triples)
  21. alexrosenkranz.com (90 triples)
  22. visual-assault.org (89 triples)
  23. reidsrow.com (86 triples)
  24. paulisageek.com (82 triples)
  25. monks.co (77 triples)
  26. arsmachina.com.br (76 triples)
  27. brandi.org (72 triples)
  28. stjean.co (70 triples)
  29. cafemom.com (68 triples)
  30. stanford.edu (65 triples)
  31. thomasraukamp.cc (63 triples)
  32. stereoartist.com (63 triples)
  33. pkqk.net (61 triples)
  34. fraudain.fr (60 triples)
  35. evanmullins.com (60 triples)
  36. shrvy.com (59 triples)
  37. mattwilliamsnyc.com (59 triples)
  38. paulirish.com (59 triples)
  39. yuvalsadan.com (58 triples)
  40. hairextensionstucson.com (57 triples)
  41. cincgats.com (57 triples)
  42. whitelargepixel.com (57 triples)
  43. jaysudarma.com (57 triples)
  44. jnorton.co.uk (57 triples)
  45. rodriguezfernando.com (57 triples)
  46. basilosman.com (56 triples)
  47. lerdorf.com (55 triples)
  48. xorax.info (54 triples)
  49. opx.pl (53 triples)
  50. clintandrewhall.com (51 triples)
  51. boldlyopen.com (50 triples)
  52. lionel-girard.fr (50 triples)
  53. veerasundar.com (50 triples)
  54. sarahhenderson.info (46 triples)
  55. profesionalactivo.com (46 triples)
  56. suda.co.uk (46 triples)
  57. glennnorwood.co.uk (45 triples)
  58. timothymorgan.info (45 triples)
  59. fix.is (44 triples)
  60. alanmillan.com (42 triples)
  61. fiquett.com (42 triples)
  62. momarortotinon.com (41 triples)
  63. fluid-cv.appspot.com (40 triples)
  64. gsnedders.com (39 triples)
  65. imshopping.com (39 triples)
  66. thedoyletreatment.com (38 triples)
  67. judbd.com (38 triples)
  68. maxcutler.com (37 triples)
  69. jonathanblackburn.com (37 triples)
  70. blogspot.com (37 triples)
  71. cardwellit.com (36 triples)
  72. villeassinen.com (36 triples)
  73. admitmeplease.org (35 triples)
  74. l2fprod.com (34 triples)
  75. arthurdick.com (34 triples)
  76. furf.com (33 triples)
  77. anti-personnel.com (32 triples)
  78. monkey.org (31 triples)
  79. fernandoosorio.net (30 triples)
  80. voronenko.info (29 triples)
  81. finds.org.uk (28 triples)
  82. thejourneyler.org (28 triples)
  83. ryanjoy.com (27 triples)
  84. creativ-e-motion.fr (27 triples)
  85. glennjones.net (26 triples)
  86. thepracticalsysadmin.com (26 triples)
  87. jasonwhutchinson.com (26 triples)
  88. superfreshstudio.com (22 triples)
  89. hotel-hotel.com (22 triples)
  90. franalburquerque.info (21 triples)
  91. nickescobedo.com (20 triples)
  92. jakubhajek.cz (20 triples)
  93. seanyo.ca (19 triples)
  94. thisiskenson.be (18 triples)
  95. freerepublic.com (17 triples)
  96. roqz.net (17 triples)
  97. free.fr (14 triples)
  98. wolerized.com (14 triples)
  99. kevinlawver.com (13 triples)
  100. redmediadigital.com (12 triples)
  101. mfagreensboro.org (12 triples)
  102. sut.ac.jp (10 triples)
  103. shiningray.cn (9 triples)
  104. johnfallsopp.com (8 triples)
  105. opera-video.com (8 triples)
  106. tus.ac.jp (8 triples)
  107. syniverse.com (8 triples)
  108. bunda.co.id (8 triples)
  109. braithwaite.ca (7 triples)
  110. coleinteriordesign.com (7 triples)
  111. modelmayhem.com (7 triples)
  112. alapetite.fr (6 triples)
  113. jobspice.com (6 triples)
  114. fabiolocati.com (6 triples)
  115. scottblackman.com (5 triples)
  116. mforos.com (5 triples)
  117. kompetansebors.no (5 triples)
  118. buaa.edu.cn (4 triples)
  119. alfamartku.com (4 triples)
  120. damiandesigns.com (4 triples)
  121. tomorrow-focus.de (4 triples)
  122. stefan-koch.name (4 triples)
  123. tomhenrich.com (4 triples)
  124. microsoft.com (4 triples)
  125. yesterdayusa.com (4 triples)
  126. care2.com (3 triples)
  127. edu.int (3 triples)
  128. jwilde.me (3 triples)
  129. my3gb.com (3 triples)
  130. anthonycalzadilla.com (2 triples)
  131. pixelpunchout.com (2 triples)
  132. tomorrow-focus.com (2 triples)
  133. advanceweb.com (2 triples)
  134. juicycanvas.com (2 triples)
  135. cmu.edu (2 triples)
  136. kuskafesi.com.tr (2 triples)
  137. whitemeadowtemple.org (2 triples)
  138. guillaumelassiat.com (2 triples)
  139. crumpmortgage.com (2 triples)
  140. mirra.co.za (2 triples)
  141. knodal.com (2 triples)
  142. ctvoices.org (2 triples)
  143. diodon349.com (2 triples)
  144. amnesiavivace.it (2 triples)
  145. ox.ac.uk (1 triples)
  146. jcaruso.com (1 triples)
  147. nomadcode.com (1 triples)
  148. cesarrodas.com (1 triples)
  149. jessecravens.com (1 triples)
  150. isberg.eu (1 triples)
  151. zdrojak.cz (1 triples)
  152. premierpilatesny.com (1 triples)
  153. wright.edu (1 triples)
  154. ianhung.com (1 triples)
  155. gold.ac.uk (1 triples)