  • Trouble importing XML

    Hi All,
    Apologies for a new post on this topic. However, I searched through the forum and didn't find an existing resolution to my query.

    I'm trying to import XML files from the USPTO website into Stata (ver. 14.1) but haven't been successful. I tried File>Import>XML Data from the point-and-click menus as well as the command <xmluse> with the option <doctype(dta)>. However, I get the message <unrecognizable XML doctype>. I also searched the net and this forum and found that I'm not the only one having trouble importing XML files into Stata, especially the ones provided by USPTO.

    Here's a link to the files I'm trying to import <https://bulkdata.uspto.gov/data2/patent/grant/redbook/bibliographic/2015/>. There's also a file with the extension <dtd> at the bottom of the listing, which I think has something to do with importing the XML files, but I'm not sure what it does or how to handle it. These are rather large files, which is why I'm also unable to convert them with online XML-to-CSV converters.

    It would be a great help if someone could advise me on how to import such files into Stata correctly, or suggest a program that can convert .xml files into another format that Stata can import easily.

    Appreciate any help in this regard

    Thanks
    ash
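
One possible route for the conversion asked about above, sketched in Python rather than Stata: read the file line by line, split it into individual XML documents, and write selected fields to a CSV file. The sketch assumes the bulk file is a concatenation of many XML documents, each starting with its own <?xml ...?> declaration on a new line (worth confirming in a text editor first). The function names and the element paths passed in `fields` are hypothetical placeholders, not names taken from the USPTO schema.

```python
import csv
import xml.etree.ElementTree as ET


def split_documents(lines):
    """Yield each XML document from a stream of concatenated documents."""
    buffer = []
    for line in lines:
        # Assumption: every document begins with an XML declaration on its own line.
        if line.lstrip().startswith("<?xml") and buffer:
            yield "".join(buffer)
            buffer = []
        buffer.append(line)
    if buffer:
        yield "".join(buffer)


def xml_to_csv(lines, out, fields):
    """Write one CSV row per document.

    `fields` maps a CSV column name to an ElementTree path relative to the
    document root (hypothetical paths; adjust them to the real schema).
    """
    writer = csv.writer(out)
    writer.writerow(fields.keys())
    for doc in split_documents(lines):
        if not doc.strip():
            continue  # skip blank leading or trailing chunks
        # Only one document is parsed at a time, so memory use stays small.
        root = ET.fromstring(doc.encode("utf-8"))
        writer.writerow([root.findtext(path, default="") for path in fields.values()])
```

`xml_to_csv` takes any iterable of lines (for example an open file handle) and a writable text file opened with `newline=""`. The resulting CSV could then be read in Stata with `import delimited`. If the documents use character entities that are defined only in the external DTD, ElementTree will raise a parse error, and a DTD-aware parser would be needed instead.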

  • #2
    Originally posted by Ash Sharma
    I searched through the forum and didn't find an existing resolution to my query.

    Dear Ash, please search this forum again more thoroughly. My solution from October 2014 still leads to data:


    • #3
      Dear Sergiy,
      Thanks for sharing that information. I had already looked at that thread and, given that Pantelis was trying to import a similar file, I tried the recommendations there, but without success. I also tried the software that Pantelis mentioned on the forum for converting XML files. However, given the size of the files, it is not a feasible solution.

      The files definitely reference a DTD (I checked by opening them in Notepad++), and there's also a file with a .dtd extension at the bottom of the listing (which, I think, has something to do with opening the other files): https://bulkdata.uspto.gov/data2/pat...fulltext/2015/

      Also, could you explain how to remove the DTD and open the files in Excel, so that I can save them in another format?

      Kindly excuse my lack of expertise with Stata.

      Thanks
      ash
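
On the question of removing the DTD: a minimal Python sketch that deletes the <!DOCTYPE ...> declaration from a single XML document held as text. The function name and the regular expression are illustrative assumptions, not a method given in this thread. Note that Stata's xmluse accepts only doctype(dta) and doctype(excel), so stripping the declaration would not by itself make the file readable by xmluse; it only helps with tools that refuse to open a document whose DTD they cannot find.

```python
import re

# Matches a DOCTYPE declaration, including an optional internal subset in [...].
DOCTYPE_RE = re.compile(r"<!DOCTYPE[^>\[]*(\[[^\]]*\])?\s*>")


def strip_doctype(xml_text):
    """Return xml_text with any DOCTYPE declaration removed."""
    return DOCTYPE_RE.sub("", xml_text)
```

If the document relies on entities declared in the DTD, removing the declaration leaves those entity references undefined, so the output should be checked with an XML parser before further use.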
