I have a large JSON file (~700,000 lines, 1.2 GB) containing Twitter data that I need to preprocess for data and network analysis.

During the data collection an error happened: instead of " being used as the string delimiter, ' was used. As this does not conform to the JSON standard, the file cannot be processed by R or Python.

Roughly every 500 lines there is a block of meta info plus meta information for the users, etc. After that come the tweets in JSON (order of fields not stable), each starting with a space, one tweet per line.

A simple data.replace('\'', '\"') is not possible, as the "text" fields contain tweets which may contain ' or " themselves. Using regex, I was able to catch some of the instances, but it does not catch everything. Using ast.literal_eval(data) from the ast package also throws an error.

As the order of the fields and the length of each field are not stable, I am stuck on how to reformat that file so that it conforms to JSON.

The other thing is the posted information about different version numbers for the exact same product. Current Build: FortKnox Personal Firewall v7.0.205 FortKnox Personal Firewall 22.0.330. It could be a great product, but things like that are confusing IMO, and I shouldn't be forced to install it just to know what version it actually is.
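For the single-quote problem in the tweet file, one possible approach is a small character-by-character scanner rather than a global replace or a regex. The sketch below is only a heuristic under stated assumptions, not a guaranteed fix: it assumes a ' ends a string only when the next non-blank character is one of } ] , : (or the line ends), and that any other ' inside a string is part of the tweet text. The function names single_to_double_quoted and parse_lines are made up for this illustration. A tweet whose text itself contains something like ', will still be misread, so lines that fail to parse are collected for manual inspection instead of being silently dropped.

```python
import json

# Characters that may legally follow a closing string delimiter in JSON.
CLOSERS = set("}],:")


def single_to_double_quoted(line: str) -> str:
    """Heuristically rewrite '...'-delimited strings as "..."-delimited ones."""
    out = []
    in_string = False
    i, n = 0, len(line)
    while i < n:
        ch = line[i]
        if not in_string:
            if ch == "'":
                # Outside a string, a single quote can only open one.
                in_string = True
                out.append('"')
            else:
                out.append(ch)
        elif ch == "\\" and i + 1 < n:
            nxt = line[i + 1]
            # \' is not a valid JSON escape, so emit a plain apostrophe;
            # every other escape sequence is passed through unchanged.
            out.append("'" if nxt == "'" else ch + nxt)
            i += 2
            continue
        elif ch == '"':
            # A double quote inside the text must be escaped in JSON.
            out.append('\\"')
        elif ch == "'":
            # Assumption: this quote closes the string only if the next
            # non-blank character is structural (or the line ends here).
            j = i + 1
            while j < n and line[j] in " \t":
                j += 1
            if j == n or line[j] in CLOSERS:
                in_string = False
                out.append('"')
            else:
                out.append("'")
        else:
            out.append(ch)
        i += 1
    return "".join(out)


def parse_lines(lines):
    """Return (parsed objects, 1-based numbers of lines that still fail)."""
    parsed, failed = [], []
    for lineno, raw in enumerate(lines, start=1):
        try:
            parsed.append(json.loads(single_to_double_quoted(raw.strip())))
        except json.JSONDecodeError:
            failed.append(lineno)
    return parsed, failed
```

Since parse_lines accepts any iterable of strings, an open file handle can be passed directly, so the 1.2 GB file never has to be loaded into memory at once. The meta-information blocks will probably end up in the failed list and need separate handling, because their format is not described here.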