mirror of
https://github.com/nhammer514/textfiles-politics.git
synced 2025-06-25 14:50:47 -04:00
lots of stuff, python names, xquery to html, xml regex clean-ish
This commit is contained in:
parent
8838b667df
commit
780f7e4c00
342 changed files with 246606 additions and 843 deletions
6
pythonCode/read.md
Normal file
6
pythonCode/read.md
Normal file
|
@ -0,0 +1,6 @@
|
|||
# PythonCode
|
||||
## Notes
|
||||
- You will need to uncomment the ```nlp = spacy.cli.download("en_core_web_lg")``` to download the language model stuff. u can uncomment it once it downloads.
|
||||
- The ```re``` library includes our Regex functions. It is called using ```regex```. It uses standard regular expression stuff.
|
||||
- Everytime ```main.py``` launches, ```outputNames.txt``` clears. It will need to go through the entirety of our files, which still has to be done. Will all of the files work???
|
||||
- We will need to modify code so that it can produce new ```.xml``` files. Probably best to output files in new directory or something once we get started on that.
|
Loading…
Add table
Add a link
Reference in a new issue