程序代写代做代考 concurrency Excel assembly distributed system Hive chain file system compiler Bayesian decision tree assembler database computer architecture interpreter mips Hidden Markov Mode c++ discrete mathematics scheme javascript computational biology algorithm Bayesian network data structure Java python matlab gui cache CGI jquery data science courseScraper-checkpoint
courseScraper-checkpoint In [1]: import urllib2 #specify the url wiki = “http://guide.berkeley.edu/courses/compsci/” page = urllib2.urlopen(wiki) from bs4 import BeautifulSoup soup = BeautifulSoup(page, “lxml”) In [34]: res = [] for t in soup.find_all(‘h3’, class_=”courseblocktitle”): alls = t.find_all() res.append(‘ ‘.join(x.string for x in alls).replace(u’\xa0’, ‘ ‘)) # alls = soup.find_all(‘h3’, class_=”courseblocktitle”)[0].find_all() # ‘ ‘.join(x.string for x in alls).replace(u’\xa0’, ‘ ‘) […]