Pixelatus @pixelatus - Tumblr Blog

Writing a crawler in less than 10 minutes

Hola !

Came across Scrapy a few days ago. It was quite simple and easy to get started with a simple crawler.

This post assumes you have read something about scrapy and explains the steps to create a simple application using scrapy.

1) Install scrapy

apt-get install scrapy

2) Start a project with scrapy:

scrapy startproject myscraper

3) Define Items to be scraped in items.py:

Inside myscraper/myscraper/items.py , you will find a template to define what items need to be scraped. If you are familiar with Django, this is like models except that there are no data types. All Items are Fields()

https://gist.github.com/3563409

4) Define allowed domains/start urls :

Once you have defined items, open a new file ,say myspider.py in myscraper/myscraper/spiders/. Subclass from BaseSpider ( or from CrawlSpider if you want to specify which links to allow/deny). Provide a crawler name and specify a start_url for the crawler.

https://gist.github.com/3563319

Specify what to extract from the urls in the parse method. Scrapy uses Xpath by default to select contents. You can read more about Xpath from here. Parse method accepts the http response object and returns an Item. In the above example, for bookname , I have assigned contents of title tag and contents of span tag with price attribute to ‘price’

5) Define Pipelines:

Its important to store the scraped data somewhere. For this purpose, scrapy provides pipelines. Define your own pipelines in pipelines.py and enable that class in ITEM_PIPELINE settings in settings.py.

For eg, to write to a json file:

https://gist.github.com/3563350

and in settings.py file :

ITEM_PIPELINES = ['myscraper.pipelines.MyScraperPipeline']

6) Run your crawler

Thats it ! You’re done. Run your crawler using:

scrapy crawl avispider

You can see the scraped items in results.out file. :) Hope this was useful. You can see a full fledged application using scrapy here

#blog

shythemes

Text like and reblog buttons

In a previous tutorial, I demonstrated how to make like and reblog buttons as SVG icons. Now, I will show you how to make like and reblog buttons appear as text, like I did in my prism theme :)

Keep reading

#blog

h2>

pre> ##

#blog

<##pre>

function getData(){}

#blog

pre>

< pre>

#blog #kecskee

all

open

closed

#blog

all

open

closed

#blog

all

open

closed

#blog

all

open

closed

#blog

#blog

##let x = 2; if(x == 2){ console.log('kecskeee)};

#blog

let x = 2; if(x == 2){ console.log('kecskeee)};

#blog

html

<pre>

let x = 2;

if(x == 2){

console.log('kecskeee)};

</pre>

#blog

codes for tumblr themes & pages. Contribute to n0nspace/tumblr-themes development by creating an account on GitHub.

#blog

shythemes

Text like and reblog buttons

In a previous tutorial, I demonstrated how to make like and reblog buttons as SVG icons. Now, I will show you how to make like and reblog buttons appear as text, like I did in my prism theme :)

Keep reading

#blog

nonspace-moved

hi! i really love your photon theme, but to use it, i'd have to add a filter system. do you know how i could do this? or would you be willing to do it? i think it would be a super addition! and don't worry if you can't help, i'll google it if you say no! have a lovely day and new year!!

Hey, you’re right, I had sort of been planning on doing it as well but somehow forgot about it over other stuff!

I had some time and since this was so nicely asked I decided to quickly added. I wasn’t sure on the styling or the placement of the categories but you can change that if you like!

The code is here.

All you’ll need to do is set the filters you need here (the dot in front of the name is important):

all

open

closed

You can add more than that ofc, just copy/paste.

And then add the respective classes to the character names in the character navigation, here (important, here without! the dot):

one

two

three

...

The plugin I used is Isotope. You can find more information on it or additional functions here. There’s ways to have it filter by several tag as well etc.

Hope this helps!

pixelatus

anything!testing

#blog

nonspace

I am using PAGE #04 AFTERIMAGE v.3 Master and was wondering if it is possible to have multiple tags on a character to allow for cross searching. ie. being able to select group 1 and group 3 and having only characters with both those tags show up. I'm not sure what exactly the right term is for html so I'm having a hard time finding any coding help for this so I thought I would come to you. If this is not a question you have time for let me know and I'll try finding an answer on my own. Thanks!

Hey! It’s definitely possible.

I hope you don’t mind me publishing this, I figured it might be relevant to others especially seeing as the post linked in the code seems to have gotten lost during the blog move.

Filter Options:

https://isotope.metafizzy.co/filtering.html

This is the Isotope plug-in’s guide on how the filter options work. In the second box you can see the filter selectors. It’s written in jQuery but what you see after filter: ‘ ‘ correlates with the data-filter attribute you have in the navigation.

// filter .metal items$grid.isotope({ filter: '.metal' });// filter .alkali OR .alkaline-earth items$grid.isotope({ filter: '.alkali, .alkaline-earth' });// filter .metal AND .transition items$grid.isotope({ filter: '.metal.transition' });// show all items$grid.isotope({ filter: '*' });

So in your case if you want to only show characters with both tags: character 1 AND character 2 you will need to write the classes into the data-filter it as in the third example. e.g. group 1 AND group 3; the classes WITHOUT a space in between:

Group 1 & 3

Hope this helps!

Trending Blogs

Recently Viewed Blogs

Pixelatus