Connecting a Python file with a SQL Database

HummelsM · May 23, 2018, 8:13am

Hello, everyone. I’m trying to build a crawler and connect it to a database so that the crawled data can be stored and indexed there for later retrieval. I’m new to some aspects of Python and SQL, so any advice would help.

HummelsM · May 23, 2018, 12:31pm

That’s the problem. As far as a connection between the two is concerned, I don’t have any code yet. That’s what I’m here for: to get an idea of how to handle this.

RedRaiderJoe · May 24, 2018, 4:11am

Try looking into the pyodbc module. I’ve used it quite a bit for connecting to Microsoft Access Databases.

owel · May 24, 2018, 4:30am

Microsoft documents Python drivers for SQL Server, including pymssql (a community-maintained driver) and pyodbc.

https://docs.microsoft.com/en-us/sql/connect/python/python-driver-for-sql-server?view=sql-server-2017

HummelsM · May 25, 2018, 5:27am

@owel So I download the driver, then what? What do I do next? Do I have to configure something with SQL or something?

owel · May 25, 2018, 6:42am

There are several code examples here:
http://pymssql.org/en/stable/pymssql_examples.html

This is just the driver that lets Python talk to a SQL Server instance.

You still need to know how to create and manage databases, tables and views in SQL Server, and how to construct SQL commands. The driver is not magic; it’s just a bridge between Python and SQL Server.

If you’ll be managing SQL databases (creating tables, fields, indexes, views, stored procedures, full-text indexes, triggers, etc.), you’ll want SQL Server Management Studio (SSMS, the successor to the old Enterprise Manager) installed on your computer and, more importantly, you need to know how to use it.

https://docs.microsoft.com/en-us/sql/ssms/download-sql-server-management-studio-ssms?view=sql-server-2017

HummelsM · May 27, 2018, 7:24am

I’m making the SQL connection using Visual Studio, but I’m unclear on how I can get the .py file, which is the crawler, to store crawled data into the SQL database. Suggestions?

HummelsM · May 27, 2018, 3:36pm

from scrapy.spiders import CrawlSpider, Rule
from scrapy.linkextractors import LinkExtractor


class ElectronicsSpider(CrawlSpider):
    name = "electronics"
    allowed_domains = ["www.olx.com.pk"]
    start_urls = [
        'https://www.olx.com.pk/tv-video-audio/',
        'https://www.olx.com.pk/games-entertainment/'
    ]

    rules = (
        Rule(LinkExtractor(allow=(), restrict_css=('.pageNextPrev',)),
             callback="parse_item",
             follow=True),)

    def parse_item(self, response):
        print('Processing..' + response.url)
        # print(response.text)

Silber8806 · December 15, 2018, 3:56am

First off: what database are you using? Postgres, MySQL, Mongo… HBase? Once you’ve figured that out, check whether there is a client library you can install and whether you need to download native drivers (and whether they are available). Native drivers are drivers written specifically for one database, as opposed to an open standard such as ODBC. I mostly use Postgres, where the psycopg2 library works well. For psycopg2, I don’t think separate drivers are required on the client (though that might differ on Windows, or they might come in as a dependency that gets downloaded).

Usually, though not always, you establish a database connection by calling a connection function or by creating a database engine that has a connect function. For this connection, you’ll have to provide a host, a user name, a password and potentially a database name (if the server hosts more than one database). Typically, you will either connect or get an error; the outcome is normally binary. If you get an error, there is a good chance that (1) your credentials are wrong or (2) you are not reaching the database at all. The latter is usually the result of a firewall, a TCP/IP (network) connection not being established, or database permissions being off. You can usually work out which from the error message.
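As a rough sketch of that connect-or-fail pattern: the example below uses `sqlite3` from the standard library purely as a stand-in, so it runs without a server. The helper name `try_connect` is made up for illustration. With a server database the connect call would take host, user and password arguments instead of a file path, and the exception to catch would be the driver’s own error class.

```python
import sqlite3


def try_connect(path):
    """Return an open connection, or None if connecting fails."""
    try:
        # With a server database this call would take host, user,
        # password and database name instead of a file path.
        return sqlite3.connect(path)
    except sqlite3.Error as exc:
        # The error message is usually the best hint about what went wrong.
        print(f"Connection failed: {exc}")
        return None


conn = try_connect(":memory:")
print("connected" if conn is not None else "not connected")
if conn is not None:
    conn.close()
```

Returning `None` on failure is just one choice; letting the exception propagate is equally reasonable if the program cannot do anything useful without the database.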

If you succeed, it is a good idea to first query for the databases or users available on the server. This confirms that you are connected and have access to the right user and database. Afterwards, I’d query actual data entities such as relational tables or document stores. When you do this, you typically get back an iterable object or generator, which you can then work with.
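Here is a small sketch of those two steps, again with `sqlite3` as a stand-in so it runs anywhere. The `pages` table and the example.com URLs are invented for the example. In SQLite the catalog lives in `sqlite_master`; other databases expose theirs differently, so treat the first query as specific to SQLite.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical table so there is something to look at.
cur.execute("CREATE TABLE pages (url TEXT PRIMARY KEY, title TEXT)")
cur.executemany(
    "INSERT INTO pages (url, title) VALUES (?, ?)",
    [("https://example.com/a", "Page A"), ("https://example.com/b", "Page B")],
)
conn.commit()

# Step 1: sanity check that the connection can see the catalog.
cur.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
tables = [row[0] for row in cur.fetchall()]

# Step 2: query real data; the cursor can be iterated row by row.
cur.execute("SELECT url, title FROM pages ORDER BY url")
titles = []
for url, title in cur:
    titles.append(title)

conn.close()
print(tables, titles)
```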

Amelia1 · January 12, 2019, 9:54pm

Well, depending on which SQL database you are using, you can pip install pymssql for Microsoft SQL Server (MSSQL), psycopg2 for Postgres, or mysqlclient (which provides the MySQLdb module) for MySQL. Here are a few examples of using them:

Microsoft SQL Server

import pymssql

# server, user, password and db are placeholders for your own settings
conn = pymssql.connect(server=server, user=user, password=password, database=db)
cursor = conn.cursor()

cursor.execute("SELECT COUNT(MemberID) AS count FROM Members WHERE id = 1")
row = cursor.fetchone()

conn.close()

print(row)

Postgres

import psycopg2

# db, user, password and host are placeholders for your own settings
conn = psycopg2.connect(database=db, user=user, password=password, host=host, port="5432")
cursor = conn.cursor()

cursor.execute('SELECT COUNT(MemberID) AS count FROM Members WHERE id = 1')
row = cursor.fetchone()

conn.close()

print(row)

MySQL

import MySQLdb

# host, user, passwd and db are placeholders for your own settings
conn = MySQLdb.connect(host=host, user=user, passwd=passwd, db=db)
cursor = conn.cursor()

cursor.execute('SELECT COUNT(MemberID) AS count FROM Members WHERE id = 1')
row = cursor.fetchone()

conn.close()

print(row)
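To tie this back to the crawler posted earlier in the thread, here is a minimal sketch of how scraped items could be written to a database. It assumes a Scrapy item pipeline, which is a plain class with `open_spider`, `process_item` and `close_spider` methods. The class name `SqlitePipeline`, the `crawl.db` file name and the `pages` table are all made up, and `sqlite3` stands in for the real driver so the sketch runs on its own. Note that `sqlite3` uses `?` placeholders, while pymssql, psycopg2 and MySQLdb use `%s`.

```python
import sqlite3


class SqlitePipeline:
    """Scrapy-style item pipeline that stores each item in a table."""

    def __init__(self, db_path="crawl.db"):
        self.db_path = db_path  # hypothetical file name

    def open_spider(self, spider):
        # Called once when the spider starts: connect and prepare the table.
        self.conn = sqlite3.connect(self.db_path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, title TEXT)"
        )

    def process_item(self, item, spider):
        # Placeholders keep scraped text from being interpreted as SQL.
        self.conn.execute(
            "INSERT OR REPLACE INTO pages (url, title) VALUES (?, ?)",
            (item["url"], item.get("title")),
        )
        self.conn.commit()
        return item

    def close_spider(self, spider):
        # Called once when the spider finishes.
        self.conn.close()


# Quick check without Scrapy, using an in-memory database.
pipeline = SqlitePipeline(":memory:")
pipeline.open_spider(spider=None)
pipeline.process_item({"url": "https://example.com/a", "title": "Page A"}, spider=None)
print(pipeline.conn.execute("SELECT url, title FROM pages").fetchall())
pipeline.close_spider(spider=None)
```

For this to receive anything, `parse_item` in the spider would need to yield a dict such as `{"url": response.url, "title": response.css("title::text").get()}` instead of only printing, and the pipeline class would need to be listed under `ITEM_PIPELINES` in the project’s settings.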
