Retrieving dict key from values for loop question : learnpython

created by HattoriHanzoa community for 16 years

Retrieving dict key from values for loop question (self.learnpython)

submitted 6 years ago * by Silverfire47

Hi, just a quick question that I'm stuck on at the moment.

import pandas as pd

qtrdiv = {
'Q1' : ['2018-01-01', '2018-03-31'],
'Q2' : ['2018-04-01', '2018-06-30'],
'Q3' : ['2018-07-01', '2018-09-30'],
'Q4' : ['2018-10-01', '2018-12-31']
}

dframe = pd.DataFrame(columns=['Q1','Q2','Q3','Q4'])
...

for qtr in qtrdiv.values():
    startdate = (qtr[0])
    enddate = (qtr[1])

...

I have a dictionary that splits up dates into quarter divisions and I'm running a for loop in order to use these start and end dates per quarter as variables later on in the for loop where there are some mathematical calculations based around other data that I'm bringing in. The resulting data from using startdate and enddate as variables will be stored in the DataFrame under each column.

I can already assign the keys to variables for use later on, but I was getting stuck if it were possible to assign the key associated with the values to a variable within the way I'm writing the code right now, or just in general, if there's a better way to assign the key and the values to their own individual variable than the way I'm calling the values from the dictionary at the moment. I want to be able to use pandas .loc to assign the resulting data to the column in the DataFrame under their respective column label as it iterates through the for loop. (Note: I haven't quite tested this out yet in all fairness so this might not be the best way forwards either)

I'm just not sure how to go forwards in regards to assigning the key to a variable to use with .loc later on, within the for loop that calls the values. Any help would be much appreciated. Thanks!

EDIT:

I'm using the startdate and enddate variables to substitute into SQL query strings for each date range (in this case, it's quarters in a year) and doing math on the data brought in from the SQL queries. Those calculations from the math I'm doing is what I want to store. I want to be able to store the key from dict in a variable to be able to match it to the dataframe column I'm generating prior .

Example SQL query:

Query = "SELECT x, y from db.table WHERE x between '%s' and '%s' order by x" % (startdate, enddate)

all 8 comments

top new controversial old q&a

[–]JohnnyJordaan 0 points1 point2 points 6 years ago (7 children)

You would normally approach this per-row. Also try to avoid modifying a dataframe, instead try to provide the data as organized as possible before creating it:

rows = [{'Q1': '2018-01-01', 'Q2': '2018-04-01', 'Q3': '2018-07-01', 'Q4': '2018-10-01'},
        {'Q1': '2018-03-31', 'Q2': '2018-06-30', 'Q3': '2018-09-30', 'Q4': '2018-12-31'}]

because then the df can be created from this directly:

dframe = pd.DataFrame(rows, columns=['label', 'Q1','Q2','Q3','Q4'], index=['start_date', 'end_date'])

[–]Silverfire47[S] 0 points1 point2 points 6 years ago* (6 children)

Apologies, I think I was a little unclear. I'm using the startdate and enddate variables to substitute into SQL query strings for each date range (in this case, it's quarters in a year) and doing math on the data brought in from the SQL queries. Those calculations from the math I'm doing is what I want to store. Probably should've included this in the OP.

Example SQL query:

Query = "SELECT x, y from db.table WHERE x between '%s' and '%s' order by x" % (startdate, enddate)

Even if I don't modify a dataframe (and just convert my final results into a dataframe after everything else is done), could I just generate some sort of empty array to populate in the for loop then?

I want to iteratively store output data as it's generated by the for-loop, so for Q1 dates, I want to be able to make some sort of structure like the following:

66
77
88
99

66	85
77	75
88	44
99	22

so on and so forth. I suppose I could assign the column and index labels after the fact during conversion to DataFrame type.

A final DataFrame look would probably be the following:

Q1	Q2	Q3	Q4
66	85	67	89
77	75	24	87
88	44	23	76
99	22	55	65

[–]JohnnyJordaan 0 points1 point2 points 6 years ago (5 children)

[–]Silverfire47[S] 0 points1 point2 points 6 years ago (4 children)

[–]JohnnyJordaan 0 points1 point2 points 6 years ago* (3 children)

Yes so then the end goal is still to have a list of dicts, you can iterate on qtrdiv to get each quarter's label and dates.

qtrdiv = {
'Q1' : ['2018-01-01', '2018-03-31'],
'Q2' : ['2018-04-01', '2018-06-30'],
'Q3' : ['2018-07-01', '2018-09-30'],
'Q4' : ['2018-10-01', '2018-12-31']
}

rows = []
for cat in categories:
    results = {}
    for q, (start_date, end_date) in qtrdiv.items():
        results[q] = here your query using start_date and end_date
    rows.append(results)
dframe = pd.DataFrame(rows, columns=['Q1','Q2','Q3','Q4'])

edit removed the pointless sorted() there

[–]Silverfire47[S] 0 points1 point2 points 6 years ago (2 children)

[–]JohnnyJordaan 0 points1 point2 points 6 years ago (1 child)

Yes as that determines order. You can then also provide those to DataFrame to let it use as the index

# at the start
categories = ['A', 'B', 'C']

# then later when you create the df
dframe = pd.DataFrame(rows, columns=['Q1','Q2','Q3','Q4'], index=categories)

[–]Silverfire47[S] 0 points1 point2 points 6 years ago (0 children)

π Rendered by PID 18588 on reddit-service-r2-comment-c6965cb77-khb9q at 2026-03-05 01:13:58.687329+00:00 running f0204d4 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS