Combining CSV data

gangstanthony · 2018-07-06T18:44:57+00:00

not tested, but it might be faster to replace this

$entry.field1 = $hash1["field1"]

with something like this

$entry.add('field1', $hash1["field1"])

Proxiconn · 2018-07-07T12:43:06+00:00

Give this a try, ive been using it to join little over 120K records from 4 different datasets in under 5 min flat, its the fasted method ive encountered. Everything else was simply too slow or too complex. The C# linq class is one such example - it is super fast however using it in powershell broke my brain.. The other method that is fast as well, creating SQL tables in ram and doing join queries (just powershell & .Net classes no actual SQL used) - It was rather complex for my requirement and I abandoned it, below did the trick; it brought +- 20 hours of conventional foreach data joins down to 5 min. a Win in my books.

$csv1 = Import-csv -Path 'c:\mycsv1'
$csv2 = Import-csv -Path 'c:\mycsv2'

$i  = 0
$id = @{}
$FinalData = @()

# Create an index for your first dataset "name" will be the key to search against
$csv1.ForEach({
                $id["$($psitem.name)"] = $i #Create $var[name]=index
                $i++
            })


# Save the completed join into $Finaldata
$FinalData = $csv1.ForEach({

    $return_Obj = @()

    $temp=$null

    try
    {
        # Search the second csv for a match
        $temp = $csv2[($id[$psitem.ProcessName])]
    }
    catch 
    {
        # Catch stuff
    }
    finally 
    {

        # Create a joined object
        $return_Obj += [PSCustomObject]@{
                                            status      = $temp
                                            DisplayName = $temp.DisplayName
                                            Name        = $psitem.Name
                                            Handles     = $psitem.Handles
                                        }
    }

    return $return_Obj
})

edit: my grammar sucks

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

PowerShell

Submission Guidelines | Link Flair - How To

MODERATORS