|
|
It's similar to job scheduling, where the blocks are the "jobs" (length of job = length of block) and the layers of the wall represent the "machines" (number of layers = number of machines).
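To make the analogy concrete: the wall is finished when its longest layer is finished, so the goal is to minimize the maximum total block length per layer (the "makespan", in scheduling terms). A quick Python sketch, with illustrative names:

```python
# Score of a block -> layer (job -> machine) assignment: the wall is done
# when its longest layer is done, so this is the quantity to minimize.
def makespan(assignment):
    # assignment: one list of block lengths per layer
    return max(sum(layer) for layer in assignment)

print(makespan([[3, 3], [2, 2, 2]]))  # 6
```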
|
|
|
|
|
Are there some good algorithms following this approach?
I have been working in this area and would love to learn more.
|
|
|
|
|
IIRC, Integer Linear Programming can solve it optimally, but everyone seems to use heuristic algorithms because solving an ILP problem is too slow for them. Strictly speaking, though, ILP is not an algorithm but a way of specifying a problem; you can solve one with branch and bound (I may give it a try and see how slow it would really be).
It's quite annoying to search for good information about job scheduling with full in-advance knowledge: most papers about job scheduling deal with the online version and/or throw in some extra unknowns and restrictions.
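For reference, the makespan-minimization problem can be specified as an ILP along these lines (a standard textbook-style formulation, not taken from any particular paper):

```
minimize    C
subject to  sum over j of x[i][j]        = 1    for every job i      (each job on exactly one machine)
            sum over i of p[i] * x[i][j] <= C   for every machine j  (no machine runs past the makespan C)
            x[i][j] in {0, 1}

where p[i] is the length of job i and x[i][j] = 1 means job i runs on machine j.
```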
|
|
|
|
|
harold aptroot wrote: everyone seems to use heuristic algorithms because solving an ILP problem is too slow for them
This is exactly what I found. Furthermore, the algorithms created in this area tend to be customized or proprietary.
Job scheduling is a major and vast topic, and solving it with rectangles breaks it down into a smaller problem. I hope to see such algorithms...
|
|
|
|
|
Time for some code!
This code is clearly crap; I just wrote it quickly in a console app to see whether it would work. It computes the optimal solution with a simple branch and bound algorithm (without any of the tricks that could make it faster), and all the cases I tested worked.
For anything above 16 jobs it becomes really, really slow. As a quick guesstimate, the time complexity is about O(m^j), where m is the number of machines and j is the number of jobs.
static int[] Schedule(int[] jobs, int machines)
{
    int[] targets = new int[jobs.Length]; // targets[i] = machine for job i (1-based; 0 = unassigned)
    int[] lengths = new int[machines];    // current total length per machine
    _bestFoundScore = int.MaxValue;
    _bestSolution = null;
    _schedule(jobs, targets, 0, lengths);
    return _bestSolution;
}

static int _bestFoundScore = int.MaxValue;
static int[] _bestSolution = null;

static void _schedule(int[] jobs, int[] targets, int index, int[] lengths)
{
    if (index == jobs.Length)
    {
        // Complete assignment: its score is the longest machine schedule.
        int max = 0;
        for (int j = 0; j < lengths.Length; j++)
            if (lengths[j] > max)
                max = lengths[j];
        if (max >= _bestFoundScore)
            return;
        _bestFoundScore = max;
        int[] res = new int[targets.Length];
        Array.Copy(targets, res, res.Length);
        _bestSolution = res;
    }
    else
    {
        // Try the current job on every machine.
        for (int i = 0; i < lengths.Length; i++)
        {
            targets[index] = i + 1;
            lengths[i] += jobs[index];
            int max = 0;
            for (int j = 0; j < lengths.Length; j++)
                if (lengths[j] > max)
                    max = lengths[j];
            // Bound: only recurse if this partial assignment can still
            // beat the best complete solution found so far.
            if (max < _bestFoundScore)
            {
                _schedule(jobs, targets, index + 1, lengths);
            }
            targets[index] = 0;
            lengths[i] -= jobs[index];
        }
    }
}
Edit: but I don't think using rectangles will help at all; if anything, it will just make the problem harder without any benefit. The problem would still be the same, except you'd have "the width of a rectangle" instead of just "some integer or float", which is really the same thing.
modified on Sunday, April 18, 2010 8:44 AM
|
|
|
|
|
Hey Harold,
Thanks for the code. This piece of code is basically an implementation of nested recursion. Usually, algorithms in this area are designed with shortcuts that let them reach the optimum faster.
harold aptroot wrote: For anything above 16 jobs it becomes really really slow
It will get even slower once you add job types, machine types, job durations, efficiencies, and machine operating times. I wonder what the speed would be then.
The idea of the rectangles is not to complicate the problem but to be able to see it differently.
Consider solving a maze from within the maze versus sitting on top of it: the perspective makes all the difference. Hence the rectangle example. Another benefit is that the schedule can be represented graphically, allowing manual placement of pieces into a box of available resources.
Let's share in case we come across something. I have been trying to find something useful for a long time; when I saw your comment, I got optimistic.
Cheers.
|
|
|
|
|
It has one shortcut: as soon as a partial solution becomes worse than the best solution found so far, it bails out and starts on the next partial solution.
I tried adding a greedy approximation so it could discard more partial solutions early on, before a good complete solution has been found, but on random problem instances with m=3 and j=20 it helped by only slightly more than a third of a percent on average (which is basically nothing).
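The greedy approximation referred to here, putting the biggest remaining job on the least-loaded machine, can be sketched in Python like this (the function name is mine, not harold's):

```python
# Sketch of the greedy LPT ("longest processing time first") rule:
# repeatedly put the biggest remaining job on the least-loaded machine.
def lpt(jobs, machines):
    loads = [0] * machines
    assignment = [[] for _ in range(machines)]
    for job in sorted(jobs, reverse=True):
        i = loads.index(min(loads))    # least-loaded machine so far
        loads[i] += job
        assignment[i].append(job)
    return max(loads), assignment      # (makespan, jobs per machine)

print(lpt([3, 3, 2, 2, 2], 2)[0])  # 7
```

The resulting makespan gives the branch and bound an initial upper bound, so it can prune partial solutions from the very start.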
|
|
|
|
|
Buddy, we both know that nested recursion will ultimately be slow, and it has one inherent problem: scaling. You can't scale it; it is only good for small problems.
|
|
|
|
|
Yea, I know; do you know anything better, though?
By the way, it averages 4.5 seconds (in release mode) for m=3, j=20, with that naive implementation.
|
|
|
|
|
Oh, I totally appreciate the effort; the fact is that you created working code in no time.
However, it is not the algorithm that we are both looking for, because it cannot scale to real-life problems, however fast you make it. With every additional job or machine, the running time multiplies.
I work with a range of around 500+ machines and around 1000 jobs to be allocated, and then there are multiple variations and factors involved in calculating the durations.
Maybe we can collaborate and work on an algorithm. I am very interested in working on it with a set objective and deliverables. If we succeed, it's going to be a huge success.
What say?
Som
|
|
|
|
|
Wow, looks like my question was akin to smacking a wasp nest with a cricket bat. Glad to see it sparked some convo.
So with what Harold and Luc suggested, it seems like I could get away with a simple priority list and just do the fat jobs first. There are no dependencies (no "job x has to be done before job y"), so nothing stops me from doing the jobs in any order.
But I'm not sold that this is the way to achieve the quickest completion time; there may be cases where it doesn't work. So I'll have to look around a little more. Fortunately, the links and ideas given here were a good step toward what I was looking for.
|
|
|
|
|
fishing season ain't over yet.
|
|
|
|
|
Here is a simple example where biggest-job-first does not yield the optimum:
two machines; five jobs: 3, 3, 2, 2, 2. Biggest-first ends up with 3+2+2 on one machine (makespan 7), while 3+3 and 2+2+2 gives a makespan of 6.
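This instance is easy to verify exhaustively in a few lines of Python (my sketch; with 5 jobs and 2 machines there are only 2^5 = 32 possible assignments):

```python
from itertools import product

# Greedy: biggest job first onto the least-loaded machine.
def greedy_makespan(jobs, m):
    loads = [0] * m
    for job in sorted(jobs, reverse=True):
        loads[loads.index(min(loads))] += job
    return max(loads)

# Brute force: try every possible job -> machine assignment.
def optimal_makespan(jobs, m):
    best = sum(jobs)
    for assign in product(range(m), repeat=len(jobs)):
        loads = [0] * m
        for job, machine in zip(jobs, assign):
            loads[machine] += job
        best = min(best, max(loads))
    return best

jobs = [3, 3, 2, 2, 2]
print(greedy_makespan(jobs, 2), optimal_makespan(jobs, 2))  # 7 6
```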
|
|
|
|
|
What will be the typical size of the instances of the problem that you want to solve?
For up to about 20 jobs, solving the problem optimally with branch and bound is still doable.
Otherwise, it seems that in practice it is usual to use a genetic algorithm or simulated annealing; both have a tweakable time/quality trade-off, and without too much trickery you can make it so that the result is guaranteed to be no worse than that of the greedy algorithm.
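As an illustration of the simulated-annealing option, a minimal sketch might look like this (Python; the move set, cooling schedule, and parameters are my arbitrary choices, not a recommendation):

```python
import math
import random

# Makespan of a job -> machine assignment (assign[i] is job i's machine).
def makespan(assign, jobs, m):
    loads = [0] * m
    for job, machine in zip(jobs, assign):
        loads[machine] += job
    return max(loads)

# Simulated annealing: a move reassigns one random job; worse moves are
# accepted with probability exp(-delta / temperature), which shrinks to 0.
def anneal(jobs, m, steps=20000, t0=10.0, seed=0):
    rng = random.Random(seed)
    assign = [rng.randrange(m) for _ in jobs]
    cur = best = makespan(assign, jobs, m)
    best_assign = assign[:]
    for step in range(steps):
        t = t0 * (1 - step / steps) + 1e-9    # linear cooling
        i, new_machine = rng.randrange(len(jobs)), rng.randrange(m)
        old_machine = assign[i]
        assign[i] = new_machine
        cand = makespan(assign, jobs, m)
        if cand <= cur or rng.random() < math.exp((cur - cand) / t):
            cur = cand
            if cur < best:
                best, best_assign = cur, assign[:]
        else:
            assign[i] = old_machine            # undo the move
    return best, best_assign
```

Keeping the best-so-far assignment separately means the result can also be compared against (or seeded from) the greedy solution, which is one way to get the "no worse than greedy" guarantee.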
|
|
|
|
|
Eh... maybe 50 jobs. And the number of jobs keeps on growing too...
|
|
|
|
|
IMO, the more jobs there are, the closer "biggest job first" gets to the optimal solution.
|
|
|
|
|
Luc Pattyn wrote: IMO the more jobs the more "biggest job first" gets nearer to the optimal solution
I vote for "pays most".
The wonderful thing about the Darwin Awards is that everyone wins, especially the members of the audience.
|
|
|
|
|
OK, with the code I gave earlier, that would take more than a million years.
With a larger number of jobs the greedy algorithm comes closer to the optimal solution, and there is a semi-greedy algorithm that is always* at least as good as the simple greedy algorithm, but usually better:
1) Let s be the sum of all job lengths and let bound be (s+m-1)/m, using integer division (i.e. s/m rounded up), where m is the number of machines.
2) For each job, in descending order of length, assign it to the first machine where it would not push that machine's total past bound; if there is no such machine, mark the job as "not scheduled".
3) If no jobs are marked "not scheduled", return the schedule.
4) Otherwise, take the length (mlength) of the schedule of the machine with the shortest schedule, and the length (jlength) of the smallest "not scheduled" job.
5) Change bound to bound+(jlength-(bound-mlength)); in other words, make the bound just large enough that the smallest unscheduled job would fit on the shortest schedule.
6) Unmark all jobs and go back to step 2.
It is, of course, slower than the usual greedy approach, but (usually, at least) very fast compared to the optimal approach.
It always terminates, because jlength-(bound-mlength) cannot be zero or negative: if it were, that job would have fit on the shortest machine in step 2, so it wouldn't be a "not scheduled" job. The bound therefore keeps growing until everything fits (potentially in a sub-optimal way).
* I have no proof, but after a large number of random tests I still had no disproof. You could always run the usual greedy algorithm as well and take the better result; it takes essentially no time anyway.
edit: I didn't make this up myself, btw.
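The steps above translate to Python along these lines (my sketch of the description, not harold's code):

```python
import math

# Bound-growing first-fit-decreasing heuristic, following steps 1-6 above.
def semi_greedy(jobs, m):
    jobs = sorted(jobs, reverse=True)
    bound = math.ceil(sum(jobs) / m)           # step 1: (s + m - 1) // m
    while True:
        loads = [0] * m
        assignment = [[] for _ in range(m)]
        unscheduled = []
        for job in jobs:                       # step 2: first fit, decreasing
            for i in range(m):
                if loads[i] + job <= bound:
                    loads[i] += job
                    assignment[i].append(job)
                    break
            else:
                unscheduled.append(job)
        if not unscheduled:                    # step 3: everything fits
            return max(loads), assignment
        # steps 4-5: grow the bound so the smallest unscheduled job
        # would fit on the currently shortest schedule, then retry.
        bound = min(loads) + min(unscheduled)

print(semi_greedy([3, 3, 2, 2, 2], 2)[0])  # 6
```

On Luc's example it finds the optimum of 6 on the first pass, where plain biggest-job-first ends at 7.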
modified on Monday, April 19, 2010 7:26 PM
|
|
|
|
|
An aerosol of solvent is my first choice for a wasps' nest.
No smoking.
No electricity.
Etc.
Tadeusz Westawic
An ounce of Clever is worth a pound of Experience.
|
|
|
|
|
Hello All,
I was just stopping by to make a rare post and poked into this thread.
In my mind, I think of these kinds of problems as "least energy state" problems. These very often correlate to conditions in analog settings, and by thinking one's way back and forth through analog-digital conversions, some unnoticed feature of the problem often presents itself. Your analog thought-picture of blocks in a tall wall is an example.
I imagine myself with a large appliance box and spheres of varying diameter representing the jobs. With the spheres in the box, I shake it, stop, note the arrangement of the spheres, and repeat.
Anyone else with a thought picture?
Tadeusz Westawic
An ounce of Clever is worth a pound of Experience.
|
|
|
|
|
Need C# codez urgentzzzzz!!!!!
Not really, I have code. However, I am after some advice: under embedded .NET, is this better implemented as event-driven code or in a more linear fashion with the dreaded Application.DoEvents()? An event-driven implementation looks like a bit of a brain-f*** IMO...
|
|
|
|
|
|
I have this 3rd-party software that spits out a list of numbers formatted as text. They are integers with varying numbers of digits, and I usually get around 10,000 numbers in one string.
I'm turning all these numbers into an array of integers in C++, but it's taking over 200 milliseconds.
I was told that LabView can do the same thing in less than a millisecond.
Then a kind CP patron ran an experiment in C# and told me it takes only up to 20 milliseconds.
I'm using CString::Find() to look for the space that separates each number in the string, CString::Mid() to get a piece of the string per number, and finally atoi() to turn it into an integer. I know CString is slow, but I'm a bit surprised at the difference in execution time.
What is LabView doing that is so wonderful?
How can I improve my C++ code? Someone said something about not using Mid() to create a new string each time but passing a section of the char* directly to atoi(). I'm not sure how to do that, except by doing something like memcpy, in which case I might as well create a new CString as I do now.
Any ideas appreciated!
Almost, but not quite, entirely unlike... me...
|
|
|
|
|
One major observation:
- Get a profiler, and profile your code.
One minor observation:
- it is likely the repeated allocation and freeing of memory in CString::Mid that is killing your performance.
Two options:
- modify the string in place, replacing the spaces with '\0's, and use atoi() to convert the values.
- parse the digits by hand. Something like this (ignoring negative numbers and assuming single spaces between numbers):

    int n = 0;
    for (const char* p = input; ; ++p) {
        if (*p >= '0' && *p <= '9')
            n = n * 10 + (*p - '0');   // accumulate digits of the current number
        else {                         // space or '\0': store the number
            results.push_back(n);
            n = 0;
            if (*p == '\0') break;
        }
    }

Also:
- pre-size the resulting list to roughly (input string length / estimated digits per number), if you happen to know the string length up front.
|
|
|
|